Software Engineer (Data Infrastructure, Aarhus)

Aarhus, Central Denmark
Posted 1 month, 1 week ago
Software Development

About the role

Job summary

This role focuses on data collection to support model training operations within an AI team. The position involves building high-quality datasets at a large scale through a combination of engineering and research efforts.

Qualifications

  • BS/MS/PhD in Computer Science or a related field.
  • Over 5 years of experience in software development.
  • Proficient in bash/Python scripting within Linux environments.
  • Experienced with Docker and Infrastructure-as-Code, particularly with a major Cloud Provider (GCP preferred).
  • Familiarity with web crawlers and large-scale data processing workflows is advantageous.
  • Strong communication skills, both written and verbal.

Responsibilities

  • Identify and source new audio data for the ingestion pipeline.
  • Manage and enhance the cloud infrastructure for the ingestion pipeline using GCP and Terraform.
  • Collaborate with scientists to optimize data cost, throughput, and quality for model training.
  • Work with the AI Team and leadership to develop the dataset roadmap for future products.

Skills

  • Proficiency in cloud infrastructure management.
  • Strong problem-solving abilities and adaptability to changing priorities.

Education

  • Relevant degree in Computer Science or a related discipline.

Tools

  • GCP, Terraform, Docker, bash, Python.
Full Access

Ready to apply for this role?

Full Access gives you the company name, full job description, and a direct link to apply. The summary above helps you explore the role.

Share this job