We are seeking an experienced Machine Learning Data Infrastructure Engineer to join our dynamic team. This role involves designing, building, and maintaining robust data infrastructure, pipelines, and storage systems crucial for our Machine Learning (ML) initiatives. If you are passionate about leveraging cutting-edge technologies to empower data-driven decision-making and have a strong background in data engineering and cloud platforms, we encourage you to apply.
Responsibilities
- Design and Build Data Infrastructure. Architect scalable and efficient data infrastructure solutions to support ML data processing and analytics.
- Develop Data Pipelines. Create and maintain reliable data pipelines for ingesting, transforming, and storing large volumes of data from diverse sources.
- Manage Data Storage Systems. Implement and optimize data storage solutions, ensuring high performance and reliability.
- Programming Expertise. Utilize Python, Java, or Scala for developing data processing algorithms and optimizing data workflows.
- Utilize Messaging Systems. Experience with Kafka and Confluent Kafka for real-time data streaming and event-driven architectures.
- Cloud Platform Integration. Work proficiently with Azure and GCP cloud platforms, leveraging their services for scalable data solutions.
- Database Management. Hands-on experience with managing databases such as Postgres and BigQuery, ensuring data integrity and performance.
- Infrastructure as Code. Implement infrastructure as code using Terraform to automate provisioning and deployment processes.
- Support Data Science Teams. Collaborate with data science teams, setting up and managing data within Databricks to facilitate AI model development and deployment.
Requirements
- Education. Bachelor’s degree in Computer Science, Engineering, or related field (Master’s preferred).
- Experience. Minimum 5 years of experience in designing and implementing data infrastructure and pipelines, preferably in a machine learning or data-intensive environment.
Technical Skills
- Proficiency in Python, Java, or Scala.
- Strong understanding of Kafka and Confluent Kafka.
- Hands-on experience with Azure and GCP cloud platforms.
- Solid knowledge of database systems like Postgres and BigQuery.
- Familiarity with Terraform for infrastructure automation.
- Experience with Databricks or similar platforms for AI model development.
Soft Skills
- Strong analytical and problem-solving abilities.
- Excellent communication and teamwork skills.
- Ability to work effectively in a fast-paced, collaborative environment.
Why Join Us?
- Opportunity to work at the forefront of machine learning and data infrastructure.
- Collaborative environment that encourages innovation and professional growth.
- Competitive compensation package and benefits.
How to Apply
If you are excited about transforming data into actionable insights and meet the qualifications above, please submit your resume and cover letter detailing your relevant experience and why you would be a great fit for this role.