Our researchers are brilliant. Our infrastructure is not. Training runs die halfway through, experiment results get lost, and deployments happen over Slack messages. We need an AI infrastructure engineer to fix this — not as a support function, but as a core part of how we do research. If you enjoy bringing order to technical chaos and want to work alongside serious ML scientists, this is the right place.
Responsibilities
Build and maintain GPU training infrastructure on AWS
Set up and improve experiment tracking with MLflow
Design model versioning and registry workflows
Build CI/CD pipelines for model deployment
Monitor system health and resolve infrastructure incidents
Requirements
Experience building ML infrastructure (training pipelines, model registries, serving)
Strong Python skills and familiarity with Docker and Kubernetes
Hands-on experience with at least one ML experiment tracking tool (MLflow, W&B, Neptune)
Cloud infrastructure experience (AWS, GCP, or Azure)
Ability to work closely with research scientists and translate their needs into systems