We are the company behind an open source ML experiment tracking and model registry tool with 9,200 GitHub stars and a contributor community of 340 people across 41 countries. Three years ago we were a side project. Two years ago we incorporated. Last year we launched our managed cloud offering and crossed $2M ARR. Our business model is open core: the library stays free and MIT-licensed forever, and we sell managed hosting and enterprise features to organisations that want to run it at scale.

The engineering culture here is shaped by the open source project. We write in public. We discuss architectural decisions in GitHub issues before we make them. We merge pull requests from contributors we've never met based on code quality and rationale alone. If that sounds uncomfortable, this is probably not the right place. If it sounds like the way software should be built, you'll feel at home immediately.

We're looking for an ML Platform Engineer to work on the infrastructure that powers both the open source project and the managed cloud service. The work spans Kubernetes, CI/CD, observability, and the specific infrastructure challenges of a tool that needs to work identically whether a user runs it on a laptop or at Fortune 500 scale.
Responsibilities
Own and improve the Kubernetes infrastructure powering our managed cloud service on GCP
Design and implement CI/CD pipelines for the open source library and the managed product
Build observability into the platform: metrics, tracing, alerting, and on-call runbooks
Contribute to the open source SDK — your platform knowledge should improve the developer experience of the library itself
Review infrastructure contributions from community members and provide constructive technical feedback
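To give a concrete flavour of the CI/CD work: a release pipeline for the open source library might look something like the sketch below. This is an illustration only — the workflow, repo, and secret names are placeholders, not our actual setup.

```yaml
# Hypothetical release workflow: build and publish the library on a version tag.
name: release
on:
  push:
    tags:
      - "v*"
jobs:
  build-and-publish:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.12"
      - name: Build sdist and wheel
        run: |
          python -m pip install build
          python -m build
      - name: Publish to PyPI
        uses: pypa/gh-action-pypi-publish@release/v1
        with:
          password: ${{ secrets.PYPI_API_TOKEN }}
```

In practice you would own the versioning scheme, the tag conventions, and the automation around changelogs and release notes — this fragment only shows the shape of the publish step.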
Requirements
3–5 years of platform, infrastructure, or MLOps engineering experience
Kubernetes — you design multi-tenant workloads, write Helm charts, and debug networking issues without the docs open
Docker — images, multi-stage builds, registry management, and security scanning
GCP — GKE, Cloud Run, Cloud Storage, and IAM — you've run production workloads, not just development environments
CI/CD pipeline design and ownership — GitHub Actions at a minimum, plus experience with release automation and versioning
Python for infrastructure tooling, automation scripts, and SDK contributions
A genuine history of open source participation — contributed code, filed thoughtful issues, or maintained a project
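For the Docker requirement, "multi-stage builds" means the pattern sketched below: build dependencies in one stage, ship a minimal runtime image from another. Names here are illustrative placeholders, not our actual images.

```dockerfile
# Hypothetical multi-stage build for a Python service.

# --- Build stage: wheels are compiled where build tooling is available ---
FROM python:3.12-slim AS builder
WORKDIR /app
COPY requirements.txt .
RUN pip wheel --no-cache-dir --wheel-dir /wheels -r requirements.txt

# --- Runtime stage: only the built wheels are copied, no compilers ship ---
FROM python:3.12-slim
WORKDIR /app
COPY --from=builder /wheels /wheels
RUN pip install --no-cache-dir /wheels/* && rm -rf /wheels
COPY . .
USER 1000
CMD ["python", "-m", "myservice"]
```

The payoff is a smaller attack surface and faster pulls, which is also what the security-scanning part of the requirement is about.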
Benefits
Your work is public — the infrastructure decisions you make are visible to thousands of ML engineers
Full remote, async-first — most of our community is in time zones we'll never share anyway