AI Trainer — RLHF & Human Feedback Collection

Anya Petrov

Freelance · Mid-level

About the role

Unusual role — we'll explain what it actually is. We're running RLHF on a language model and we need human feedback collectors who are genuinely good at thinking critically about text quality. You'll read model outputs side by side, rank them, explain your reasoning, write better versions when both options are bad, and flag things that are subtly wrong in ways that matter. This is not busy work. Bad feedback trains a worse model. We need people who care about the quality of what they produce. No ML background required, but you need to be a strong writer who can articulate why one response is better than another — not just "this one sounds better." Domains covered: coding, writing, factual QA, reasoning. You don't need to be an expert in all of them.

Contract Type

Hourly rate

Level

Mid-level

Budget Range

$30 – $50 / hour

Duration

2–3 months

AI Expertise

AI & Machine Learning Engineers NLP & Prompt Engineering

Ready to apply for this role?

Create a free talent account in under 2 minutes.

Apply to verified AI companies
Get AI-matched job recommendations
Message hiring managers directly
Build your public AI talent profile

Create free account & apply Log in

AI Trainer — RLHF & Human Feedback Collection

Apply for AI Trainer — RLHF & Human Feedback Collection

About the role

AI Expertise

Apply for
AI Trainer — RLHF & Human Feedback Collection