RLHF Pay Rates: What Companies Pay for Human Feedback in 2026
RLHF Pay Rates: What Companies Pay for Human Feedback in 2026
Reinforcement Learning from Human Feedback (RLHF) remains the backbone of how AI models are trained to be helpful, harmless, and accurate. As the industry matures, RLHF pay rates have stratified significantly — with general raters earning $15-30/hr while expert evaluators earn $80-200+/hr. Here's the complete breakdown of what different RLHF roles pay across major platforms in 2026.
RLHF Pay Rate Overview
The single biggest factor in your RLHF rate isn't the platform you choose — it's the type of RLHF work you qualify for. The market has split into distinct tiers.
| RLHF Tier | Pay Range | Requirements | Task Complexity |
|---|---|---|---|
| General Rating | $15-30/hr | Basic literacy, attention to detail | Simple comparisons, binary choices |
| Skilled Rating | $25-50/hr | College education, strong writing | Detailed comparisons with rationales |
| Expert Rating | $50-100/hr | Professional expertise, domain knowledge | Domain-specific evaluation, quality writing |
| Specialist Rating | $80-200+/hr | Advanced degree, proven expertise | Complex reasoning, expert generation |
Pay Rates by Platform
Mercor
| RLHF Role | Pay Range | Availability |
|---|---|---|
| General RLHF Trainer | $25-50/hr | High |
| Code RLHF (SWE) | $60-200/hr | High |
| Medical Expert RLHF | $100-250/hr | Moderate |
| Legal Expert RLHF | $80-200/hr | Moderate |
| Creative/Writing RLHF | $25-60/hr | Moderate |
Mercor has the widest range of RLHF rates, reflecting their breadth of projects. Their AI matching system effectively routes specialists to high-paying domain tasks. Read our Mercor hiring guide for application tips.
Braintrust
| RLHF Role | Pay Range | Availability |
|---|---|---|
| Expert Evaluation | $70-150/hr | Moderate |
| Code Review RLHF | $80-200/hr | Moderate |
| Domain Expert RLHF | $80-200/hr | Low-Moderate |
Braintrust doesn't offer entry-level RLHF work — they focus on expert-tier tasks. The zero platform fee structure means these rates represent your actual take-home (minus taxes). If you qualify, Braintrust typically pays the highest effective rates.
Scale AI / Outlier
| RLHF Role | Pay Range | Availability |
|---|---|---|
| General Rating | $15-30/hr | Very high |
| Skilled Rating | $25-50/hr | High |
| Expert Rating | $40-80/hr | Moderate |
| Code Expert | $40-100/hr | High |
Scale AI (and their Outlier AI subsidiary) offers the highest volume of RLHF work. Rates are lower than Mercor or Braintrust, but task availability is nearly unlimited for qualified workers. This makes Scale AI the best option for full-time RLHF work where consistent hours matter more than peak rates.
Other Platforms
| Platform | RLHF Pay Range | Notes |
|---|---|---|
| Appen | $15-40/hr | Good for beginners, consistent work |
| Remotasks | $10-35/hr | High volume, variable quality |
| Toloka | $10-30/hr | Micro-task focused |
| Prolific | $15-40/hr | Research-oriented, academic studies |
Platform Stacking
The most profitable RLHF strategy is working across multiple platforms: use Mercor or Braintrust for premium expert tasks, and Scale AI for filling available hours with consistent work. See our multi-platform guide.
Pay Rates by Domain Expertise
Your background determines which RLHF tier you can access. Here's what different professionals earn:
STEM Professionals
- Software engineers: $50-200/hr (code-focused RLHF)
- Math/physics PhDs: $80-200/hr (reasoning evaluation)
- Data scientists: $50-120/hr (technical evaluation)
- Biologists/chemists: $60-150/hr (scientific accuracy)
Healthcare and Legal
- Physicians: $100-250/hr (clinical reasoning)
- Pharmacists: $70-150/hr (drug information)
- Attorneys: $80-200/hr (legal analysis)
- Nurses: $40-80/hr (patient care scenarios)
Business and Finance
- Financial analysts: $50-150/hr (financial reasoning)
- Accountants: $40-100/hr (tax and compliance)
- Consultants: $50-120/hr (business strategy)
Creative and Language
- Professional writers: $30-70/hr (content quality)
- Editors: $25-60/hr (grammar, tone, clarity)
- Translators: $25-60/hr (multilingual RLHF)
- Linguists: $30-70/hr (language quality)
What Determines Your RLHF Rate
Assessment Performance
Your initial platform assessment is the primary rate-setter. A mediocre assessment score on Mercor might place you at $30/hr, while an excellent score on the same platform gets you $60/hr for the same task type. Preparation is the highest-ROI activity for increasing your RLHF earnings.
Quality Scores
Every platform tracks your work quality. Scores above 95% unlock premium project tiers on most platforms. Scores below 85% restrict you to basic tasks and may eventually lead to deactivation. The difference between 90% and 97% quality can represent $20-40/hr in rate differences.
Domain Scarcity
Simple supply and demand. There are thousands of general RLHF raters and relatively few board-certified cardiologists willing to do contract AI work. If your expertise is rare and in demand, your rate reflects that scarcity.
Hours and Reliability
Contractors who maintain consistent availability (15+ hrs/week) and high reliability (completing accepted tasks on time) often receive preferential rates and project access. Sporadic workers get lower-priority matching.
RLHF Pay Trends: 2024 vs 2026
The RLHF market has shifted notably over the past two years:
- General rater pay has compressed. Entry-level rates dropped from $20-35/hr to $15-30/hr as the supply of general raters grew.
- Expert rates have increased. Specialist RLHF rates climbed from $60-150/hr to $80-200+/hr as companies invest more in model quality.
- The middle is thinning. The $30-60/hr skilled rater tier has fewer opportunities as platforms polarize between high-volume basic work and premium expert work.
- Domain specificity pays more. The premium for specialized knowledge has grown from 2x to 3-4x the general rate.
The trend is clear: investing in domain expertise and quality scores is the path to earning growth in RLHF.
Rate Compression Warning
If you're earning $20-35/hr doing general RLHF, your rate is at risk of further compression. The best defense is moving into specialized evaluation where your unique knowledge creates value that can't be easily replaced. See how to negotiate higher pay.
How to Maximize RLHF Earnings
- Specialize. General rating pays the least. Find the RLHF niche that matches your strongest expertise.
- Score high on assessments. Prepare thoroughly — your assessment score sets your rate floor.
- Maintain 95%+ quality. This is the threshold that unlocks premium tasks on most platforms.
- Work across platforms. Use 2-3 platforms to maximize available hours and rate-shop for each task type.
- Build speed without sacrificing quality. Experienced raters complete tasks 2-3x faster than beginners at the same quality level, effectively doubling their hourly rate.
- Invest in domain credentials. Certifications, publications, and verifiable expertise translate directly to higher rates.
Browse RLHF positions or read our complete RLHF training guide to deepen your understanding of the field.