AI Coding Agent Evaluator at Mercor

Reddit r/forhire

Coding Sprint [Hiring]

This is a sprint-based project. Tasks are typically released in 12–24 hour windows and contributors are onboarded on a first-come, first-served basis. Interested candidates are encouraged to apply as soon as possible.

Mercor is currently hiring experienced engineers to evaluate frontier AI coding agents in partnership with a leading AI research lab.

Compensation

  • About $400 per accepted task (estimated $70–$120/hr)

Responsibilities

The work involves:

  • Reviewing AI-generated code.
  • Identifying bugs and edge cases.
  • Comparing outputs from different frontier models.
  • Evaluating engineering quality in realistic technical scenarios.

Key Details

  • Remote and flexible work environment.
  • Compensation: ~$400 per accepted task.
  • Time Commitment: Most tasks take approximately 1–3 hours after ramp-up.
  • Task Limits: No fixed task limits.
  • Preferred Experience: Experience with AI coding tools such as Cursor, Claude Code, Codex, Windsurf, Gemini CLI, or similar is highly valued.

Application Process

The application process typically consists of two short interviews or assessments. If selected, you will receive onboarding instructions and sprint schedules in advance, so you can plan your availability before new task windows open.

If you decide to apply, please let the recruiter know. They may be able to help track your application status and provide updates when available.

Contact

Bartlomiej Lukasiewicz
LinkedIn Profile
