
Generalist Evaluator Expert

Reddit r/remotejs

HIRING: Remote Generalist Evaluator Expert

Hourly Contract, Remote

Compensation: $35 - $40 per hour

About the Role:

Mercor is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evaluate advanced language models. This is a short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted text.

Apply Here:

https://work.mercor.com/jobs/list_AAABmVWUijSELBRTIP5ADKXs?referralCode=cfcd3857-1bb2-44d9-a07e-7a7ca2437c7c&utm_source=referral&utm_medium=share&utm_campaign=job_referral

Responsibilities:

  • Design and Optimize Prompts: Create detailed prompts with multiple constraints and instructions.
  • Define and Document Evaluation Standards: Establish high-level expectations for correct responses in general consumer contexts, and develop comprehensive rubrics.
  • Conduct Model Testing and Grading: Run prompts through models and assess preliminary outputs against expectations.
  • Support Benchmarking and Quality Assurance: Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required standard of rigor, maintaining consistency and reliability before integration into official benchmarks.

Minimum Qualifications:

  • BS or BA from a reputable institution, completed or in progress.
  • Strong writing and critical thinking skills.
  • Ability to work independently and meet deadlines.
  • Significant familiarity with using ChatGPT or similar tools for personal decision-making, hobbies, or general interests.

Preferred Qualifications:

  • Experience in teaching or research.

Application & Onboarding Process:

  • Complete an AI-led interview (approximately 15 minutes).
  • Complete a 45-minute written assessment that will guide you through writing rubrics.
  • If selected, you will be invited to work on the project.

More Details About This Role:

  • This is a remote and asynchronous role — work on your own schedule.
  • Expect to contribute at least 20 hours per week.
  • Expect a commitment of around 1 month.
  • You’ll be working in a structured project environment with clear goals and tools.
