AI Model Evaluator

Reddit r/forhire

AI Model Evaluators - Remote

Source: reddit-r-forhire

Company: micro1

About the Role

micro1 is hiring AI Model Evaluators (LLM & Agent Systems) for a remote, part-time contract.

Type: Contract

Location: Remote

Openings: 7

Compensation: $20 - $30 per hour

Commitment: 20+ hours/week

Responsibilities

  • Evaluate LLM and autonomous agent outputs using structured rubrics
  • Review multi-step reasoning traces and screenshots
  • Identify failure modes and recurring patterns
  • Provide clear, actionable feedback to improve AI systems
  • Participate in calibration sessions for scoring alignment

Requirements

  • Experience with AI evaluation, QA/testing, or benchmarking
  • Strong attention to detail and rubric-based scoring skills
  • B2+ English proficiency
  • Comfortable working independently in a remote setting

Preferred Skills

  • Experience with RLHF
  • Experience with annotation workflows
  • Familiarity with agent systems
  • Background in digital product evaluation

Ideal for professionals interested in AI quality assessment, benchmarking, and improving next-generation generative AI systems.

Disclosure: I’m sharing this as an independent member of the micro1 referral program.
