Manufacturing Expert - Quality Evaluator

Remote, USA Full-time Posted 2026-06-13

• *About The Job

*Mercor

connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include

*Benchmark**

,

*General Catalyst**

,

*Peter Thiel**

,

*Adam D'Angelo**

,

*Larry Summers**

, and

*Jack Dorsey**

.

*Position:**

AI Model Evaluation Specialist

*Type:
*Contract
Compensation:
$25–$35/hour
*Commitment:
*20 hours/week
*Role Responsibilities
Write realistic prompts reflecting professional and consumer domain-specific guidance.
Evaluate AI-generated responses for factual accuracy and practical usefulness.
Identify fabricated claims and misleading reasoning in model outputs.
Score and rank model responses using structured rubrics.
Provide written justifications with specific evidence for evaluations.
*Qualifications
*Must-Have
Professional experience applying domain expertise in a practitioner or advisory capacity.
Familiarity with industry-specific standards, regulations, or clinical guidelines.
Strong written communication and critical reasoning skills.
*Application Process (Takes 20–30 mins to complete)
Submit your resume to begin.
Complete the Model Response Evaluation assessment.
*Resources & Support**

• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: [email protected]
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*

, Apply tot his job Apply To this Job

Similar Jobs

Senior Product Owner, IaaS (Remote)

Remote, USA Full-time

Staff Product Owner (Oracle Retail)

Remote, USA Full-time

Educational Technology AI Rater & Evaluator

Remote, USA Full-time

Vocational Evaluator

Remote, USA Full-time

AI Decision & Response Analyst

Remote, USA Full-time

NURSE EVALUATOR III, HEALTH SERVICES

Remote, USA Full-time

Finance Model Prompt Evaluator

Remote, USA Full-time

AI Quality Evaluator (Polish)

Remote, USA Full-time

Healthcare Research Evaluator (STEM) | $30/hr Remote

Remote, USA Full-time

Generative AI Evaluator (Russian) | $15/hr Remote

Remote, USA Full-time

Sales Recruiter Customer Excellence- (West Coast, Remote Eligible)

Remote, USA Full-time

Remote Certified Medical Assistant- Bilingual Spanish

Remote, USA Full-time

Global LMS Help and Support Lead (Remote)

Remote, USA Full-time

Lead Azure Data Engineer Remote

Remote, USA Full-time

Experienced Customer Service Representative – Delivering Exceptional Experiences at arenaflex

Remote, USA Full-time

Experienced Part-Time Data Entry Specialist – Remote Opportunity at arenaflex

Remote, USA Full-time

Remote Cardiologist - AI Trainer ($300-$400 per hour)

Remote, USA Full-time

Experienced Sr Director, Customer and Marketing Data, Applied AI, and Analytics – Visionary Leader for arenaflex's Digital Transformation

Remote, USA Full-time

Telecommunications Project Lead - 100% REMOTE

Remote, USA Full-time

Experienced Part-Time Remote Customer Service Representative – Entertainment Industry

Remote, USA Full-time