Manufacturing Expert - Quality Evaluator

Remote · USA Full-time New today

• *About The Job

*Mercor

connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include

*Benchmark**

,

*General Catalyst**

,

*Peter Thiel**

,

*Adam D'Angelo**

,

*Larry Summers**

, and

*Jack Dorsey**

.

*Position:**

AI Model Evaluation Specialist

*Type:
*Contract
Compensation:
$25–$35/hour
*Commitment:
*20 hours/week
*Role Responsibilities
Write realistic prompts reflecting professional and consumer domain-specific guidance.
Evaluate AI-generated responses for factual accuracy and practical usefulness.
Identify fabricated claims and misleading reasoning in model outputs.
Score and rank model responses using structured rubrics.
Provide written justifications with specific evidence for evaluations.
*Qualifications
*Must-Have
Professional experience applying domain expertise in a practitioner or advisory capacity.
Familiarity with industry-specific standards, regulations, or clinical guidelines.
Strong written communication and critical reasoning skills.
*Application Process (Takes 20–30 mins to complete)
Submit your resume to begin.
Complete the Model Response Evaluation assessment.
*Resources & Support**

• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: [email protected]
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*

, Apply tot his job Apply To this Job

Related roles

Senior Product Owner, IaaS (Remote)

Remote · USA Full-time

Staff Product Owner (Oracle Retail)

Remote · USA Full-time

Educational Technology AI Rater & Evaluator

Remote · USA Full-time

Vocational Evaluator

Remote · USA Full-time

AI Decision & Response Analyst

Remote · USA Full-time

NURSE EVALUATOR III, HEALTH SERVICES

Remote · USA Full-time

Finance Model Prompt Evaluator

Remote · USA Full-time

AI Quality Evaluator (Polish)

Remote · USA Full-time

Healthcare Research Evaluator (STEM) | $30/hr Remote

Remote · USA Full-time

Generative AI Evaluator (Russian) | $15/hr Remote

Remote · USA Full-time

Experienced Customer Service Representative - Work from Anywhere Remote Opportunity with arenaflex

Remote · USA Full-time

Bilingual Spanish‑English Remote Customer Service Representative – Full‑Time Home‑Based Support Specialist at arenaflex

Remote · USA Full-time

TSA On Call Data Collector

Remote · USA Full-time

Experienced Customer Service Agent – Part-Time Data Entry Position at arenaflex

Remote · USA Full-time

Electrical Journeyman

Remote · USA Full-time

LATAM - Appointment Setter (Mexico Listed for Platform Purposes)

Remote · USA Full-time

Remote CRM Coordinator - Singapore | No Degree Needed

Remote · USA Full-time

Motion Designer / Video Editor (Contract)

Remote · USA Full-time

Convocatoria 2026 de Contratos Postdoctorales de Investigación del Instituto de Transferencia e Investigación

Remote · USA Full-time

Remote Research & Data Analyst - Work From Home DE

Remote · USA Full-time