DEV Community

Pink_Developer
Pink_Developer

Posted on

AI Generalist Evaluator Remote Job $35-$40/hr

Mercor is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evaluate advanced language models. This is a short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted text.

🟣Job Details:
Design and Optimize Prompts: Create detailed prompts with multiple constraints and instructions.

Define and Document Evaluation Standards: Establish high-level expectations for correct responses in general consumer contexts, and develop comprehensive rubric.

Conduct Model Testing and Grading: Run prompts through models and assess preliminary outputs against expectations.

Support Benchmarking and Quality Assurance: Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor, maintaining consistency and reliability before integration into official benchmarks.

🟣 Apply Now

Top comments (0)