Refusal Preference Reward Model Evaluator is a remote evaluation track for reviewing refusal preference reward model evaluation prompts and responses against AuraOne's quality rubric.
Aligned to the AuraOne specialist routing.
Remote-first specialist work, paid per accepted task.
Remote — US-eligible
Refusal Preference Reward Model Evaluator is a remote evaluation track for reviewing refusal preference reward model evaluation prompts and responses against AuraOne's quality rubric. Reviewers compare paired outputs, label edge cases, and write the kind of structured feedback the modeling team can use to retrain.
AI data reviewers help turn refusal preference reward model evaluation outputs into auditable labels, rationales, and regression cases for AuraOne Human Data.
Produce preference rankings, reward-model feedback, and calibrated human judgment for post-training pipelines.
Hourly rate confirmed after the interview process.
Expected schedule: contractor, remote specialist work with program-defined task volume and review pacing.
AuraOne uses a shared specialist intake to confirm track fit, review readiness, and the best queue for your profile. Applications submitted from partner job boards carry the source, role, and category on the apply URL.
Submit your specialist intake once. AuraOne routes you to the track that matches your work and reviews your file.