Benchmark Dataset Data Specialist is a remote evaluation track for reviewing benchmark dataset evaluation prompts and responses against AuraOne's quality rubric.
Aligned to AuraOne specialist routing.
Remote-first specialist work, paid per accepted task.
Remote — US-eligible
Reviewers compare paired outputs, label edge cases, and write the kind of structured feedback the modeling team can use to retrain.
AI data reviewers help turn benchmark dataset evaluation outputs into auditable labels, rationales, and regression cases for AuraOne Human Data.
Design, audit, and improve structured data, label taxonomies, knowledge graphs, and evaluation datasets.
Hourly rate confirmed after the interview process.
Expected schedule: contractor, remote specialist work with program-defined task volume and review pacing.
AuraOne uses a shared specialist intake to confirm track fit, review readiness, and the best queue for your profile. Applications submitted from partner job boards carry the source, role, and category on the apply URL.
Submit your specialist intake once. AuraOne routes you to the track that matches your work and reviews your file.