Internal demo · Annotation surfaces

Annotation, in preview.

A read-only walkthrough of the shipped annotation surfaces — offline-durability studio, pixel-mask with SAM2 assist, video action timelines, voice comments, structured bulk-edit, calibration-aware cuboids, Quality Hub, the export gate, and adjudication auto-escalation. Every count on this page reads from the platform’s live metrics feed.

IAA median · 30d
0.84
reopen rate · 30d
1.8%
exports passing gate
26/30
queue depth
14

Studio durability

Offline queue. Reconnect replay.

When a connection drops, the editor queues every operation to local storage and replays them in order the moment the socket is back. No lost work, no duplicate labels.

  1. 00:00

    Connected

    Editor online · autosave to server every 10s.

  2. 00:14

    Connection drops

    Network partition detected · work queues to IndexedDB.

  3. 00:41

    Offline durability

    Operator keeps labeling · 37 ops queued locally.

  4. 01:03

    Reconnect + replay

    Queue drains in order · CRDT resolves overlaps.

Durability is backed by IndexedDB + a CRDT replay log on the Studio client.

Image · pixel-mask + SAM2 assist

Mask, correct, confirm.

SAM2 proposes a pixel-mask. The annotator accepts, corrects, or redraws — every stroke stays attached to the record.

Tool palette

  • Polygon
  • Brush
  • SAM2 assist
  • Eraser

SAM2 proposals come from the separately-deployed inference server; the editor never blocks while proposals compute.

Video · action recognition

Tracks, per action, per record.

Three tracks. Each segment is a labeled action in the clip. Scrubbing updates the spans — the review queue sees the same timeline the annotator drew.

clip-rbt-008 · 01:48fps 30 · key 15s
grasp2 spans
transfer2 spans
release2 spans

Audio · threaded voice comments

Review by voice. Stays on the record.

Reviewers leave voice notes directly on a segment. The thread, the waveform, and the transcript all stay on the same task record instead of scattering across chat tools.

  • Lead reviewer0:14

    Missed the silence at 00:04 — re-run diarization.

  • Annotator0:21

    Agreed. Updated boundary + attached transcript.

  • QA0:09

    Looks clean now. Approving the segment.

Structured · bulk edit + taxonomy

Tabular work, schema-aware.

Apply a taxonomy change across a selection. The schema editor and the bulk editor share the same source of truth — no downstream re-imports.

Taxonomy editor

  • Transactions
    • returns
    • disputes
    • refunds
    • chargebacks
Selection · 5 of 24Bulk apply · category → returns
idcategoryprioritystate
txn-00481returnsp2queued
txn-00482returnsp2queued
txn-00483disputesp1queued
txn-00484returnsp2queued
txn-00485refundsp3queued

3D · calibration-aware cuboid

Cuboids that carry their calibration.

LiDAR and depth annotations stay attached to the sensor frame, extrinsics, and time sync they were drawn against. Export-ready without a recalibration pass.

Calibration envelope

Sensor frame
lidar_top
Extrinsics RMSE
0.014 m
Intrinsics version
2026.03
Time sync
±2.8 ms

Quality Hub · preview

IAA. Drift. Reopen rate. One hub.

Trend line, drift heatmap, reopen sparkline, and the active rule set live on one page. Sign-in to the platform for the full Quality Hub at /annotation/quality.

IAA median · 30d

0.84

Target 0.75 · trend up

Drift heatmap · per annotator / per week

Cells above 0.05 force adjudication automatically.

Reopen rate · 30d

1.8%

Ceiling 2.0% · below target.

Active rules

  • IAA floor

    Blocks export when IAA median < 0.80 across the gold set.

  • Reopen-rate ceiling

    Auto-escalates when reopen rate > 3% for any annotator.

  • Drift guard

    Forces adjudication when drift vs baseline > 0.05.

Export gate

Blocks weak data before it leaves.

Four checks run before an export ships. If any check fails, the export is blocked until the team clears the finding — no weak dataset gets delivered by accident.

Export job · dataset-471
Blocked
  • IAA floor0.84 / 0.80
  • Reopen rate1.8% / 2.0%
  • Gold-set coverage82% / 95%
  • Adjudication backlog0 / 0

Live snapshot: 26 of 30 recent export jobs passed the gate.

Adjudication · auto-escalation

From disagreement to decision.

Threshold rules watch the queue. When agreement drops or reopen rate spikes, records auto-route to adjudication with the evidence already attached.

  1. 01

    Disagreement detected

    Two labelers, same record, conflicting spans.

  2. 02

    Threshold rule evaluates

    Reopen-rate + drift checks against live thresholds.

  3. 03

    Auto-escalate to adjudicator

    Rule fires — record routed to the adjudication queue.

  4. 04

    Decision + audit

    Adjudicator resolves; evidence + timestamps stored on record.

Queue depth: 14 · auto-escalated 6 · median wait 38 min
0.840.0182614

Next path

The production surface is one sign-in away.

Open the Annotation product page for the full story, or sign in to run a real session with your own dataset and rubric.

Annotation demo · read-only preview · live-metrics JSON bound