AURAONE OPEN · TRUST TOOLKIT · APACHE-2.0

The provenance machinery, source listed.

Eval manifests, regression banks, dataset and embodiment cards, contamination audits. The provenance an audit asks for, with release and runtime proof required before availability is marketed. Nothing you review is represented as uploaded or pooled.

The same checks that sign the evidence inside Human Data OS and App Data OS. Source links are listed; release proof is required.

LOCAL FIRST · NOTHING POOLED · YOU KEEP THE CODE
AGENT TRUST
MCP and A2A

Risk linting, contract tests, trace replay, OTEL bridges, trace cards. CI runtime proof required.

ROBOTICS DATA
LeRobot first

Quality gates, recovery metrics, VLA probes, embodiment cards. Runtime and data-boundary proof required.

REVIEW SURFACE
Open the source

Reproducible failure cases. Review packets a reader can open without an account.

TRUST TOOLKIT CATALOG

Twelve tools you can review for source proof.

Each one is a command-line diagnostic, a portable file a reviewer can diff, a GitHub workflow, or a review packet a reader can open without an account. Apache-2.0 source links are listed, with release proof required before runtime availability is marketed. These are the open primitives the signed evidence in Human Data OS and App Data OS is built on. The reliability layer should be inspectable.

AGENT TRUST · 6 tools

Repo-level regression gates and replay harnesses the agent cannot game. Risk linting, contract tests, OTEL bridges, and trace cards a reviewer can open. CI runtime proof required.

MCP-RISK-LINTER
01
SOURCE LINK

Apache-2.0 · release proof required

Risk lint for MCP manifests.

Risk taxonomy, CLI, and GitHub Action wrapper for MCP manifests, permissions, claims, and unsafe tool surfaces.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
A2A-CONTRACT-TEST
02
SOURCE LINK

Apache-2.0 · release proof required

Contract tests for A2A cards.

Offline contract tests for A2A agent cards, task lifecycle states, structured payloads, and errors.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
TOOL-CALL-REPLAY
03
SOURCE LINK

Apache-2.0 · release proof required

Deterministic agent replay.

Deterministic replay harness that turns failed agent tool-call traces into local regression tests.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
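
As a rough illustration of the replay idea, and not the TOOL-CALL-REPLAY API itself, the sketch below turns one recorded failing trace into a local regression test. The trace format, `make_replay_tools`, and `agent_under_test` are hypothetical names invented for this example.

```python
import json
from typing import Any, Callable

# Hypothetical trace format: one recorded tool call per entry.
RECORDED_TRACE = [
    {"tool": "search", "args": {"query": "order 1182"}, "result": {"status": "found"}},
    {"tool": "refund", "args": {"order_id": 1182}, "result": {"error": "timeout"}},
]


def make_replay_tools(trace: list[dict[str, Any]]) -> Callable[[str, dict], Any]:
    """Return a tool dispatcher that serves recorded results in order."""
    cursor = iter(trace)

    def call_tool(name: str, args: dict) -> Any:
        expected = next(cursor, None)
        if expected is None:
            raise AssertionError(f"unexpected extra call: {name}")
        # Canonical JSON makes the comparison independent of key order.
        got = json.dumps({"tool": name, "args": args}, sort_keys=True)
        want = json.dumps(
            {"tool": expected["tool"], "args": expected["args"]}, sort_keys=True
        )
        if got != want:
            raise AssertionError(f"trace diverged: got {got}, recorded {want}")
        return expected["result"]

    return call_tool


def agent_under_test(call_tool: Callable[[str, dict], Any]) -> str:
    """Stand-in agent: looks up an order, then attempts a refund."""
    found = call_tool("search", {"query": "order 1182"})
    if found["status"] != "found":
        return "not_found"
    outcome = call_tool("refund", {"order_id": 1182})
    return "escalate" if "error" in outcome else "refunded"


def test_refund_timeout_escalates() -> None:
    # No network and no live tools: the recorded results drive the run.
    assert agent_under_test(make_replay_tools(RECORDED_TRACE)) == "escalate"
```

Because the dispatcher serves only recorded results and fails on any divergence, the test gives the same answer on every machine, which is what makes the failure reviewable.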
AGENT-TRACE-CARD
04
SOURCE LINK

Apache-2.0 · release proof required

Portable cards for one run.

Portable Markdown and JSON cards for one agent run: tools, retries, data touched, outcome, failure mode.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
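
To make the card idea concrete, here is a minimal sketch of one run rendered as both JSON and Markdown. It assumes nothing about the real AGENT-TRACE-CARD schema: the `TraceCard` class and its field names are hypothetical and simply mirror the fields listed above.

```python
import json
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class TraceCard:
    """Hypothetical card for one agent run; field names are illustrative."""

    run_id: str
    tools: list[str]
    retries: int
    data_touched: list[str]
    outcome: str
    failure_mode: Optional[str] = None

    def to_json(self) -> str:
        # Sorted keys keep the file stable, so reviewers can diff two cards.
        return json.dumps(asdict(self), indent=2, sort_keys=True)

    def to_markdown(self) -> str:
        rows = [
            ("Tools", ", ".join(self.tools)),
            ("Retries", str(self.retries)),
            ("Data touched", ", ".join(self.data_touched)),
            ("Outcome", self.outcome),
            ("Failure mode", self.failure_mode or "none"),
        ]
        body = "\n".join(f"- {label}: {value}" for label, value in rows)
        return f"# Trace card {self.run_id}\n\n{body}\n"


card = TraceCard(
    run_id="run-0042",
    tools=["search", "refund"],
    retries=2,
    data_touched=["orders"],
    outcome="failed",
    failure_mode="tool timeout",
)
print(card.to_markdown())
```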
OTEL-EVAL-BRIDGE
05
SOURCE LINK

Apache-2.0 · release proof required

OTEL spans into eval cases.

Bridge OpenTelemetry and Phoenix GenAI spans into redacted eval regression cases.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
PROMPT-RUBRIC-DRIFT
06
SOURCE LINK

Apache-2.0 · release proof required

Drift notes for PR review.

No-model PR review notes for prompt and rubric changes: weights, criteria, boundaries, injection exposure.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
ROBOTICS DATA · 4 tools

Local quality gates, failure and recovery metrics, VLA robustness probes, and release cards for teleop and VLA datasets. Review robot datasets without uploading the robot.

LEROBOT-QUALITY-GATES
07
SOURCE LINK

Apache-2.0 · release proof required

LeRobot dataset gates.

Local quality gates for LeRobot-style datasets: metadata, episodes, sensors, action and state fields.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
ROBOT-RECOVERY-BENCH
08
SOURCE LINK

Apache-2.0 · release proof required

Recovery and intervention metrics.

Schema and metrics for human intervention and recovery segments, including repeated-failure clusters.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
VLA-ROBUSTNESS-KIT
09
SOURCE LINK

Apache-2.0 · release proof required

VLA perturbation probes.

Simulator-light VLA diagnostics for language, vision, metadata, task-phase, and embodiment perturbations.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
EMBODIMENT-CARD
10
SOURCE LINK

Apache-2.0 · release proof required

Release cards for robot data.

Structured robot dataset and VLA release cards for sensors, action spaces, frames, control rate, limits.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
REVIEW SURFACE · 2 tools

The failure bank in your hands: reproducible failure cases that preserve each mistake so it is caught the next time it appears. Review packets a reader can open without an account.

FAILURE-GALLERY
11
SOURCE LINK

Apache-2.0 · release proof required

Reproducible failure cases.

Synthetic agent and robotics failure cases with reproducible commands and expected review labels.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
LAB-OUTREACH-KIT
12
SOURCE LINK

Apache-2.0 · release proof required

Packets for lab review.

Technical review packets, no-endorsement language, and a feedback-log schema for lab review asks.

LICENSE
Apache-2.0 · read before you run
SOURCE LINK →
OPEN TOOLING INDEX

Packages, Actions, SDKs, and CLIs you keep.

The twelve tools above, plus the rest of AuraOne Open on GitHub, npm, PyPI, and Homebrew: CI Actions, SDKs, CLIs, schema tools, and example repos, all on one page. Apache-2.0 source links are listed, with package and runtime proof required before install availability is marketed. Nothing is represented as pooled on our servers.

EVALKIT AND RUBRICS · 10 packages

Portable rubric specs, deterministic scoring, judge diagnostics, IAA, contamination checks, adapters, and eval-run provenance. The provenance machinery an audit asks for; package and local-runtime proof are required before availability is marketed.

auraone-evalkit
PyPI

Local evaluation tooling for rubric validation, linting, and deterministic scoring. Source link listed; package proof required.

PyPI release proof required
rubric-spec
PyPI

Portable AuraOne Rubric Schema v1 validator and adapters.

PyPI release proof required
iaa-kit
PyPI

Modern inter-annotator agreement metrics with bootstrap confidence intervals.

PyPI release proof required
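
For readers unfamiliar with the technique, the sketch below computes Cohen's kappa for two annotators and a percentile bootstrap confidence interval. It uses only the standard library and is not the iaa-kit API: the function names, the choice of kappa, and the percentile method are assumptions made for this example.

```python
import random
from collections import Counter


def cohen_kappa(a: list[str], b: list[str]) -> float:
    """Chance-corrected agreement between two annotators on the same items."""
    n = len(a)
    observed = sum(x == y for x, y in zip(a, b)) / n
    count_a, count_b = Counter(a), Counter(b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used one identical label throughout
    return (observed - expected) / (1.0 - expected)


def bootstrap_ci(a, b, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap: resample items with replacement, recompute kappa."""
    rng = random.Random(seed)  # fixed seed keeps the interval reproducible
    n = len(a)
    stats = []
    for _ in range(n_resamples):
        idx = [rng.randrange(n) for _ in range(n)]
        stats.append(cohen_kappa([a[i] for i in idx], [b[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_resamples)]
    upper = stats[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


ann_a = ["pass", "pass", "fail", "fail", "pass", "fail", "pass", "pass", "fail", "pass"]
ann_b = ["pass", "fail", "fail", "fail", "pass", "fail", "pass", "pass", "pass", "pass"]

kappa = cohen_kappa(ann_a, ann_b)  # (0.80 - 0.52) / (1 - 0.52), about 0.583
low, high = bootstrap_ci(ann_a, ann_b)
print(f"kappa={kappa:.3f}, 95% CI=({low:.3f}, {high:.3f})")
```

With only ten items the interval will be wide, which is the point of reporting it alongside the point estimate.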
judge-card
PyPI

Judge Card schema, validator, and renderer for judge-model disclosure.

PyPI release proof required
judge-bench
PyPI

Diagnostic probes for LLM-as-judge reliability and bias checks.

PyPI release proof required
eval-adapter
PyPI

Adapters between rubric-spec and common evaluation framework inputs.

PyPI release proof required
eval-run-manifest
PyPI

Signed manifest envelope for eval runs, artifacts, and reproducibility.

PyPI release proof required
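
One way to picture a signed manifest envelope is sketched below: artifact digests go into a canonical JSON manifest, and a signature over those bytes travels with it. This is an assumption-laden stand-in, not the eval-run-manifest format. It uses HMAC-SHA256 with a shared key from the standard library, and every field and function name is hypothetical; a real tool may well use asymmetric signatures instead.

```python
import hashlib
import hmac
import json


def digest(data: bytes) -> str:
    """SHA-256 hex digest of one artifact's bytes."""
    return hashlib.sha256(data).hexdigest()


def build_manifest(run_id: str, artifacts: dict[str, bytes]) -> dict:
    """Record each artifact by name and content digest (hypothetical fields)."""
    return {
        "run_id": run_id,
        "artifacts": {name: digest(blob) for name, blob in sorted(artifacts.items())},
    }


def canonical(manifest: dict) -> bytes:
    # Sorted keys and fixed separators give byte-identical output every time.
    return json.dumps(manifest, sort_keys=True, separators=(",", ":")).encode("utf-8")


def sign(manifest: dict, key: bytes) -> dict:
    signature = hmac.new(key, canonical(manifest), hashlib.sha256).hexdigest()
    return {"manifest": manifest, "signature": signature}


def verify(envelope: dict, key: bytes) -> bool:
    expected = hmac.new(key, canonical(envelope["manifest"]), hashlib.sha256).hexdigest()
    # compare_digest avoids leaking how many leading characters matched.
    return hmac.compare_digest(expected, envelope["signature"])


key = b"example-key-not-a-secret"
envelope = sign(build_manifest("run-001", {"scores.json": b'{"acc": 0.91}'}), key)
print(verify(envelope, key))
```

Changing any artifact digest or the run ID after signing makes `verify` return `False`, so a reviewer can tell whether the artifacts on disk are the ones the run produced.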
contamination-audit
PyPI

N-gram, embedding, canary, answer-pattern, and corpus contamination auditor.

PyPI release proof required
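
The n-gram check in that list can be sketched in a few lines: measure how many of an eval item's word n-grams also appear in a training document. This is a minimal illustration, not the contamination-audit API; the function names, whitespace tokenization, and default `n` are assumptions.

```python
def ngrams(text: str, n: int) -> set[tuple[str, ...]]:
    """Lowercased, whitespace-tokenized word n-grams."""
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def overlap_ratio(eval_item: str, corpus_doc: str, n: int = 5) -> float:
    """Share of the eval item's n-grams that also occur in the corpus document."""
    item_grams = ngrams(eval_item, n)
    if not item_grams:
        return 0.0  # item shorter than n tokens: nothing to compare
    return len(item_grams & ngrams(corpus_doc, n)) / len(item_grams)


item = "what is the capital of france and why"
doc = "trivia: what is the capital of france today"
print(overlap_ratio(item, doc))  # 2 of the item's 4 five-grams match: 0.5
```

A high ratio flags the item for review rather than proving contamination, which is why the check sits alongside the embedding, canary, and answer-pattern audits.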
synthetic-disagreement
PyPI

Controlled synthetic annotator disagreement generator for adjudication workflows.

PyPI release proof required
eval-conformance-suite
PyPI

Executable rubric-spec v1 conformance suite and badges.

PyPI release proof required
GITHUB ACTIONS · 4 source links

CI surfaces for eval validation, dataset-card checks, rubric PR feedback, and action smoke fixtures. Action runtime proof is required before workflow availability is marketed.

evalkit-action
GitHub Action

GitHub Action for running EvalKit validation, scoring, and reporting in CI.

GitHub Action proof required
datasheet-ci
GitHub Action

GitHub Action for validating dataset cards and required metadata in pull requests.

GitHub Action proof required
rubric-pr-bot
GitHub Action

Rubric diffs and lint feedback for pull requests that change evaluation criteria.

GitHub Action proof required
open-v2-action-smoke
GitHub

Public smoke tests for AuraOne Open GitHub Action workflows.

source checkout proof required
SDKS AND CLIS · 6 packages

Hosted API SDKs, headless Studio engines, and the browser UI and 3D SDKs you keep. Editable source, your code. Apache-2.0.

auraone-sdk
PyPI

Python SDK and CLI for the AuraOne hosted API.

PyPI release proof required
auraone-agent-studio-open
PyPI

Headless Agent Studio Open protocol, trace-store, sidecar, and export CLI.

PyPI release proof required
robostudio-engine
PyPI

Headless Robotics Studio Open dataset adapters, QA, clustering, orchestration, and exports.

PyPI release proof required
@aura3d/engine
npm

The AI-native 3D SDK for the web. Browser-native scenes and typed GLB and glTF assets.

npm release proof required
aura-glass
npm

The Liquid Glass app-surface system for React and Next.js. Your interface, your dependencies.

npm release proof required
@auraone/rubric-studio
npm

Rubric Studio Open VS Code and JavaScript integration package.

npm release proof required
TAPS AND COOKBOOKS · 3 public repos

Homebrew and desktop Studio listings require current cask, release, and platform proof before install availability is marketed.

homebrew-open
Homebrew

Homebrew listing requires current release proof.

Homebrew release proof required
tap
Homebrew

Homebrew cask listing requires current desktop release proof.

Homebrew cask proof required
agent-studio-cookbook
GitHub

Worked MCP, A2A, OTEL, replay, and CI examples for Agent Studio Open.

source checkout proof required
TRUST TOOLKIT

Read the source. Keep the code.

After a competitor lost four terabytes of data, including the identities of its workers, nobody wants tooling that pools their data. Open never does. These are the same QA and provenance checks that sign the evidence in Human Data OS and App Data OS. Source links are listed, while package and runtime proof are required before availability is marketed. Start open. Bring AuraOne in when the problem becomes shared authorship, approval queues, or governance across a team.
