AURAONE OPEN · TRUST TOOLKIT · APACHE-2.0

The provenance machinery, source listed.

Eval manifests, regression banks, dataset and embodiment cards, contamination audits. The provenance an audit asks for, with release and runtime proof required before availability is marketed. Nothing you review is represented as uploaded or pooled.

The same checks that sign the evidence inside Human Data OS and App Data OS. Source links are listed; release proof is required.

Read the source Back to Open

LOCAL FIRST · NOTHING POOLED · YOU KEEP THE CODE

AGENT TRUST

MCP and A2A

Risk linting, contract tests, trace replay, OTEL bridges, trace cards. CI runtime proof required.

ROBOTICS DATA

LeRobot first

Quality gates, recovery metrics, VLA probes, embodiment cards. Runtime and data-boundary proof required.

REVIEW SURFACE

Open the source

Reproducible failure cases. Review packets a reader can open without an account.

TRUST TOOLKIT CATALOG

Twelve tools you can review for source proof.

Each one is a command-line diagnostic, a portable file a reviewer can diff, a GitHub workflow, or a review packet a reader can open without an account. Apache-2.0 source links are listed, with release proof required before runtime availability is marketed. These are the open primitives the signed evidence in Human Data OS and App Data OS is built on. The reliability layer should be inspectable.

AGENT TRUST · 6 tools

Repo-level regression gates and replay harnesses the agent cannot game. Risk linting, contract tests, OTEL bridges, and trace cards a reviewer can open. CI runtime proof required.

MCP-RISK-LINTER

№ 01

SOURCE LINK

Apache-2.0 · release proof required

Risk lint for MCP manifests.

Risk taxonomy, CLI, and GitHub Action wrapper for MCP manifests, permissions, claims, and unsafe tool surfaces.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

A2A-CONTRACT-TEST

№ 02

SOURCE LINK

Apache-2.0 · release proof required

Contract tests for A2A cards.

Offline contract tests for A2A agent cards, task lifecycle states, structured payloads, and errors.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

TOOL-CALL-REPLAY

№ 03

SOURCE LINK

Apache-2.0 · release proof required

Deterministic agent replay.

Deterministic replay harness that turns failed agent tool-call traces into local regression tests.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

AGENT-TRACE-CARD

№ 04

SOURCE LINK

Apache-2.0 · release proof required

Portable cards for one run.

Portable Markdown and JSON cards for one agent run: tools, retries, data touched, outcome, failure mode.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

OTEL-EVAL-BRIDGE

№ 05

SOURCE LINK

Apache-2.0 · release proof required

OTEL spans into eval cases.

Bridge OpenTelemetry and Phoenix GenAI spans into redacted eval regression cases.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

PROMPT-RUBRIC-DRIFT

№ 06

SOURCE LINK

Apache-2.0 · release proof required

Drift notes for PR review.

No-model PR review notes for prompt and rubric changes: weights, criteria, boundaries, injection exposure.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

ROBOTICS DATA · 4 tools

Local quality gates, failure and recovery metrics, VLA robustness probes, and release cards for teleop and VLA datasets. Review robot datasets without uploading the robot.

LEROBOT-QUALITY-GATES

№ 07

SOURCE LINK

Apache-2.0 · release proof required

LeRobot dataset gates.

Local quality gates for LeRobot-style datasets: metadata, episodes, sensors, action and state fields.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

ROBOT-RECOVERY-BENCH

№ 08

SOURCE LINK

Apache-2.0 · release proof required

Recovery and intervention metrics.

Schema and metrics for human intervention and recovery segments, including repeated-failure clusters.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

VLA-ROBUSTNESS-KIT

№ 09

SOURCE LINK

Apache-2.0 · release proof required

VLA perturbation probes.

Simulator-light VLA diagnostics for language, vision, metadata, task-phase, and embodiment perturbations.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

EMBODIMENT-CARD

№ 10

SOURCE LINK

Apache-2.0 · release proof required

Release cards for robot data.

Structured robot dataset and VLA release cards for sensors, action spaces, frames, control rate, limits.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

REVIEW SURFACE · 2 tools

The failure bank in your hands: reproducible failure cases that keep the mistake so it is caught again. Review packets a reader can open without an account.

FAILURE-GALLERY

№ 11

SOURCE LINK

Apache-2.0 · release proof required

Reproducible failure cases.

Synthetic agent and robotics failure cases with reproducible commands and expected review labels.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

LAB-OUTREACH-KIT

№ 12

SOURCE LINK

Apache-2.0 · release proof required

Packets for lab review.

Technical review packets, no-endorsement language, and a feedback-log schema for lab review asks.

LICENSE

Apache-2.0 · read before you run

SOURCE LINK →

OPEN TOOLING INDEX

Packages, Actions, SDKs, and CLIs you keep.

The twelve tools above, plus the rest of AuraOne Open on GitHub, npm, PyPI, and Homebrew: CI Actions, SDKs, CLIs, schema tools, and example repos from one page. Apache-2.0 source links are listed, with package and runtime proof required before install availability is marketed. Nothing is represented as pooled on our servers.

EVALKIT AND RUBRICS · 10 packages

Portable rubric specs, deterministic scoring, judge diagnostics, IAA, contamination checks, adapters, and eval-run provenance. The provenance machinery an audit asks for; package and local-runtime proof are required before availability is marketed.

auraone-evalkit

PyPI

Local evaluation tooling for rubric validation, linting, and deterministic scoring. Source link listed; package proof required.

PyPI release proof required

The provenance machinery, source listed.

Twelve tools you can review for source proof.

Risk lint for MCP manifests.

Contract tests for A2A cards.

Deterministic agent replay.

Portable cards for one run.

OTEL spans into eval cases.

Drift notes for PR review.

LeRobot dataset gates.

Recovery and intervention metrics.

VLA perturbation probes.

Release cards for robot data.

Reproducible failure cases.

Packets for lab review.

Packages, Actions, SDKs, and CLIs you keep.

Read the source. Keep the code.