Local-first IDE for MCP and A2A agents.
Connect, inspect, record, replay, compare model behavior, ingest OTEL spans, and export regression suites to CI.
Four IDEs and a library of engines for agent reliability, rubric authoring, robotics review, and dataset trust. Public source. No account. Your data stays on disk.
Engines, IDEs, actions, packets — every one public, every one MIT.
Agent Studio, Rubric Studio, Robotics Studio, and the Open v2 set.
Frontier labs, agent platforms, and PR review workflows already in.
Each one runs on disk, ships MIT, and exports a portable artifact a reviewer can pick up without a hosted account.
Connect, inspect, record, replay, compare model behavior, ingest OTEL spans, and export regression suites to CI.
Local, file-based, git-friendly authoring for criterion-level evaluations. Author, test, calibrate, diff, and export.
Open LeRobot, RLDS, OpenX, HDF5, ROS bag, and mp4/jsonl captures. Scrub, tag, cluster, probe, export.
Twelve installable packages for MCP/A2A review, trace replay, robotics data quality, VLA diagnostics, and review packets.
The libraries underneath every Open IDE. Each runs from a notebook, a CLI, or a GitHub Action.
Risk taxonomy and lint pass for MCP server manifests.
Offline contract tests for A2A agent cards and task lifecycles.
Deterministic replay harness for failed agent tool calls.
Portable Markdown and JSON card for one agent run.
Bridge OTEL and Phoenix GenAI spans into eval regression cases.
Read the source. Run it locally. Bring AuraOne in when shared state is the actual problem.