MODELS · RL ENVIRONMENTS

The environments your team trains in. Reviewed and signed.

The teams who run the work write the environment. One review, one signed approval, one shared registry. Train your model against it, and keep the weights.

Start a project See pricing

CATALOG

22 environments

Approved environments in the shared registry, each with its owner and version attached.

RUNS

48 runs

Training runs executed against approved environments, each logged to the team that ran it.

SIGNED

Approved once

A reviewer signature, a registry version, and a usage log on every environment that ships.

HOW IT WORKS

Three steps. One registry.

Build the environment. Review and approve it once. Share it across every team from one registry.

STEP 01

WHAT GETS BUILT

Build the environment

The teams who run the work spec the environment. Reward, reset, state, action - written once by the people who use it.

→

STEP 02

WHAT GETS APPROVED

Review and approve

Every submission runs one review: static checks, policy gates, and a human signature before any team can train against it.

→

STEP 03

WHAT GETS REUSED

Share across teams

Approved environments land in the registry with version, owner, and usage log attached. One source every team trains from.

WHAT COMES OUT

What your team leaves with.

Every approval leaves a record. Every run leaves a log. Every team trains against the same approved environment, with the proof attached.

Environment specs

Reward, reset, state, and action - written down by the people who run the work.

↳ ARTIFACT

Approval records

Static checks, policy gates, and the reviewer signature that opened the door.

↳ ARTIFACT

Version registry

Every approved environment with its version, owner, lineage, and the runs that depend on it.

↳ ARTIFACT

Usage logs

Which team ran it, which run consumed it, and what the result was.

↳ ARTIFACT

WHERE IT FITS

In the loop, this is where you test.

Test the run against the environment. Review what broke. Recruit the specialist. Remember the miss. Approve what's right.

Test

● YOU ARE HERE

Review

Recruit

Remember

Approve

READOUT · DECISION GRAPH

RELATED MODULES

Next to this in Models.

TRAINING

Train against the same environment.

Every team starts from the same approved spec, and keeps the weights they tune on it. No drift between research and rollout review.

See the page →

AUTOPILOT

Run the workflow without the toil.

Schedule runs across approved environments. The catalog is the contract.

See the page →

FEDERATED LEARNING

Learn together. Keep it separate.

Train across teams on the same approved environment. Share the updates, not the data.

See the page →

RL ENVIRONMENTS

Built by the people who use them.

Bring the environment your team already runs. We review it, sign it, and make it shared. You train against it, and keep the weights.

Start a project See pricing