Skip to main content
ATLAS Hero Image
Have questions? Chat with the docs using the assistant at the bottom.

What is ATLAS?

ATLAS is a continual learning framework for production LLM agents. It combines runtime quality control with offline reinforcement learning to improve agent reliability, reduce token costs, and build domain expertise through persistent memory. The system layers a dual-agent reasoning loop (student + verifying teacher) on top of any model. The Atlas SDK streams causality traces into Postgres, and Atlas Core (this repository) trains new teacher checkpoints via on-policy distillation (GKD) or reinforcement learning (GRPO).
ATLAS System Architecture

Atlas runtime captures traces; Atlas Core trains improved models from those traces.

Which Repository Do You Need?

Atlas SDK

Use if you want to:
  • Run agents with quality control
  • Get runtime supervision and retries
  • Export traces for later training
Repository: atlas-sdkInstallation: pip install arc-atlasStart: SDK Quickstart

Atlas Core

Use if you want to:
  • Train custom teacher models
  • Run GKD or GRPO training
  • Fine-tune on exported traces
Repository: ATLASInstallation: See Installation GuideStart: GKD Training
Most teams only need the SDK. Atlas Core requires GPUs and is only necessary if you are training custom models. The SDK works with any OpenAI-compatible API.

Why Use ATLAS?

BenefitHowResult
Lower costsAdaptive supervision allocates reasoning only when needed~50% token reduction
Higher reliabilityReal-time teacher review catches errors before production impact+15.7% avg task success
Continuous improvementOffline training (GRPO/GKD) updates teacher from production tracesCompounding expertise over time
→ Full methodology in Technical Report → System concepts in Adaptive Dual-Agent Reasoning
Runtime vs. Training: The Atlas SDK handles runtime orchestration and trace export. Atlas Core (this repository) handles offline training (GKD/GRPO).

End-to-End Workflow

StageRun ThisOutputTime
Runtime quality controlatlas runReviewed traces with reward scoresMinutes
Export tracesarc-atlas exportJSONL dataset from approved sessionsMinutes
Train teacher (GKD)scripts/run_offline_pipeline.pyDistilled teacher checkpoint4-8 hours
Train teacher (GRPO)configs/run/teacher_rcl.yamlRL-optimized teacher checkpoint24-48 hours
Every stage feeds the next—runtime traces become training data; trained checkpoints redeploy to runtime.

Getting Started: Two Paths

I want to…PathStart Here
Run tasks with dual-agent orchestrationAtlas SDKSDK Quickstart
Wrap my existing agent in quality-control loopAtlas SDKBYOA Adapters
Distill traces into smaller teacherAtlas CoreGKD Training
Train from rewards (RL)Atlas CoreGRPO Training

Research & Resources