Atlas Core uses Hydra to compose model, dataset, trainer, and reward presets. This page is a complete parameter reference for launching GRPO, GKD, or SFT jobs.
For workflow guides, see GRPO Training or GKD Training. This page focuses on exhaustive parameter lookup.
Trainer Base Defaults (src/atlas_core/configs/trainer/base.yaml)
Parameter
Default
When to Change
max_steps
450
Override in run recipe
num_train_epochs
1
Mutually exclusive with max_steps
train_batch_size
64
Effective batch across devices
per_device_train_batch_size
2
Per-rank micro batch
gradient_accumulation_steps
Inferred
Auto-computed if omitted
gradient_checkpointing
true
Memory savings for long contexts
learning_rate
5e-7
Baseline LR for RL
weight_decay
0
Regularization needed
max_grad_norm
1.0
Gradient clipping value
lr_scheduler_type
"cosine"
Constant/linear schedules
warmup_ratio
0.03
Warmup fraction of total steps
bf16 / tf32
true / true
Mixed-precision on supported GPUs
ddp_timeout
18000 seconds
Distributed training timeout
gradient_accumulation_steps is auto-computed: train_batch_size / (per_device_train_batch_size × num_devices). Provide any two values; the launcher resolves the third (see src/atlas_core/cli/train.py).
Extends base GRPO with diagnostic prompts and teacher-specific controls.
Parameter
Default
When to Change
trainer_log_name
teacher_grpo_rw_${reward_log_name}
Appends reward preset name
logging_prob
0.1
Fraction of episodes logged
student_model
null
Co-train student alongside teacher
use_reference_teacher_model
false
Compare vs static reference
completion_only_training
false
Completion-only datasets
trainer_args.max_probe_tokens
500
Diagnostic prompt budget
trainer_args.student_diagnostic_template
Multiline
Reflection prompt (see below)
trainer_args.teacher_adaptive_template
Multiline
Guidance prompt (see below)
trainer_args.student_with_teaching_template
Multiline
Apply feedback prompt (see below)
Default Prompt Templates:
student_diagnostic_template: | Question: {question} Before solving, briefly describe: 1. What type of problem this is 2. The key concepts or steps needed 3. Any potential challenges you seeteacher_adaptive_template: | Question: {question} Student's approach: {approach} <thinking> [Analyze student approach] </thinking> <teaching> [Only guidance to student - no answers] </teaching>student_with_teaching_template: | Question: {question} A teacher has provided: {teaching} Now solve step by step. <solution></solution>