Training Pipeline
Stages
- Knowledge Baking — Commands, recipes, enchantments as training examples
- API Distillation — Frontier models generate gold-standard responses
- RCON Validation — Commands executed on real server
- Self-Play — Model generates edge cases, learns from failures
Data: 3,500+ examples
| Category | Count |
|---|---|
| Command syntax | 107 |
| Crafting recipes | 176 |
| Enchantments | 60 |
| Entities/mobs | 60 |
| Gamerules + safety | 224 |
| Tool-calling | 1,159 |
| API-distilled | 344 |
| Bot interactions | 448+ |