open-source-models agentic-coding self-hosted rl-training moe

Ornith-1.0 open-source coding agents ship in four sizes

MIT-licensed agentic models (9B–397B) trained with RL to optimize both solution rollouts and search scaffolds, available in dense and MoE variants with 256K context and OpenAI-compatible serving.

Summary

Removes the dependency on closed models for agentic coding workflows: the dense 9B runs on a single 80GB GPU, while the MoE variants shard across multi-GPU nodes. Reported benchmark numbers are competitive with or better than comparable open baselines on SWE-bench, Terminal-Bench, and NL2Repo.
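The single-GPU claim can be sanity-checked with rough arithmetic. The sketch below assumes 16-bit weights (2 bytes per parameter), treats the advertised sizes as total parameter counts, and counts weights only; none of these assumptions come from the release notes. KV cache and activations add to the footprint, so the result is a lower bound rather than a deployment plan. The function names are illustrative.

```python
import math


def weight_gb(params_billion: float, bytes_per_param: float = 2.0) -> float:
    """Approximate weight memory in GB (1e9 bytes).

    Assumes 2 bytes per parameter (bf16/fp16) unless overridden.
    """
    return params_billion * bytes_per_param


def min_gpus_for_weights(
    params_billion: float,
    gpu_gb: float = 80.0,
    bytes_per_param: float = 2.0,
) -> int:
    """Lower bound on GPU count: weights only, no KV cache or activations."""
    return math.ceil(weight_gb(params_billion, bytes_per_param) / gpu_gb)


if __name__ == "__main__":
    for size in (9, 397):
        print(
            f"{size}B: ~{weight_gb(size):.0f} GB of weights, "
            f">= {min_gpus_for_weights(size)} x 80GB GPU(s)"
        )
```

Under these assumptions the 9B weights take about 18 GB, which leaves most of an 80GB card for KV cache, while the 397B weights alone need at least ten such cards.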

Why it matters

Implementation verdict

Production-ready for self-hosted deployment. Requires transformers ≥5.8.1, vLLM ≥0.19.1, or SGLang ≥0.5.9. The model emits reasoning traces in <think> blocks; the serving parsers surface tool_calls and reasoning_content separately. The dense 9B is the lowest-friction entry point; the MoE 35B and 397B variants demand multi-GPU infrastructure. Worth trying now if you control your serving infra.
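A minimal sketch of what handling that output might look like on the client side. It assumes the assistant message arrives as an OpenAI-style dict with `content`, `tool_calls`, and (when a server-side reasoning parser is active) `reasoning_content`; if that field is absent, it falls back to splitting raw <think> blocks out of `content`. The helper name `split_message`, the example tool `run_shell`, and the fallback behavior are illustrative, not part of the Ornith release.

```python
import re
from typing import Any

# Matches one <think>...</think> block, including newlines inside it.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_message(message: dict[str, Any]) -> dict[str, Any]:
    """Separate reasoning, visible content, and tool calls.

    Prefers the server-parsed `reasoning_content` field; otherwise
    extracts <think> blocks from the raw content (an assumed fallback).
    """
    content = message.get("content") or ""
    reasoning = message.get("reasoning_content")
    if reasoning is None:
        blocks = THINK_RE.findall(content)
        reasoning = "\n".join(b.strip() for b in blocks) or None
        content = THINK_RE.sub("", content).strip()
    return {
        "reasoning": reasoning,
        "content": content,
        "tool_calls": message.get("tool_calls") or [],
    }


if __name__ == "__main__":
    raw = {"content": "<think>check tests first</think>Run pytest."}
    print(split_message(raw))
```

Keeping the reasoning separate from the visible content lets an agent loop log the trace without feeding it back into the next tool call.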

Sources

1. Available in 9B-Dense, 31B-Dense, 35B-MoE, and 397B-MoE
2. MIT licensed, globally accessible, and free from regional limitations
3. the model discovers better search trajectories and generates higher-quality solutions
4. the dense 9B fits on a single 80GB GPU
5. transformers >= 5.8.1
6. 256K (262,144-token) context window

Dev Signal
