code-reasoning open-source-models inference-cost rl-training benchmarks

Together releases DeepCoder-14B coding model

14B open-source model performs on par with o3-mini (low) on competition-level coding tasks; the model, training code, dataset, and RL framework are all available for reproducibility.

Summary

Reduces dependency on closed, API-gated models for competition-level coding tasks. Developers can audit the training pipeline, fine-tune on proprietary codebases, and run inference locally without per-token API costs.

Why it matters

Implementation verdict

Can replace o3-mini (low) API calls for coding tasks where higher latency is acceptable. Requires GPU capacity for a 14B model (about 28GB of VRAM for the weights alone at 16-bit precision) and integration via Hugging Face Transformers. Training cost is documented at approximately $26,880 (~$27K); worth evaluating now as a baseline for local reasoning-based coding agents.
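
The 28GB figure follows from simple arithmetic: 14 billion parameters at 2 bytes each. A minimal sketch of that estimate is below. It assumes a nominal 14e9 parameter count and counts weights only; the KV cache, activations, and framework overhead add to the total, and the quantized rows are illustrative byte widths rather than measured footprints. The function name `weight_memory_gb` is hypothetical.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Memory needed for model weights alone, in decimal GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Assumption: nominal parameter count; the real count may differ slightly.
PARAMS = 14e9

# Bytes per parameter at common storage precisions.
PRECISIONS = {
    "fp32": 4.0,
    "bf16/fp16": 2.0,
    "int8": 1.0,
    "4-bit": 0.5,
}

if __name__ == "__main__":
    for name, nbytes in PRECISIONS.items():
        print(f"{name:>10}: ~{weight_memory_gb(PARAMS, nbytes):.0f} GB (weights only)")
```

At 16-bit precision the weights alone exceed a single 24GB card, so local deployment on one consumer GPU would likely need a quantized variant or multi-GPU sharding.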

Sources

1. open-source coding model that rivals OpenAI's o3-mini and o1 on coding tasks
2. costing approximately $26,880 to train
3. achieves a 60.6% score on LiveCodeBench and 1936 on CodeForces, performing on par with o3-mini (low) and o1 on competition-level coding tasks
4. The model, training code, dataset, and a detailed blog are available
5. trained using an open-source RL framework from ByteDance

Dev Signal
