Ornith 1.0 397B
Overview
Ornith-1.0-397B is an open-source 397B-parameter mixture-of-experts (MoE) reasoning model for agentic coding. It is post-trained on Qwen 3.5 MoE via a self-improving RL framework that jointly learns task solutions and the scaffolds guiding them. It achieves 82.4 on SWE-Bench Verified and 77.5 on Terminal-Bench 2.1. The model is MIT licensed and supports tool calling and a 256K context window.
About DeepReinforce
DeepReinforce is an AI research startup founded by Dr. Jiwei Li, focused on using reinforcement learning to build agentic AI systems for coding and system optimization. The company developed GrandCode (ranked #1 in Codeforces live competitions, beating all human grandmasters), Ornith-1.0 (open-source LLMs for agentic coding, 9B to 397B parameters), and IterX (an agentic code optimizer surpassing NVIDIA's cuBLAS).