Ornith 1.0 9B
Overview
Open-source 9B-parameter language model specialized for agentic coding tasks. Post-trained on Qwen 3.5 using a self-improving RL framework that jointly learns to generate solutions and task-specific scaffolds. Among comparable models, achieves state-of-the-art results on SWE-bench Verified (69.4%) and Terminal-Bench 2.1. MIT licensed.
About DeepReinforce
DeepReinforce is an AI research startup founded by Dr. Jiwei Li, focused on using reinforcement learning to build agentic AI systems for coding and system optimization. They developed GrandCode (ranked #1 in Codeforces live competitions, beating all human grandmasters), Ornith-1.0 (open-source LLMs for agentic coding, 9B–397B parameters), and IterX (agentic code optimizer surpassing NVIDIA's cuBLAS).