Ornith 1.0 35B
Overview
Ornith-1.0-35B is a 35B Mixture-of-Experts reasoning model for agentic coding, post-trained on Qwen 3.5 using a self-improving RL framework that jointly learns solution rollouts and the task-specific scaffolds guiding them. Supports native function calling and 262K context. Scores 75.6 on SWE-Bench Verified and 64.2 on Terminal-Bench 2.1. MIT licensed.
About DeepReinforce
DeepReinforce is an AI research startup founded by Dr. Jiwei Li, focused on using reinforcement learning to build agentic AI systems for coding and system optimization. They developed GrandCode (ranked #1 in Codeforces live competitions, beating all human grandmasters), Ornith-1.0 (open-source LLMs for agentic coding, 9Bโ397B parameters), and IterX (agentic code optimizer surpassing NVIDIA's cuBLAS).
View Company Profile