Ornith 1.0 35B

Ornith 1.0 35B

Model family: Ornith

Ornith-1.0-35B is the mid-tier model in the Ornith-1.0 family, a 35B MoE architecture post-trained on Qwen 3.5. It is a reasoning model that produces explicit think traces before final answers and emits well-formed function calls via the tool_calls field. The core innovation is a self-improving RL training framework: rather than relying on fixed human-designed harnesses, the model co-learns solution rollouts and the task-specific scaffolds that guide them. Reward is propagated to both stages, enabling per-task orchestration strategies to emerge automatically. The context window is 262,144 tokens. On agentic coding benchmarks, it scores 75.6 on SWE-Bench Verified, 50.4 on SWE-Bench Pro, 69.3 on SWE-Bench Multilingual, and 64.2 on Terminal-Bench 2.1, outperforming Qwen 3.5-397B on Terminal-Bench 2.1 despite being 10x smaller. Compatible with vLLM, SGLang, and standard Transformers libraries. MIT licensed.

Overview

Ornith-1.0-35B is a 35B Mixture-of-Experts reasoning model for agentic coding, post-trained on Qwen 3.5 using a self-improving RL framework that jointly learns solution rollouts and the task-specific scaffolds guiding them. Supports native function calling and 262K context. Scores 75.6 on SWE-Bench Verified and 64.2 on Terminal-Bench 2.1. MIT licensed.

‍💻Code generation 🤔Logical reasoning 💻Vibe coding 🤖Ai agents

About DeepReinforce

DeepReinforce is an AI research startup founded by Dr. Jiwei Li, focused on using reinforcement learning to build agentic AI systems for coding and system optimization. They developed GrandCode (ranked #1 in Codeforces live competitions, beating all human grandmasters), Ornith-1.0 (open-source LLMs for agentic coding, 9B–397B parameters), and IterX (agentic code optimizer surpassing NVIDIA's cuBLAS).

Industry: Artificial Intelligence

Location: US

Website: deep-reinforce.com

View Company Profile

Other models from this family

View all models from this family

Last updated: July 17, 2026

Go to section

Search

Overview

About DeepReinforce

Other models from this family

Related Models

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: