DeepSeekMath V2
Overview
DeepSeek-Math-V2 is a math-specialized LLM built on DeepSeek-V3.2-Exp-Base, trained to generate and verify step-by-step proofs. It uses a learned verifier as a reward model so the generator learns to fix its own reasoning, reaching gold-level scores on contests like IMO 2025, CMO 2024, and near-perfect Putnam 2024 with scaled test-time compute.
Description
About DeepSeek
DeepSeek is a Chinese AI firm specializing in large language models, based in Hangzhou.
