Phi 4 reasoning plus

Overview

Phi-4-reasoning-plus is an open-weight reasoning model derived from Phi-4, trained with chain-of-thought supervised fine-tuning and extra reinforcement learning on high-quality synthetic and filtered public-domain data (math, science, coding) with safety alignment. It delivers stronger reasoning in a small model, at the cost of ~50% longer outputs and higher latency.

Description

Phi-4-reasoning-plus is a state-of-the-art open-weight reasoning model finetuned from Phi-4 using supervised fine-tuning on a dataset of chain-of-thought traces and reinforcement learning. The supervised fine-tuning dataset includes a blend of synthetic prompts and high-quality filtered data from public domain websites, focused on math, science, and coding skills as well as alignment data for safety and Responsible AI. The goal of this approach was to ensure that small capable models were trained with data focused on high quality and advanced reasoning. Phi-4-reasoning-plus has been trained additionally with Reinforcement Learning, hence, it has higher accuracy but generates on average 50% more tokens, thus having higher latency.

About Microsoft

Microsoft is a technology company that offers a wide range of software, cloud computing services, hardware, and artificial intelligence solutions.

Industry: Software Development

Company Size: 10001+

Location: Redmond, Washington, US

Website: microsoft.com

View Company Profile

Related Models

Last updated: October 15, 2025

Overview

Description

About Microsoft

Related Models

DBRX Instruct

VibeThinker-1.5B

GLM 4.5

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool