Phi 4 reasoning

Phi 4 reasoning

Model family: Phi

Phi-4-reasoning is a state-of-the-art open-weight reasoning model finetuned from Phi-4 using supervised fine-tuning on a dataset of chain-of-thought traces and reinforcement learning. The supervised fine-tuning dataset includes a blend of synthetic prompts and high-quality filtered data from public domain websites, focused on math, science, and coding skills as well as alignment data for safety and Responsible AI. The goal of this approach was to ensure that small capable models were trained with data focused on high quality and advanced reasoning.

Overview

Phi-4-reasoning is an open-weight model fine-tuned from Phi-4 with chain-of-thought SFT and reinforcement learning, trained on high-quality synthetic and filtered public-domain data (math, science, coding) plus safety alignment—aimed at delivering strong reasoning in a compact model.

🚀Productivity 📊Pitch decks 💻Coding

About Microsoft

Microsoft is a technology company that offers a wide range of software, cloud computing services, hardware, and artificial intelligence solutions.

Industry: Technology, Information and Internet

Company Size: 228000+

Location: Redmond, Washington, US

Website: microsoft.com

View Company Profile

Tools using Phi 4 reasoning

No tools found for this model yet.

Last updated: February 26, 2026

Search

Overview

About Microsoft

Other models from this family

Tools using Phi 4 reasoning

Related Models

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: