TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

Phi 4 reasoning

Model family: Phi
Phi-4-reasoning is a state-of-the-art open-weight reasoning model finetuned from Phi-4 using supervised fine-tuning on a dataset of chain-of-thought traces and reinforcement learning. The supervised fine-tuning dataset includes a blend of synthetic prompts and high-quality filtered data from public domain websites, focused on math, science, and coding skills as well as alignment data for safety and Responsible AI. The goal of this approach was to ensure that small capable models were trained with data focused on high quality and advanced reasoning.
New Text Gen 7
Released: April 30, 2025

Overview

Phi-4-reasoning is an open-weight model fine-tuned from Phi-4 with chain-of-thought SFT and reinforcement learning, trained on high-quality synthetic and filtered public-domain data (math, science, coding) plus safety alignment—aimed at delivering strong reasoning in a compact model.

About Microsoft

Microsoft is a technology company that offers a wide range of software, cloud computing services, hardware, and artificial intelligence solutions.

Industry: Software Development
Company Size: 228000+
Location: Redmond, Washington, US
View Company Profile

Tools using Phi 4 reasoning

No tools found for this model yet.

Last updated: February 3, 2026
0 AIs selected
Clear selection
#
Name
Task