Models

💻Coding 🔍Code reviews ✉️Codebase Q&A

MiMo V2.5 Pro UltraSpeed

By Xiaomi

MiMo-V2.5-Pro-UltraSpeed is Xiaomi MiMo’s 1T-parameter high-speed reasoning model variant built with TileRT to reach 1000+ tokens per second generation on commodity GPU infrastructure.

NewText

Released 2d ago

🤖Agents 🤖Process automations 🔎Problem solving 🤔Logical reasoning

Aion 1.0 Plan

By Microsoft

Aion 1.0 Plan is Microsoft’s 14B on-device reasoning and tool-calling model for fully local agentic workflows on capable Windows devices.

NewMultimodal

Released 2d ago

🔍Data extraction 🔍Image interpretation 📜OCR

LFM2.5 VL 1.6B Extract

By Liquid AI

LFM2.5-VL-1.6B-Extract is Liquid AI’s larger vision-language extraction model for image-to-JSON structured field extraction.

NewMultimodal

Released 2d ago

🔍Code analysis 🔍Data extraction

Cassini 1.0

By LatentForce

Cassini-1.0 is LatentForce’s 4B code-intelligence model for extracting structured symbolic metadata from Python, JavaScript, and TypeScript source files.

NewText

Released 3d ago

💬Chatting 🈯️Japanese content generation

LFM 2.5 1.2B JP 202606

By Liquid AI

LFM2.5-1.2B-JP-202606 is Liquid AI’s 1.17B Japanese-focused text model for chat, instruction following, math, code, tool use, and bilingual English-Japanese assistant workflows.

NewText

Released 4d ago

🎬Video editing 📽️Image to video 🎥Video effects

P Video Replace

By Pruna AI

P-Video-Replace is Pruna AI’s video-replacement model for inserting referenced people into a source video while preserving realistic motion and audio.

NewVideo

Released 5d ago

💬Chatting 🔄Text rewriting 🔍Intent recognition

Aion 1.0 Instruct

By Microsoft

Aion 1.0 Instruct is Microsoft’s on-device small language model for local text intelligence such as summarization, rewriting, intent handling, and accessibility features on Windows.

NewText

Released 5d ago

💬Chatting 🔎Problem solving 🤖Agents

Nemotron 3 Ultra 550B A55B NVFP4

By NVIDIA

NVIDIA Nemotron-3-Ultra-550B-A55B-NVFP4 is NVIDIA’s open 550B total, 55B active frontier reasoning LLM for agentic workflows, long-context analysis, tool use, code, math, and science.

NewText

Released 6d ago

🔊Text to speech 🗣️Voice cloning 🎙️Voiceovers 🗣Dialogue generation

Higgs Audio v3 TTS

By Boson AI

Higgs Audio v3 TTS is Boson AI’s text-to-speech model for expressive conversational voice agents across 100+ languages with zero-shot voice cloning and inline speech controls.

NewAudio

Released 6d ago

Magenta RealTime 2

By Google DeepMind

Magenta RealTime 2 is Google DeepMind’s open on-device live music-generation model for low-latency, continuously controllable music from text, audio examples, and MIDI.

🎵Music 🎵Music production

NewAudio

Released 6d ago

📷Images 🎨Illustrations 🎨Graphic design

Krea 2 Turbo

By Krea.ai

Krea 2 Turbo is Krea’s faster image-generation model variant for producing high-quality Krea 2 images in about 2 seconds.

NewImage

Released 7d ago

🔊Text to speech 🗣️Voice cloning 🎙️Voiceovers 🗣Dialogue generation

Miso TTS 8B

By Miso Labs

MisoTTS is Miso Labs’ open-weight 8B text-and-audio-conditioned speech generation model for expressive, context-aware, emotive TTS and dialogue voice output.

NewAudio

Released 7d ago

📷Images 🖌️Image editing 🎨Graphic design 🔲Background removal ⚖️Character consistency 🔤Typography

Ideogram 4

By Ideogram AI

Ideogram 4.0 is Ideogram’s open-weight image-generation model for design work, multilingual text rendering, precise layout control, editable elements, and realistic 2K images.

NewMultimodal

Released 7d ago

🔊Text to speech 🗣️Voice cloning 🎙️Voiceovers

MAI Voice 2 Flash

By Microsoft

MAI-Voice-2-Flash is Microsoft AI’s upcoming lower-cost, ultra-efficient variant of MAI-Voice-2 for speech generation.

NewMultimodal

Released 8d ago