- Gemini Audio is Google DeepMind’s closed-source native audio model family for low-latency live dialogue, controllable speech generation, audio understanding, and voice-first applications. (New, Audio, released 1d ago)
- By Xiaomi: MiMo-V2.5-Pro-UltraSpeed is Xiaomi MiMo’s 1T-parameter high-speed reasoning model variant built with TileRT to reach 1000+ tokens per second generation on commodity GPU infrastructure. (New, Text, released 2d ago)
- By Microsoft: Aion 1.0 Plan is Microsoft’s 14B on-device reasoning and tool-calling model for fully local agentic workflows on capable Windows devices. (New, Multimodal, released 2d ago)
- By Liquid AI: LFM2.5-VL-1.6B-Extract is Liquid AI’s larger vision-language extraction model for image-to-JSON structured field extraction. (New, Multimodal, released 2d ago)
- By LatentForce: Cassini-1.0 is LatentForce’s 4B code-intelligence model for extracting structured symbolic metadata from Python, JavaScript, and TypeScript source files. (New, Text, released 3d ago)
- By Liquid AI: LFM2.5-1.2B-JP-202606 is Liquid AI’s 1.17B Japanese-focused text model for chat, instruction following, math, code, tool use, and bilingual English-Japanese assistant workflows. (New, Text, released 4d ago)
- By Pruna AI: P-Video-Replace is Pruna AI’s video-replacement model for inserting referenced people into a source video while preserving realistic motion and audio. (New, Video, released 5d ago)
- By Microsoft: Aion 1.0 Instruct is Microsoft’s on-device small language model for local text intelligence such as summarization, rewriting, intent handling, and accessibility features on Windows. (New, Text, released 5d ago)
- By NVIDIA: NVIDIA Nemotron-3-Ultra-550B-A55B-NVFP4 is NVIDIA’s open 550B total, 55B active frontier reasoning LLM for agentic workflows, long-context analysis, tool use, code, math, and science. (New, Text, released 6d ago)
- By Boson AI: Higgs Audio v3 TTS is Boson AI’s text-to-speech model for expressive conversational voice agents across 100+ languages with zero-shot voice cloning and inline speech controls. (New, Audio, released 6d ago)
- Magenta RealTime 2 is Google DeepMind’s open on-device live music-generation model for low-latency, continuously controllable music from text, audio examples, and MIDI. (New, Audio, released 6d ago)
- By Krea.ai: Krea 2 Turbo is Krea’s faster image-generation model variant for producing high-quality Krea 2 images in about 2 seconds. (New, Image, released 7d ago)
- By Miso Labs: MisoTTS is Miso Labs’ open-weight 8B text-and-audio-conditioned speech generation model for expressive, context-aware, emotive TTS and dialogue voice output. (New, Audio, released 7d ago)
- By Ideogram AI: Ideogram 4.0 is Ideogram’s open-weight image-generation model for design work, multilingual text rendering, precise layout control, editable elements, and realistic 2K images. (New, Multimodal, released 7d ago)
- By Microsoft: MAI-Voice-2-Flash is Microsoft AI’s upcoming lower-cost, ultra-efficient variant of MAI-Voice-2 for speech generation. (New, Multimodal, released 8d ago)
- By Microsoft: MAI-Voice-2 is Microsoft AI’s speech generation model for natural-sounding voice output across 15 languages with short-sample voice adaptation. (New, Multimodal, released 8d ago)
- By Microsoft: MAI Transcribe-1.5 is Microsoft AI’s transcription model for accurate, fast, domain-specific speech-to-text across 43 languages. (New, Multimodal, released 8d ago)
- By Microsoft: MAI-Image-2.5 Flash is Microsoft AI’s ultra-efficient variant of MAI-Image-2.5 for lower-cost image generation and editing. (New, Image, released 8d ago)
- By Microsoft: MAI-Image-2.5 is Microsoft AI’s image model for text-to-image generation and image editing with strong design-ready output quality. (New, Image, released 8d ago)
- By Microsoft: MAI-Code-1-Flash is Microsoft AI’s 5B inference-efficient agentic coding model integrated into GitHub Copilot, VS Code, and the Microsoft stack. (New, Coding, released 8d ago)
- By Microsoft: MAI-Thinking-1 is Microsoft AI’s flagship reasoning model for complex reasoning, software engineering, math, and high-difficulty problem solving. (New, Multimodal, released 8d ago)
