-
By BaiduPP-OCRv6 is PaddlePaddle/Baidu’s lightweight universal OCR system for multilingual text detection and recognition across edge, mobile, desktop, and server deployments.NewMultimodalReleased 1d ago
-
By Z.aiGLM-5.2 is Z.AI’s high-capability model for complex reasoning, large-scale engineering, agentic coding, and difficult long-horizon development tasks.NewMultimodalReleased 3d ago
-
By Zyphra AIZONOS2 is Zyphra’s open-source real-time text-to-speech model with MoE architecture, high-fidelity zero-shot voice cloning, and multilingual expressive speech generation.NewMultimodalReleased 4d ago
-
By PhotoroomPRXPixel-T2I is Photoroom’s Apache 2.0 pixel-space text-to-image model that denoises raw RGB directly instead of using a VAE.NewMultimodalReleased 4d ago
-
By World LabsFlex4DHuman is a multi-view video diffusion model that turns monocular or sparse multi-view videos of dynamic subjects into synchronized dense multi-view videos for 4D human reconstruction.NewMultimodalReleased 5d ago
-
By GoogleDiffusionGemma is Google’s open experimental 26B MoE text-diffusion model for faster local text generation, parallel editing, code infilling, and non-linear text workflows.NewMultimodalReleased 6d ago
-
DeCAF-Pearl is Genesis Molecular AI’s distilled all-atom cofolding model for faster protein-ligand structure prediction and molecular-design screening.New3dReleased 6d ago
-
Rhaister is Tahoe Bio’s open model for predicting cellular drug and perturbation responses from aggregated single-cell summary statistics instead of raw single-cell data.NewStructured DataReleased 6d ago
-
By SimpleDirectflash-1-mini is SimpleDirect’s open-weight 4B bilingual legal AI model for Canadian legal citation accuracy, instruction following, and English-French legal or compliance workflows.NewMultimodalReleased 7d ago
-
By AnthropicClaude Mythos 5 is Anthropic’s restricted-access version of the same underlying model as Claude Fable 5, designed for trusted cyberdefense, infrastructure security, and selected high-impact research use.NewMultimodalReleased 7d ago
-
By AnthropicClaude Fable 5 is Anthropic’s generally available Mythos-class Claude model for frontier reasoning, software engineering, knowledge work, vision, long-horizon tasks, and scientific research, with added safeguards for high-risk domains.NewMultimodalReleased 7d ago
-
By CohereNorth Mini Code 1.0 is Cohere’s Apache 2.0 agentic coding model with 30B total parameters and 3B active parameters, built for repo-level software engineering, terminal agents, local coding, and code generation.NewCodingReleased 7d ago
-
Gemini Audio is Google DeepMind’s closed-source native audio model family for low-latency live dialogue, controllable speech generation, audio understanding, and voice-first applications.NewAudioReleased 7d ago
-
By XiaomiMiMo-V2.5-Pro-UltraSpeed is Xiaomi MiMo’s 1T-parameter high-speed reasoning model variant built with TileRT to reach 1000+ tokens per second generation on commodity GPU infrastructure.NewTextReleased 8d ago
-
By MicrosoftAion 1.0 Plan is Microsoft’s 14B on-device reasoning and tool-calling model for fully local agentic workflows on capable Windows devices.NewMultimodalReleased 8d ago
-
By Liquid AILFM2.5-VL-1.6B-Extract is Liquid AI’s larger vision-language extraction model for image-to-JSON structured field extraction.NewMultimodalReleased 8d ago
-
By LatentForceCassini-1.0 is LatentForce’s 4B code-intelligence model for extracting structured symbolic metadata from Python, JavaScript, and TypeScript source files.NewTextReleased 9d ago
-
By Liquid AILFM2.5-1.2B-JP-202606 is Liquid AI’s 1.17B Japanese-focused text model for chat, instruction following, math, code, tool use, and bilingual English-Japanese assistant workflows.NewTextReleased 10d ago
-
By Pruna AIP-Video-Replace is Pruna AI’s video-replacement model for inserting referenced people into a source video while preserving realistic motion and audio.NewVideoReleased 11d ago
-
By MicrosoftAion 1.0 Instruct is Microsoft’s on-device small language model for local text intelligence such as summarization, rewriting, intent handling, and accessibility features on Windows.NewTextReleased 11d ago
-
By NVIDIANVIDIA Nemotron-3-Ultra-550B-A55B-NVFP4 is NVIDIA’s open 550B total, 55B active frontier reasoning LLM for agentic workflows, long-context analysis, tool use, code, math, and science.NewTextReleased 12d ago
MongoDB - Build AI That Scales
