-
By AnthropicClaude Sonnet 5 is Anthropic's large language model, the most agentic Sonnet released so far. It plans autonomously, uses tools such as browsers and terminals, and sustains long multi step coding and computer use tasks. Its agentic performance on reasoning, tool use, and coding approaches Opus 4.8 while costing less, and it supports selectable effort levels for balancing speed, cost, and capability.NewMultimodalReleased 1d ago
-
By MeituanLongCat-2.0 is an open-source MoE language model with 1.6 trillion total parameters (~48B activated per token) and a 1M-token context window. It introduces LongCat Sparse Attention for efficient long-context processing and was pretrained on 35+ trillion tokens using AI ASIC hardware. Strong focus on coding, agentic, and reasoning tasks.NewTextReleased 1d ago
-
Ornith-1.0-397B-FP8 is DeepReinforce's open-source 397B MoE agentic coding model with FP8 quantization. Built on Qwen 3.5 and RL-trained to jointly optimize scaffolds and solutions, it achieves state-of-the-art performance on Terminal-Bench 2.1, SWE-Bench, and NL2Repo. Supports reasoning, tool use, and 262K context. MIT licensed.NewTextReleased 1d ago
-
By Wix.comBase 1 is a model trained by Base44 specifically for building web applications inside the Base44 platform. Trained on millions of real building sessions, it is designed not only to write code but to make product decisions, anticipate user needs, and act as a technical partner throughout the app creation process.NewCodingReleased 2d ago
-
End-to-end deep learning pipeline that decodes text in real time from non-invasive magnetoencephalography (MEG) brain recordings, requiring no surgical implant. Trained on approximately 22,000 sentences from nine participants, it achieves 61% word accuracy -- far above the 8% benchmark for other non-invasive methods -- by fine-tuning large language models on raw neural signals.NewMultimodalReleased 2d ago
-
Rampart is a 14.7 MB quantized ONNX token-classification model for client-side PII detection and redaction. Fine-tuned from MiniLM-L6-H384 with a 35-label BIO head covering 17 entity types, it supports 7 Latin-script languages. Designed to run in browsers via ONNX Runtime Web and Transformers.js at 6.6 ms p50 latency.NewTextReleased 2d ago
-
A 4B parameter instruction-tuned language model created via pre-training distillation from the Apertus-8B teacher model. Uses a dense transformer decoder with grouped-query attention and xIELU activations, trained on 1.7T tokens. Natively supports 1811 languages with quantized checkpoints available for mobile and edge deployment. Apache-2.0 licensed.NewTextReleased 2d ago
-
Apertus-v1.1-4B is a 4B parameter open-source base language model created via pre-training distillation from the Apertus-8B teacher model. It uses a dense transformer with grouped-query attention and xIELU activations, natively supports 1,811 languages, and is trained on 1.7T high-quality tokens. Designed for constrained hardware and further fine-tuning.NewTextReleased 2d ago
-
Ornith-1.0-35B is a 35B Mixture-of-Experts reasoning model for agentic coding, post-trained on Qwen 3.5 using a self-improving RL framework that jointly learns solution rollouts and the task-specific scaffolds guiding them. Supports native function calling and 262K context. Scores 75.6 on SWE-Bench Verified and 64.2 on Terminal-Bench 2.1. MIT licensed.NewCodingReleased 4d ago
-
Open-source 9B-parameter language model specialized for agentic coding tasks. Post-trained on Qwen 3.5 using a self-improving RL framework that jointly learns to generate solutions and task-specific scaffolds. Achieves state-of-the-art results on SWE-bench Verified (69.4%) and Terminal-Bench 2.1 among comparable models. MIT licensed.NewTextReleased 4d ago
-
Ornith-1.0-397B is a 397B MoE open-source reasoning model for agentic coding, post-trained on Qwen 3.5 MoE via a self-improving RL framework that jointly learns task solutions and the scaffolds guiding them. Achieves 82.4 on SWE-Bench Verified and 77.5 on Terminal-Bench 2.1. MIT licensed with tool-calling and 256K context support.NewMultimodalReleased 4d ago
-
By OpenAIGPT 5.6 Terra is a balanced model in OpenAI's GPT 5.6 preview series. It offers performance competitive with GPT 5.5 while costing about half as much. Terra shows improved coding and cybersecurity capability that scales with reasoning effort, and is paired with a layered safety and misuse safeguard stack during its limited preview release.NewTextReleased 5d ago
-
By OpenAIGPT 5.6 Luna is the fast and affordable tier of OpenAI's GPT 5.6 preview series. It is the lowest cost model in the family while still delivering strong capability, including cybersecurity gains that scale with reasoning effort. Luna is released as part of a limited, safeguard paired preview alongside the Sol and Terra models.NewTextReleased 5d ago
-
By OpenAIGPT-5.6 Sol is OpenAI's flagship model in the GPT-5.6 series. It achieves state-of-the-art performance on coding (Terminal-Bench 2.1), biology (GeneBench v1), and cybersecurity (ExploitBench) benchmarks. Features a new max reasoning effort setting and ultra mode for multi-agent parallelism. Priced at $5/$30 per 1M input/output tokens, currently in limited preview.NewTextReleased 5d ago
-
Ornith-1.0-35B-FP8 is an FP8-quantized 35B-parameter mixture-of-experts language model specialized for agentic coding. Built on Qwen3.5 MoE, it is trained with a self-improving RL framework that jointly learns to solve coding tasks and generate the scaffolds guiding those solutions. MIT-licensed and available for self-hosting via vLLM, SGLang, and Transformers.NewTextReleased 5d ago
-
By Liquid AILFM2.5-230M is a 230M-parameter lightweight language model built on the LFM2 architecture, designed for fast inference across edge and cloud environments. Pre-trained on 19T tokens with a 32K context window, it targets tool use, data extraction, and agentic workflows. Runs on CPUs including Raspberry Pi 5 and mobile SoCs. Not recommended for math, code generation, or creative writing.NewTextReleased 6d ago
-
By ByteDanceSeed Audio 1.0 is ByteDance's universal audio generation model that creates voice, music, sound effects, and ambient soundscapes from text prompts. It supports zero-shot voice cloning from short audio references, multi-character dialogue generation in a single pass, and cross-lingual synthesis without fine-tuning. Accessible via Volcano Engine API.NewAudioReleased 8d ago
-
By AlibabaQwen-AgentWorld-35B-A3B is a 35B Mixture-of-Experts language world model with 3B active parameters, built on Qwen3.5-35B-A3B-Base. It natively simulates seven agent interaction domains: MCP tool calling, search, terminal, software engineering, Android, web, and game environments. Trained end-to-end via CPT to SFT to RL. Supports 262,144-token context and uses thinking mode by default.NewMultimodalReleased 8d ago
-
By Mistral AIMistral OCR 4 extracts and structures content from PDF, DOC, PPT, and OpenDocument files, returning text alongside bounding boxes, typed block classification (titles, tables, equations, signatures), and inline confidence scores. Supports 170 languages across 10 language groups. Deployable via API or self-hosted in a single container for data-sovereignty compliance.NewMultimodalReleased 8d ago
-
By Krea.aiKrea 2 Raw is the base pre-training checkpoint of the Krea 2 text-to-image diffusion model, featuring a Diffusion Transformer architecture with 12 billion parameters. Not optimized for direct inference, it is intended for fine-tuning, LoRA training, or post-training workflows.NewImageReleased 9d ago
-
By AlibabaHappyHorse-1.1-T2V is a text-to-video generation model with improved semantic understanding, cinematic shot control, and dynamic motion rendering. Produces 720P or 1080P video outputs with smooth motion, rich detail, strong visual consistency, and natural character actions, scene atmosphere, and physical dynamics.NewVideoReleased 9d ago
