AI Models Directory
Browse and discover AI models from leading companies in the industry.
-
Trinity Large Preview is a 400B-parameter (about 13B active) frontier Trinity MoE model with long-context comprehension, tuned for strong reasoning, coding and multi-step agents, served via hosted preview APIs. New · Text · Released 4d ago
-
Trinity Mini is a 26B-parameter (3B active) Trinity MoE model with 128K context, tuned for strong reasoning, function calling and multi-step agents while remaining efficient for enterprise backends. New · Text · Released 1mo ago
-
Trinity Nano Base is the base 6B (1B active) Trinity Nano checkpoint before fine-tuning, meant to be domain-tuned rather than used directly for chat, trained on about 10T tokens and released under Apache-2.0. New · Text · Released 1mo ago
-
By Alibaba: Multilingual forced alignment model that aligns speech and transcripts in 11 languages, predicting timestamps for arbitrary units in up to 5 minutes of audio with accuracy surpassing previous end-to-end aligners. New · Audio · Released 2d ago
-
Second Abstraction and Reasoning Corpus for AGI, with 1,000 training and 120 evaluation grid tasks plus private test sets, designed to test fluid intelligence and program-synthesis-style reasoning. Text · Released 10mo ago
-
Python toolkit exposing a unified Arcade API for ARC-AGI-3 puzzle environments, letting agents create, step, and render games and manage scorecards locally or against the ARC Prize online service. Text · Released 7y ago
-
By DeepSeek: Second-generation DeepSeek OCR model, “Visual Causal Flow,” aimed at more human-like visual encoding, with dynamic resolution support and strong document-to-Markdown and layout-aware OCR for images and PDFs. New · Text · Released 4d ago
-
By DeepSeek: LLM-centric OCR model using “Contexts Optical Compression” to explore visual-text compression and provide fast streaming and batch OCR for images and PDFs via vLLM and Transformers runtimes. Text · Released 3mo ago
-
FIBO is Bria’s 8B-parameter, JSON-native text-to-image model built for deterministic, hyper-controllable and legally safe image generation, using long structured captions to give precise, repeatable control over lighting, camera, color and layout for enterprise workflows. New · Image · Released 2mo ago
-
mxbai-embed-large-v1 is Mixedbread’s 340M English-only sentence embedding model that maps text into 1024-dim vectors, trained on 700M+ pairs and 30M triplets, delivering state-of-the-art MTEB performance for retrieval, RAG, clustering and classification. Text · Released 1y ago
-
By Alibaba: Qwen3-ASR-Flash is Alibaba Qwen’s all-in-one speech recognition model, built on Qwen3-Omni to stream low-latency transcripts in 11 languages, robust to noise, music and code switching, with optional text prompts to bias recognition. Text · Released 4mo ago
-
By Pixverse: PixVerse V3.5 is an ImagineArt-exclusive PixVerse model that turns text or images into 1080p cinematic videos with ultra-fast generation, rich styles, precise start/end frame control, flexible aspect ratios and advanced camera motion. Video · Released 1y ago
-
By Pixverse: PixVerse V3 is a major update released on Oct 29, 2024, adding smarter prompt understanding, support for many aspect ratios, more styles and a powerful lipsync feature with special effects and one-click video extension. Video · Released 1y ago
-
By Pixverse: PixVerse V2.5 extends V2 with longer videos, character-to-video generation, Magic Brush motion control, rich camera moves and 4K upscaling, launched in August 2024 as a higher-quality text- and image-to-video model. Video · Released 1y ago
-
By Pixverse: PixVerse V2 is an AI text- and image-to-video model that creates up to 8-second clips with higher detail, enhanced motion, multiple styles and strong character consistency, plus one-click video extension and a large effect library. Video · Released 1y ago
-
By Pixverse: PixVerse V4 is ImagineArt’s AI image- and text-to-video model that turns stills or prompts into 5-second clips with lifelike physics, smooth motion and cinematic effects, suited to viral hugs, body morphs and object-to-robot transformations. Video · Released 11mo ago
-
By Pixverse: PixVerse V5 is an advanced text- and image-to-video model on ImagineArt that generates cinematic clips with highly realistic motion, keyframe control, and camera and style customization, plus fusion modes to extend or remix existing footage. Video · Released 5mo ago
-
By Pixverse: PixVerse V5.5 is PixVerse’s audio-visual text- and image-to-video model that generates 5-10 s 1080p multi-shot clips with native speech, music and SFX, improved motion stability and multi-shot camera control for story-driven, lip-synced short videos. New · Video · Released 1mo ago
-
By Pixverse: PixVerse V5.6 is PixVerse’s latest video model, upgrading V5.5 with cinema-level visuals, more natural multilingual voices, smoother physics-aware motion and less warping, while keeping generation speed and cost roughly the same as earlier V5 models. New · Video · Released 5d ago
