Papers

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

ByteDance

Published on: 2025-04-11 1 author
PixelFlow: Pixel-Space Generative Models with Flow

Adobe / The University of Hong Kong

Published on: 2025-04-10 5 authors
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

ByteDance

Published on: 2025-04-10 1 author
SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

Cisco Systems / The Ohio State University

Published on: 2025-04-09 11 authors
Gemini: A Family of Highly Capable Multimodal Models

Google / Google DeepMind

Published on: 2025-04-09 1 author
SmolVLM: Redefining small and efficient multimodal models

Hugging Face / Stanford University

Published on: 2025-04-07 1 author
One-Minute Video Generation with Test-Time Training

NVIDIA / Stanford University

Published on: 2025-04-07 1 author
Data Scaling Laws for End-to-End Autonomous Driving

NVIDIA / New York University, Stanford University, University of Toronto

Published on: 2025-04-06 10 authors
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

MiniMax / Shanghai Jiao Tong University

Published on: 2025-04-04 1 author
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding

Together AI / Carnegie Mellon University

Published on: 2025-04-02 1 author
A Systematic Survey of Automatic Prompt Optimization Techniques

Amazon

Published on: 2025-04-02 1 author
Scaling Language-Free Visual Representation Learning

Meta Platforms / New York University, Princeton University

Published on: 2025-04-01 11 authors
Large Language Models Pass the Turing Test

Published on: 2025-03-31 2 authors
XAMBA: SSMs on Edge NPUs

Intel / Purdue University

Published on: 2025-03-31 1 author
On the Biology of a Large Language Model

Anthropic

Published on: 2025-03-27 1 author
Alibaba LingmaAgent: Improving Automated Issue Resolution via Comprehensive Repository Exploration

Alibaba

Published on: 2025-03-26 1 author
Qwen2.5-Omni Technical Report

Alibaba

Published on: 2025-03-26 1 author
Neuromorphic Principles for Efficient Large Language Models on Intel Loihi 2

Intel / University of California

Published on: 2025-03-25 1 author
ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback

Snowflake

Published on: 2025-03-25 1 author
Gemma 3 Technical Report

Google

Published on: 2025-03-25 1 author
Debunking the CUDA Myth Towards GPU-based AI Systems

Intel / Korea Advanced Institute of Science & Technology

Published on: 2025-03-22
The Amazon Nova Family of Models: Technical Report and Model Card

Amazon

Published on: 2025-03-17 1 author
StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error

Z.ai / Tsinghua University

Published on: 2025-03-13 1 author
Long Context Tuning for Video Generation

ByteDance / The Chinese University of Hong Kong

Published on: 2025-03-13 1 author
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Cohere

Published on: 2025-03-12 1 author
HunyuanVideo: A Systematic Framework For Large Video Generative Models

Tencent

Published on: 2025-03-11 1 author
Learning to Search Effective Example Sequences for In-Context Learning

Intuit

Published on: 2025-03-11 1 author
Gemini Embedding: Generalizable Embeddings from Gemini

Google

Published on: 2025-03-10 1 author
Aya Vision: Expanding the worlds AI can see

Cohere

Published on: 2025-03-04 1 author
OWLViz: An Open-World Benchmark for Visual Question Answering

Adobe

Published on: 2025-03-04 1 author
Towards Statistical Factuality Guarantee for Large Vision-Language Models

Intuit / Vanderbilt University

Published on: 2025-02-27 1 author
Evaluating Nova 2.0 Lite model under Amazon’s Frontier Model Safety Framework

Amazon

Published on: 2025-02-27 1 author
AI-Instruments: Embodying Prompts as Instruments to Abstract & Reflect Graphical Interface Commands as General-Purpose Tools

Microsoft

Published on: 2025-02-26 1 author
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models

Amazon

Published on: 2025-02-25 1 author
Muon is Scalable for LLM Training

Moonshot AI

Published on: 2025-02-24 1 author
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Meta Platforms / University of California

Published on: 2025-02-20 1 author
AToken: A Unified Tokenizer for Vision

Apple

Published on: 2025-02-19 1 author
Qwen2.5-VL Technical Report

Alibaba

Published on: 2025-02-19 1 author
MoBA: Mixture of Block Attention for Long-Context LLMs

Moonshot AI / Tsinghua University

Published on: 2025-02-18 1 author
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Published on: 2025-02-17 4 authors
GraNNite: Enabling High-Performance Execution of Graph Neural Networks on Resource-Constrained Neural Processing Units

Intel

Published on: 2025-02-13 1 author
Reviving The Classics: Active Reward Modeling in Large Language Model Alignment

ByteDance

Published on: 2025-02-04 1 author
s1: Simple test-time scaling

Published on: 2025-01-31 10 authors
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Anthropic / Safeguards Research Team

Published on: 2025-01-31 1 author
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling

DeepSeek

Published on: 2025-01-29 1 author
EmbeddingGemma: Powerful and Lightweight Text Representations

Google

Published on: 2025-01-24 1 author
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making

Amazon

Published on: 2025-01-19 1 author
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Z.ai / Tsinghua University

Published on: 2025-01-17 1 author
MiniMax-01: Scaling Foundation Models with Lightning Attention

MiniMax

Published on: 2025-01-14 1 author
PoAct: Policy and Action Dual-Control Agent for Generalized Applications

Z.ai / Central South University, Tsinghua University

Published on: 2025-01-13 1 author
