Papers

Long Context Tuning for Video Generation

ByteDance / The Chinese University of Hong Kong

Published on: 2025-03-13 1 author
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Cohere

Published on: 2025-03-12 1 author
HunyuanVideo: A Systematic Framework For Large Video Generative Models

Tencent

Published on: 2025-03-11 1 author
Learning to Search Effective Example Sequences for In-Context Learning

Intuit

Published on: 2025-03-11 1 author
Gemini Embedding: Generalizable Embeddings from Gemini

Google

Published on: 2025-03-10 1 author
Aya Vision: Expanding the worlds AI can see

Cohere

Published on: 2025-03-04 1 author
OWLViz: An Open-World Benchmark for Visual Question Answering

Adobe

Published on: 2025-03-04 1 author
Towards Statistical Factuality Guarantee for Large Vision-Language Models

Intuit / Vanderbilt University

Published on: 2025-02-27 1 author
Evaluating Nova 2.0 Lite model under Amazon’s Frontier Model Safety Framework

Amazon

Published on: 2025-02-27 1 author
AI-Instruments: Embodying Prompts as Instruments to Abstract & Reflect Graphical Interface Commands as General-Purpose Tools

Microsoft

Published on: 2025-02-26 1 author
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models

Amazon

Published on: 2025-02-25 1 author
Muon is Scalable for LLM Training

Moonshot AI

Published on: 2025-02-24 1 author
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Meta Platforms / University of California

Published on: 2025-02-20 1 author
AToken: A Unified Tokenizer for Vision

Apple

Published on: 2025-02-19 1 author
Qwen2.5-VL Technical Report

Alibaba

Published on: 2025-02-19 1 author
MoBA: Mixture of Block Attention for Long-Context LLMs

Moonshot AI / Tsinghua University

Published on: 2025-02-18 1 author
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Published on: 2025-02-17 4 authors
GraNNite: Enabling High-Performance Execution of Graph Neural Networks on Resource-Constrained Neural Processing Units

Intel

Published on: 2025-02-13 1 author
Reviving The Classics: Active Reward Modeling in Large Language Model Alignment

ByteDance

Published on: 2025-02-04 1 author
s1: Simple test-time scaling

Published on: 2025-01-31 10 authors
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Anthropic / Safeguards Research Team

Published on: 2025-01-31 1 author
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling

DeepSeek

Published on: 2025-01-29 1 author
EmbeddingGemma: Powerful and Lightweight Text Representations

Google

Published on: 2025-01-24 1 author
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making

Amazon

Published on: 2025-01-19 1 author
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Z.ai / Tsinghua University

Published on: 2025-01-17 1 author
MiniMax-01: Scaling Foundation Models with Lightning Attention

MiniMax

Published on: 2025-01-14 1 author
PoAct: Policy and Action Dual-Control Agent for Generalized Applications

Z.ai / Central South University, Tsinghua University

Published on: 2025-01-13 1 author
Agent Laboratory: Using LLM Agents as Research Assistants

Published on: 2025-01-08 10 authors
Retrieval-Augmented Generation with Graphs (GraphRAG)

Amazon / Michigan State University

Published on: 2025-01-08 1 author
Cosmos World Foundation Model Platform for Physical AI

NVIDIA

Published on: 2025-01-07 1 author
Titans: Learning to Memorize at Test Time

Google

Published on: 2024-12-31 1 author
Generative Video Propagation

Adobe / The Chinese University of Hong Kong

Published on: 2024-12-27 1 author
In Case You Missed It: ARC 'Challenge' Is Not That Challenging

Snowflake

Published on: 2024-12-23 1 author
Qwen2.5 Technical Report

Alibaba

Published on: 2024-12-19 1 author
Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference

Workday / Queen’s University

Published on: 2024-12-18 1 author
Alignment faking in large language models

Anthropic / New York University

Published on: 2024-12-18 1 author
How Often are Fingerprints Repeated in the Population? Expanding on Evidence from AI With the Birthday Paradox

Published on: 2024-12-17 2 authors
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

DeepSeek

Published on: 2024-12-13 1 author
VDB-GPDF: Online Gaussian Process Distance Field with VDB Structure

Google

Published on: 2024-12-12 1 author
pfl-research: simulation framework for accelerating research in Private Federated Learning

Apple

Published on: 2024-12-10 1 author
Frontier AI systems have surpassed the self-replicating red line

Published on: 2024-12-09 4 authors
InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention

Snap / University of California

Published on: 2024-12-09 1 author
Best-of-N Jailbreaking

Published on: 2024-12-04 10 authors
Creating realistic 3D shapes using generative AI

Massachusetts Institute of Technology

Published on: 2024-12-04 1 author
Commit0: Library Generation from Scratch

Cohere / Cornell University

Published on: 2024-12-02 1 author
ARTIST: Improving the Generation of Text-rich Images with Disentangled Diffusion Models and Large Language Models

Adobe / Duke University

Published on: 2024-12-02
Controlling Language and Diffusion Models by Transporting Activations

Apple

Published on: 2024-11-22 1 author
The Rise and Potential of Large Language Model Based Agents: A Survey

MIT

Published on: 2024-11-11 5 authors
Evaluating Cultural and Social Awareness of LLM Web Agents

Salesforce

Published on: 2024-10-30 1 author
SF-V: Single Forward Video Generation Model

Snap / Rutgers University

Published on: 2024-10-24 1 author
