Papers

Filter by company

ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback

Snowflake

Published on: 2025-03-25 1 author
Gemma 3 Technical Report

Google

Published on: 2025-03-25 1 author
Debunking the CUDA Myth Towards GPU-based AI Systems

Intel / Korea Advanced Institute of Science & Technology

Published on: 2025-03-22
The Amazon Nova Family of Models: Technical Report and Model Card

Amazon

Published on: 2025-03-17 1 author
StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error

Z.ai / Tsinghua University

Published on: 2025-03-13 1 author
Long Context Tuning for Video Generation

ByteDance / The Chinese University of Hong Kong

Published on: 2025-03-13 1 author
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Cohere

Published on: 2025-03-12 1 author
HunyuanVideo: A Systematic Framework For Large Video Generative Models

Tencent

Published on: 2025-03-11 1 author
Learning to Search Effective Example Sequences for In-Context Learning

Intuit

Published on: 2025-03-11 1 author
Gemini Embedding: Generalizable Embeddings from Gemini

Google

Published on: 2025-03-10 1 author
Aya Vision: Expanding the worlds AI can see

Cohere

Published on: 2025-03-04 1 author
OWLViz: An Open-World Benchmark for Visual Question Answering

Adobe

Published on: 2025-03-04 1 author
Towards Statistical Factuality Guarantee for Large Vision-Language Models

Intuit / Vanderbilt University

Published on: 2025-02-27 1 author
Evaluating Nova 2.0 Lite model under Amazon’s Frontier Model Safety Framework

Amazon

Published on: 2025-02-27 1 author
AI-Instruments: Embodying Prompts as Instruments to Abstract & Reflect Graphical Interface Commands as General-Purpose Tools

Microsoft

Published on: 2025-02-26 1 author
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models

Amazon

Published on: 2025-02-25 1 author
Muon is Scalable for LLM Training

Moonshot AI

Published on: 2025-02-24 1 author
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Meta Platforms / University of California

Published on: 2025-02-20 1 author
AToken: A Unified Tokenizer for Vision

Apple

Published on: 2025-02-19 1 author
Qwen2.5-VL Technical Report

Alibaba

Published on: 2025-02-19 1 author
MoBA: Mixture of Block Attention for Long-Context LLMs

Moonshot AI / Tsinghua University

Published on: 2025-02-18 1 author
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

OpenAI

Published on: 2025-02-17 4 authors
GraNNite: Enabling High-Performance Execution of Graph Neural Networks on Resource-Constrained Neural Processing Units

Intel

Published on: 2025-02-13 1 author
Reviving The Classics: Active Reward Modeling in Large Language Model Alignment

ByteDance

Published on: 2025-02-04 1 author
s1: Simple test-time scaling

Contextual AI / Allen Institute for Artificial Intelligence, Stanford University, University of Washington

Published on: 2025-01-31 10 authors
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Anthropic / Safeguards Research Team

Published on: 2025-01-31 1 author
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling

DeepSeek

Published on: 2025-01-29 1 author
EmbeddingGemma: Powerful and Lightweight Text Representations

Google

Published on: 2025-01-24 1 author
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making

Amazon

Published on: 2025-01-19 1 author
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Z.ai / Tsinghua University

Published on: 2025-01-17 1 author
MiniMax-01: Scaling Foundation Models with Lightning Attention

MiniMax

Published on: 2025-01-14 1 author
PoAct: Policy and Action Dual-Control Agent for Generalized Applications

Z.ai / Central South University, Tsinghua University

Published on: 2025-01-13 1 author
Agent Laboratory: Using LLM Agents as Research Assistants

AMD / Johns Hopkins University, Swiss Federal Institute of Technology in Zurich

Published on: 2025-01-08 10 authors
Retrieval-Augmented Generation with Graphs (GraphRAG)

Amazon / Michigan State University

Published on: 2025-01-08 1 author
Cosmos World Foundation Model Platform for Physical AI

NVIDIA

Published on: 2025-01-07 1 author
Titans: Learning to Memorize at Test Time

Google

Published on: 2024-12-31 1 author
Generative Video Propagation

Adobe / The Chinese University of Hong Kong

Published on: 2024-12-27 1 author
In Case You Missed It: ARC 'Challenge' Is Not That Challenging

Snowflake

Published on: 2024-12-23 1 author
Qwen2.5 Technical Report

Alibaba

Published on: 2024-12-19 1 author
Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference

Workday / Queen’s University

Published on: 2024-12-18 1 author
Alignment faking in large language models

Anthropic / New York University

Published on: 2024-12-18 1 author
How Often are Fingerprints Repeated in the Population? Expanding on Evidence from AI With the Birthday Paradox

University of Pennsylvania Department of Criminology and Statistics, University of Pennsylvania School of Engineering and Applied Sciences

Published on: 2024-12-17 2 authors
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

DeepSeek

Published on: 2024-12-13 1 author
VDB-GPDF: Online Gaussian Process Distance Field with VDB Structure

Google

Published on: 2024-12-12 1 author
pfl-research: simulation framework for accelerating research in Private Federated Learning

Apple

Published on: 2024-12-10 1 author
Frontier AI systems have surpassed the self-replicating red line

Fudan University

Published on: 2024-12-09 4 authors
InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention

Snap / University of California

Published on: 2024-12-09 1 author
Best-of-N Jailbreaking

Anthropic, Tangentic / MATS, Stanford University, University College London, University of Oxford

Published on: 2024-12-04 10 authors
Creating realistic 3D shapes using generative AI

Massachusetts Institute of Technology

Published on: 2024-12-04 1 author
Commit0: Library Generation from Scratch

Cohere / Cornell University

Published on: 2024-12-02 1 author

Prev 64 65 66 67 68 69 70 71 72 73 74 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: