Papers

SparK: Query-Aware Unstructured Sparsity with Recoverable KV Cache Channel Pruning

AMD / University of Chinese Academy of Sciences

Published on: 2025-10-12 1 author
BenchPress: Rapid Benchmark Curation

Intel / MIT

Published on: 2025-10-11 1 author
LLM-based Relevance Assessment for Web-Scale Search Evaluation at Pinterest

Pinterest

Published on: 2025-10-11 1 author
Training-Free Group Relative Policy Optimization

Tencent / Fudan University, Xiamen University

Published on: 2025-10-09 13 authors
LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs

Microsoft / University of California

Published on: 2025-10-09 5 authors
Efficient and Adaptable Overlapping for Computation and Communication via Signaling and Reordering

AMD / Tsinghua University

Published on: 2025-10-09 1 author
Training-Free Group Relative Policy Optimization

Tencent

Published on: 2025-10-09 1 author
What Is Your Agent's GPA? A Framework for Evaluating Agent Goal-Plan-Action Alignment

Snowflake / Stanford University

Published on: 2025-10-09 1 author
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory

Alibaba / Zhejiang University

Published on: 2025-10-09 1 author
Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples

Anthropic / Alan Turing Institute, Oxford Applied and Theoretical Machine Learning, Swiss Federal Institute of Technology in Zurich, UK AI Security Institute, University of Oxford

Published on: 2025-10-08 13 authors
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Amazon / University of Virginia

Published on: 2025-10-08 1 author
Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences

Stanford University

Published on: 2025-10-07 2 authors
SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications

Snowflake

Published on: 2025-10-07 Venue: Carnegie Mellon University 1 author
Mind Your Tone: Investigating How Prompt Politeness Affects LLM Accuracy (short paper)

The Pennsylvania State University

Published on: 2025-10-06 2 authors
Less is More: Recursive Reasoning with Tiny Networks

Samsung SAIL

Published on: 2025-10-06 1 author
Tongyi DeepResearch Technical Report

Alibaba

Published on: 2025-10-04 1 author
TabArena: A Living Benchmark for Machine Learning on Tabular Data

Amazon / University of Freiburg

Published on: 2025-10-03 1 author
IBM Granite 4.0: hyper-efficient, high performance hybrid models for enterprise

IBM

Published on: 2025-10-02 1 author
Diffusion Adversarial Post-Training for One-Step Video Generation

ByteDance

Published on: 2025-10-01 1 author
Pretraining Large Language Models with NVFP4

NVIDIA

Published on: 2025-09-29 90 authors
Training Agents Inside of Scalable World Models

Google

Published on: 2025-09-29 1 author
APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation

AMD / Carnegie Mellon University

Published on: 2025-09-26 1 author
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

MiniMax

Published on: 2025-09-26 1 author
Inference-Time Scaling for Generalist Reward Modeling

DeepSeek

Published on: 2025-09-25 1 author
MechStyle: Augmenting Generative AI with Mechanical Simulation to Create Stylized and Structurally Viable 3D Models

Google Research, Stability AI / Center for Bits and Atoms, MIT, Khoury College of Computer Sciences, Northeastern University, University of Washington

Published on: 2025-09-24 10 authors
FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models

Apple / The Ohio State University

Published on: 2025-09-24 6 authors
Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in Conversation

MiniMax / South China University of Technology

Published on: 2025-09-24 1 author
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Baichuan Intelligent Technology

Published on: 2025-09-23 1 author
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Adobe

Published on: 2025-09-23 1 author
EpiCache: Episodic KV Cache Management for Long Conversational Question Answering

Apple / Hanyang University

Published on: 2025-09-22 5 authors
"My Boyfriend is AI": A Computational Analysis of Human-AI Companionship in Reddit's AI Community

Massachusetts Institute of Technology

Published on: 2025-09-14 6 authors
Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning

Microsoft / MIT

Published on: 2025-09-14 1 author
Towards an AI-Augmented Textbook

Google

Published on: 2025-09-13 34 authors
Steering MoE LLMs via Expert (De)Activation

Adobe

Published on: 2025-09-11 1 author
Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Alibaba

Published on: 2025-09-11 1 author
AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data

Google

Published on: 2025-09-08 1 author
An AI System to Help Scientists Write Expert-Level Empirical Software

Google / MIT

Published on: 2025-09-08 1 author
Why Language Models Hallucinate

OpenAI / Georgia Institute of Technology

Published on: 2025-09-04 4 authors
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Moonshot AI / Tsinghua University

Published on: 2025-09-03 1 author
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Apple / Washington State University

Published on: 2025-08-27 1 author
Measuring the environmental impact of delivering AI at Google Scale

Google

Published on: 2025-08-21
3D-GENERALIST: Vision-Language-Action Models for Crafting 3D Worlds

NVIDIA / Stanford University

Published on: 2025-08-19 1 author
X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms

DeepSeek / University of Illinois Urbana-Champaign

Published on: 2025-08-18 1 author
NoProp: Training Neural Networks without Full Back-propagation or Full Forward-propagation

Mila / University of Oxford

Published on: 2025-08-17 3 authors
Matrix-3D: Omnidirectional Explorable 3D World Generation

Published on: 2025-08-11
Amazon Ads Multi-Touch Attribution

Amazon / Northwestern University

Published on: 2025-08-11 1 author
Scaling Laws for Native Multimodal Models

Apple / Sorbonne University

Published on: 2025-08-09 1 author
Devstral: Fine-tuning Language Models for Coding Agent Applications

Mistral AI

Published on: 2025-08-08 1 author
Establishing Best Practices for Building Rigorous Agentic Benchmarks

Amazon / Stanford University

Published on: 2025-08-07 1 author
No LLM Solved Yu Tsumura's 554th Problem

University of Cambridge, University of Oxford

Published on: 2025-08-05 2 authors
