Papers
- Training-Free Group Relative Policy Optimization
- What Is Your Agent's GPA? A Framework for Evaluating Agent Goal-Plan-Action Alignment
- Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory
- Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples
- WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
- Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences
- SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications
- Mind Your Tone: Investigating How Prompt Politeness Affects LLM Accuracy (short paper)
- Less is More: Recursive Reasoning with Tiny Networks
- Tongyi DeepResearch Technical Report
- TabArena: A Living Benchmark for Machine Learning on Tabular Data
- IBM Granite 4.0: hyper-efficient, high performance hybrid models for enterprise
- Diffusion Adversarial Post-Training for One-Step Video Generation
- Pretraining Large Language Models with NVFP4
- Training Agents Inside of Scalable World Models
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation
- WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
- Inference-Time Scaling for Generalist Reward Modeling
- MechStyle: Augmenting Generative AI with Mechanical Simulation to Create Stylized and Structurally Viable 3D Models
- FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models
- Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in Conversation
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
- From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
- EpiCache: Episodic KV Cache Management for Long Conversational Question Answering
- "My Boyfriend is AI": A Computational Analysis of Human-AI Companionship in Reddit's AI Community
- Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning
- Towards an AI-Augmented Textbook
- Steering MoE LLMs via Expert (De)Activation
- Robix: A Unified Model for Robot Interaction, Reasoning and Planning
- AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data
- An AI System to Help Scientists Write Expert-Level Empirical Software
- Why Language Models Hallucinate
- Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving
- GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
- Measuring the environmental impact of delivering AI at Google Scale
- 3D-GENERALIST: Vision-Language-Action Models for Crafting 3D Worlds
- X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms
- NoProp: Training Neural Networks without Full Back-propagation or Full Forward-propagation
- Matrix-3D: Omnidirectional Explorable 3D World Generation
- Amazon Ads Multi-Touch Attribution
- Scaling Laws for Native Multimodal Models
- Devstral: Fine-tuning Language Models for Coding Agent Applications
- Establishing Best Practices for Building Rigorous Agentic Benchmarks
- No LLM Solved Yu Tsumura's 554th Problem
- Why do LLMs attend to the first token?
- Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction
- Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation
- Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference
- Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
- Geak: Introducing Triton Kernel AI Agent & Evaluation Benchmarks
