Papers

Filter by company

Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning

Microsoft / MIT

Published on: 2025-09-14 1 author
Towards an AI-Augmented Textbook

Published on: 2025-09-13 37 authors
Steering MoE LLMs via Expert (De)Activation

Adobe

Published on: 2025-09-11 1 author
Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Alibaba

Published on: 2025-09-11 1 author
AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data

Google

Published on: 2025-09-08 1 author
An AI System to Help Scientists Write Expert-Level Empirical Software

Google / MIT

Published on: 2025-09-08 1 author
Why Language Models Hallucinate

Published on: 2025-09-04 4 authors
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Moonshot AI / Tsinghua University

Published on: 2025-09-03 1 author
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Apple / Washington State University

Published on: 2025-08-27 1 author
Measuring the environmental impact of delivering AI at Google Scale

Google

Published on: 2025-08-21
3D-GENERALIST: Vision-Language-Action Models for Crafting 3D Worlds

NVIDIA / Stanford University

Published on: 2025-08-19 1 author
X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms

DeepSeek / University of Illinois Urbana-Champaign

Published on: 2025-08-18 1 author
NoProp: Training Neural Networks without Full Back-propagation or Full Forward-propagation

Mila / University of Oxford

Published on: 2025-08-17 3 authors
Matrix-3D: Omnidirectional Explorable 3D World Generation

Published on: 2025-08-11
Amazon Ads Multi-Touch Attribution

Amazon / Northwestern University

Published on: 2025-08-11 1 author
Scaling Laws for Native Multimodal Models

Apple / Sorbonne University

Published on: 2025-08-09 1 author
Devstral: Fine-tuning Language Models for Coding Agent Applications

Mistral AI

Published on: 2025-08-08 1 author
Establishing Best Practices for Building Rigorous Agentic Benchmarks

Amazon / Stanford University

Published on: 2025-08-07 1 author
No LLM Solved Yu Tsumura's 554th Problem

Published on: 2025-08-05 2 authors
Why do LLMs attend to the first token

Google / University of Oxford

Published on: 2025-08-05 7 authors
Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction

Amazon / Princeton University

Published on: 2025-08-05 1 author
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation

Published on: 2025-08-05 1 author
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

ByteDance / Tsinghua University

Published on: 2025-08-04 1 author
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

MetaGPT, Google, Microsoft / Nanyang Technological University, Université de Montréal, University of Illinois at Urbana-Champaign

Published on: 2025-08-02 1 author
Geak: Introducing Triton Kernel AI Agent & Evaluation Benchmarks

AMD

Published on: 2025-07-31 1 author
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Alibaba

Published on: 2025-07-31 1 author
Kimi K2: Open Agentic Intelligence

Moonshot AI

Published on: 2025-07-28 1 author
Scaling Data-Constrained Language Models

Google / Harvard University

Published on: 2025-07-28 1 author
Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice

ByteDance

Published on: 2025-07-27 1 author
Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them

Cohere

Published on: 2025-07-25 1 author
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning

Snowflake / Seoul National University

Published on: 2025-07-21 1 author
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition

DeepSeek

Published on: 2025-07-18 1 author
Voxtral

Mistral AI

Published on: 2025-07-17
Apple Intelligence Foundation Language Models: Tech Report 2025

Apple

Published on: 2025-07-17 1 author
Non-preemptive Throughput Maximizationunder Time-varying Capacity

Google / University of Illinois Urbana-Champaign

Published on: 2025-07-16 1 author
MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning

Amazon / MIT

Published on: 2025-07-12 1 author
A Survey of Automatic Prompt Optimization with Instruction-focused Heuristic-based Search Algorithm

Intuit / Vanderbilt University

Published on: 2025-07-12 1 author
SEE: Strategic Exploration and Exploitation for Cohesive In-Context Prompt Optimization

Intuit / Vanderbilt University

Published on: 2025-07-12 1 author
Skywork-R1V3 Technical Report

Published on: 2025-07-10 1 author
SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Age

Anthropic / Redwood Research

Published on: 2025-07-08 1 author
Unconditional Diffusion for Generative Sequential Recommendation

ByteDance / University of Science and Technology of China

Published on: 2025-07-08 1 author
Evaluating the Critical Risks of Amazon’s Nova Premier under the Frontier Model Safety Framework

Amazon

Published on: 2025-07-07 1 author
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Published on: 2025-07-03 1 author
`For Argument's Sake, Show Me How to Harm Myself!': Jailbreaking LLMs in Suicide and Self-Harm Contexts

Published on: 2025-07-01 2 authors
UMA: A Family of Universal Models for Atoms

Meta Platforms / Carnegie Mellon University

Published on: 2025-06-30 1 author
Hierarchical Reasoning Model

Published on: 2025-06-26 9 authors
Steering Your Diffusion Policy with Latent Space Reinforcement Learning

Amazon

Published on: 2025-06-25 1 author
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Published on: 2025-06-24 1 author
KIMI-VL TECHNICAL REPORT

Moonshot AI

Published on: 2025-06-23 1 author
TransAct V2: Lifelong User Action Sequence Modeling on Pinterest Recommendation

Pinterest

Published on: 2025-06-21 1 author

Prev 23 24 25 26 27 28 29 30 31 32 33 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: