Papers

Filter by company

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

MiniMax

Published on: 2025-09-26 1 author
Inference-Time Scaling for Generalist Reward Modeling

DeepSeek

Published on: 2025-09-25 1 author
MechStyle: Augmenting Generative AI with Mechanical Simulation to Create Stylized and Structurally Viable 3D Models

Google Research, Stability AI / Center for Bits and Atoms, MIT, Khoury College of Computer Sciences, Northeastern University, University of Washington

Published on: 2025-09-24 10 authors
FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models

Apple / The Ohio State University

Published on: 2025-09-24 6 authors
Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in Conversation

MiniMax / South China University of Technology

Published on: 2025-09-24 1 author
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Baichuan Intelligent Technology

Published on: 2025-09-23 1 author
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Adobe

Published on: 2025-09-23 1 author
EpiCache: Episodic KV Cache Management for Long Conversational Question Answering

Apple / Hanyang University

Published on: 2025-09-22 5 authors
"My Boyfriend is AI": A Computational Analysis of Human-AI Companionship in Reddit's AI Community

Massachusetts Institute of Technology

Published on: 2025-09-14 6 authors
Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning

Microsoft / MIT

Published on: 2025-09-14 1 author
Towards an AI-Augmented Textbook

Google

Published on: 2025-09-13 34 authors
Steering MoE LLMs via Expert (De)Activation

Adobe

Published on: 2025-09-11 1 author
Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Alibaba

Published on: 2025-09-11 1 author
AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data

Google

Published on: 2025-09-08 1 author
An AI System to Help Scientists Write Expert-Level Empirical Software

Google / MIT

Published on: 2025-09-08 1 author
Why Language Models Hallucinate

OpenAI / Georgia Institute of Technology

Published on: 2025-09-04 4 authors
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Moonshot AI / Tsinghua University

Published on: 2025-09-03 1 author
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Apple / Washington State University

Published on: 2025-08-27 1 author
Measuring the environmental impact of delivering AI at Google Scale

Google

Published on: 2025-08-21
3D-GENERALIST: Vision-Language-Action Models for Crafting 3D Worlds

NVIDIA / Stanford University

Published on: 2025-08-19 1 author
X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms

DeepSeek / University of Illinois Urbana-Champaign

Published on: 2025-08-18 1 author
NoProp: Training Neural Networks without Full Back-propagation or Full Forward-propagation

Mila / University of Oxford

Published on: 2025-08-17 3 authors
Matrix-3D: Omnidirectional Explorable 3D World Generation

Published on: 2025-08-11
Amazon Ads Multi-Touch Attribution

Amazon / Northwestern University

Published on: 2025-08-11 1 author
Scaling Laws for Native Multimodal Models

Apple / Sorbonne University

Published on: 2025-08-09 1 author
Devstral: Fine-tuning Language Models for Coding Agent Applications

Mistral AI

Published on: 2025-08-08 1 author
Establishing Best Practices for Building Rigorous Agentic Benchmarks

Amazon / Stanford University

Published on: 2025-08-07 1 author
No LLM Solved Yu Tsumura's 554th Problem

University of Cambridge, University of Oxford

Published on: 2025-08-05 2 authors
Why do LLMs attend to the first token

Google / University of Oxford

Published on: 2025-08-05 7 authors
Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction

Amazon / Princeton University

Published on: 2025-08-05 1 author
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation

Published on: 2025-08-05 1 author
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

ByteDance / Tsinghua University

Published on: 2025-08-04 1 author
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

MetaGPT, Google, Microsoft / Nanyang Technological University, Université de Montréal, University of Illinois at Urbana-Champaign

Published on: 2025-08-02 1 author
Geak: Introducing Triton Kernel AI Agent & Evaluation Benchmarks

AMD

Published on: 2025-07-31 1 author
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Alibaba

Published on: 2025-07-31 1 author
Kimi K2: Open Agentic Intelligence

Moonshot AI

Published on: 2025-07-28 1 author
Scaling Data-Constrained Language Models

Google / Harvard University

Published on: 2025-07-28 1 author
Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice

ByteDance

Published on: 2025-07-27 1 author
Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them

Cohere

Published on: 2025-07-25 1 author
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning

Snowflake / Seoul National University

Published on: 2025-07-21 1 author
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition

DeepSeek

Published on: 2025-07-18 1 author
Voxtral

Mistral AI

Published on: 2025-07-17
Apple Intelligence Foundation Language Models: Tech Report 2025

Apple

Published on: 2025-07-17 1 author
Non-preemptive Throughput Maximizationunder Time-varying Capacity

Google / University of Illinois Urbana-Champaign

Published on: 2025-07-16 1 author
MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning

Amazon / MIT

Published on: 2025-07-12 1 author
A Survey of Automatic Prompt Optimization with Instruction-focused Heuristic-based Search Algorithm

Intuit / Vanderbilt University

Published on: 2025-07-12 1 author
SEE: Strategic Exploration and Exploitation for Cohesive In-Context Prompt Optimization

Intuit / Vanderbilt University

Published on: 2025-07-12 1 author
Skywork-R1V3 Technical Report

Published on: 2025-07-10 1 author
SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Age

Anthropic / Redwood Research

Published on: 2025-07-08 1 author
Unconditional Diffusion for Generative Sequential Recommendation

ByteDance / University of Science and Technology of China

Published on: 2025-07-08 1 author

Prev 61 62 63 64 65 66 67 68 69 70 71 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: