Papers

AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

Shanghai Jiao Tong University, The Hong Kong Polytechnic University

Published on: 2026-01-16 13 authors
Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering

Published on: 2026-01-15 11 authors
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

Anthropic / University of Oxford

Published on: 2026-01-15 1 author
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Microsoft / Massachusetts Institute of Technology

Published on: 2026-01-15 1 author
Reasoning Models Generate Societies of Thought

Google / University of Chicago

Published on: 2026-01-15 1 author
Hardware Acceleration for Neural Networks: A Comprehensive Survey

Arizona State University

Published on: 2026-01-15 1 author
RPC-Bench: A Fine-grained Benchmark for Research Paper Comprehension

Z.ai / Xinjiang University

Published on: 2026-01-14 1 author
Motion Attribution for Video Generation

Published on: 2026-01-13 8 authors
The Hierarchy of Agentic Capabilities: Evaluating Frontier Models on Realistic RL Environments

Published on: 2026-01-13 5 authors
Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering

Meituan / Peking University

Published on: 2026-01-13 1 author
TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback

Amazon / Georgia Institute of Technology

Published on: 2026-01-13 1 author
Apollo: Unified Audio-Video Joint Generation

Kuaishou Technology

Published on: 2026-01-13 1 author
Controlled LLM Training on Spectral Sphere

Microsoft / Renmin University of China

Published on: 2026-01-13 1 author
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

ByteDance / Peking University

Published on: 2026-01-13 1 author
Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

The Hong Kong Polytechnic University

Published on: 2026-01-13 5 authors
AgriAgent: Contract-Driven Planning and Capability-Aware Tool Orchestration in Real-World Agriculture

Zhejiang University College of Computer Science and Technology

Published on: 2026-01-13 7 authors
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

DeepSeek / Peking University

Published on: 2026-01-12 1 author
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL

Snowflake / University of Maryland

Published on: 2026-01-12 1 author
Thinking—Fast, Slow, and Artificial: How AI is Reshaping Human Reasoning and the Rise of Cognitive Surrender

University of Pennsylvania

Published on: 2026-01-11 Venue: SSRN Preprint / The Wharton School Research Paper 2 authors
BabyVision: Visual Reasoning Beyond Language

Moonshot AI / Peking University

Published on: 2026-01-10 1 author
RigMo: Unifying Rig and Motion Learning for Generative Animation

Snap / University of Illinois Urbana-Champaign

Published on: 2026-01-10 1 author
AIE4ML: An End-to-End Framework for Compiling Neural Networks for the Next Generation of AMD AI Engines

AMD / Institute of Physics Belgrade

Published on: 2026-01-09 1 author
FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Tencent / Singapore University of Technology and Design

Published on: 2026-01-09 1 author
UniFinEval: Towards Unified Evaluation of Financial Multimodal Models across Text, Images and Videos

Tencent / Shanghai University of Finance and Economics

Published on: 2026-01-09 1 author
Rotate Your Character: Revisiting Video Diffusion Models for High-Quality 3D Character Generation

Tencent / The University of Hong Kong

Published on: 2026-01-09 1 author
One Language-Free Foundation Model Is Enough for Universal Vision Anomaly Detection

Tencent

Published on: 2026-01-09 1 author
GenCtrl -- A Formal Controllability Toolkit for Generative Models

Apple / Universitat Pompeu Fabra

Published on: 2026-01-09 1 author
Sprint: Sparse-Dense Residual Fusion for Efficient Diffusion Transformers

Snap / Korea University

Published on: 2026-01-09 1 author
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Z.ai / Tsinghua University

Published on: 2026-01-09 1 author
GR-Dexter Technical Report

ByteDance

Published on: 2026-01-09 1 author
Challenges and Research Directions for Large Language Model Inference Hardware

Google DeepMind / Laude Institute, University of California, Berkeley

Published on: 2026-01-08 2 authors
Pixel-Perfect Visual Geometry Estimation

Xiaomi / Huazhong University of Science and Technology, Zhejiang University

Published on: 2026-01-08 1 author
DocDancer: Towards Agentic Document-Grounded Information Seeking

Tencent / Peking University

Published on: 2026-01-08 1 author
Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing

Tencent / Institute of Information Engineering

Published on: 2026-01-08 1 author
InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

Microsoft / Peking University

Published on: 2026-01-08 1 author
Internal Representations as Indicators of Hallucinations in Agent Tool Selection

Amazon

Published on: 2026-01-08 1 author
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Meta Platforms / King Abdullah University of Science and Technology (KAUST), Princeton University

Published on: 2026-01-08 1 author
FOREVER: Forgetting Curve-Inspired Memory Replay for Language Model Continual Learning

Tencent / The Hong Kong Polytechnic University

Published on: 2026-01-07 1 author
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Kuaishou Technology / Nanjing University

Published on: 2026-01-07 1 author
Extracting books from production language models

Alibaba / School of Cyber Science and Engineering, Wuhan University

Published on: 2026-01-06 4 authors
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Tencent / Peking University

Published on: 2026-01-06 1 author
A Versatile Multimodal Agent for Multimedia Content Generation

Tencent / University of Rochester

Published on: 2026-01-06 1 author
Efficient Context Scaling with LongCat ZigZag Attention

Meituan

Published on: 2026-01-06 1 author
Pearmut: Human Evaluation of Translation Made Trivial

Cohere

Published on: 2026-01-06 1 author
Listen to Rhythm, Choose Movements: Autoregressive Multimodal Dance Generation via Diffusion and Mamba with Decoupled Dance Dataset

Z.ai / University of China

Published on: 2026-01-06 1 author
Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents

Wuhan University

Published on: 2026-01-05 7 authors
CD4LM: Consistency Distillation and aDaptive Decoding for Diffusion Language Models

AMD / Princeton University

Published on: 2026-01-05 1 author
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Tencent

Published on: 2026-01-05 1 author
RIMRULE: Improving Tool-Using Language Agents via MDL-Guided Rule Learning

Intuit / Temple University

Published on: 2026-01-05 1 author
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Z.ai / Tsinghua University

Published on: 2026-01-05 1 author
