Papers

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Microsoft / MIT

Published on: 2026-01-15 1 author
Reasoning Models Generate Societies of Thought

Google

Published on: 2026-01-15 1 author
Hardware Acceleration for Neural Networks: A Comprehensive Survey

Arizona State University

Published on: 2026-01-15 1 author
RPC-Bench: A Fine-grained Benchmark for Research Paper Comprehension

Z.ai / Xinjiang University

Published on: 2026-01-14 1 author
TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback

Amazon

Published on: 2026-01-13 1 author
Apollo: Unified Audio-Video Joint Generation

Kuaishou Technology

Published on: 2026-01-13 1 author
Controlled LLM Training on Spectral Sphere

Microsoft / Renmin University

Published on: 2026-01-13 1 author
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

ByteDance / Peking University

Published on: 2026-01-13 1 author
Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

The Hong Kong Polytechnic University

Published on: 2026-01-13 5 authors
AgriAgent: Contract-Driven Planning and Capability-Aware Tool Orchestration in Real-World Agriculture

Published on: 2026-01-13 7 authors
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

DeepSeek / Peking University

Published on: 2026-01-12 1 author
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL

Snowflake / University of Maryland

Published on: 2026-01-12 1 author
BabyVision: Visual Reasoning Beyond Language

Moonshot AI / Peking University

Published on: 2026-01-10 1 author
RigMo: Unifying Rig and Motion Learning for Generative Animation

Snap / University of Illinois Urbana-Champaign

Published on: 2026-01-10 1 author
GenCtrl -- A Formal Controllability Toolkit for Generative Models

Apple / Universitat Pompeu Fabra

Published on: 2026-01-09 1 author
Sprint: Sparse-Dense Residual Fusion for Efficient Diffusion Transformers

Snap / Korea University

Published on: 2026-01-09 1 author
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Z.ai / Tsinghua University

Published on: 2026-01-09 1 author
GR-Dexter Technical Report

ByteDance

Published on: 2026-01-09 1 author
InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

Microsoft / Peking University

Published on: 2026-01-08 1 author
Internal Representations as Indicators of Hallucinations in Agent Tool Selection

Amazon

Published on: 2026-01-08 1 author
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Meta Platforms

Published on: 2026-01-08 1 author
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Kuaishou Technology / Nanjing University

Published on: 2026-01-07 1 author
Pearmut: Human Evaluation of Translation Made Trivial

Cohere

Published on: 2026-01-06 1 author
Listen to Rhythm, Choose Movements: Autoregressive Multimodal Dance Generation via Diffusion and Mamba with Decoupled Dance Dataset

Z.ai / University of China

Published on: 2026-01-06 1 author
RIMRULE: Improving Tool-Using Language Agents via MDL-Guided Rule Learning

Intuit / Temple University

Published on: 2026-01-05 1 author
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Z.ai / Tsinghua University

Published on: 2026-01-05 1 author
Thinking with Blueprints: Assisting Vision-Language Models in Spatial Reasoning via Structured Object Representation

Microsoft / National University of Singapore

Published on: 2026-01-05 1 author
NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

ByteDance / Tsinghua University

Published on: 2026-01-05 1 author
ELLA: Efficient Lifelong Learning for Adapters

Amazon / Purdue University

Published on: 2026-01-05 1 author
Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes

Amazon

Published on: 2026-01-05 1 author
mHC: Manifold-Constrained Hyper-Connections

DeepSeek

Published on: 2026-01-05 1 author
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

DeepSeek

Published on: 2026-01-04 1 author
AHA! Animating Human Avatars in Diverse Scenes with Gaussian Splatting

Snap / Tübingen AI Center, University of Tübingen

Published on: 2026-01-04 1 author
Animated 3DGS Avatars in Diverse Scenes with Consistent Lighting and Shadows

Snap / Tübingen AI Center, University of Tübingen

Published on: 2026-01-04 1 author
NarrativeTrack: Evaluating Video Language Models Beyond the Frame

Apple / University of Illinois Urbana-Champaign

Published on: 2026-01-03 1 author
Delay-Tolerant Networking for Tsunami Evacuation on the Small Island of Hachijojima: A Study of Epidemic and Prophet Routing

Apple / The University of Nottingham

Published on: 2025-12-31 1 author
GARDO: Reinforcing Diffusion Models without Reward Hacking

Kuaishou Technology / Hong Kong University of Science and Technology

Published on: 2025-12-30 1 author
ThinkGen: Generalized Thinking for Visual Generation

ByteDance / Beijing Jiaotong University

Published on: 2025-12-29 1 author
Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration

Apple

Published on: 2025-12-26 1 author
DiverseGRPO: Mitigating Mode Collapse in Image Generation via Diversity-Aware GRPO

Kuaishou Technology

Published on: 2025-12-25 1 author
SemanticGen: Video Generation in Semantic Space

Kuaishou Technology / Zhejiang University

Published on: 2025-12-25 1 author
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

DeepSeek

Published on: 2025-12-23 1 author
FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models

Snap / Sun Yat-sen University

Published on: 2025-12-23 1 author
GR-RL: Going Dexterous and Precise for Long-Horizon Robotic Manipulation

ByteDance

Published on: 2025-12-23 1 author
From Word to World: Can Large Language Models be Implicit Text-based World Models?

Microsoft / Southern University of Science and Technology

Published on: 2025-12-21 1 author
Sigma-MoE-Tiny Technical Report

Microsoft / Microsoft Research

Published on: 2025-12-19 1 author
Journey Before Destination: On the importance of Visual Faithfulness in Slow Thinking

Amazon / University of Wisconsin-Madison

Published on: 2025-12-19 1 author
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Meta Platforms

Published on: 2025-12-19 1 author
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Google

Published on: 2025-12-19
Kling-Omni Technical Report

Kuaishou Technology

Published on: 2025-12-18 1 author
