Papers

Filter by company

AgriAgent: Contract-Driven Planning and Capability-Aware Tool Orchestration in Real-World Agriculture

Published on: 2026-01-13 7 authors
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

DeepSeek / Peking University

Published on: 2026-01-12 1 author
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL

Snowflake / University of Maryland

Published on: 2026-01-12 1 author
BabyVision: Visual Reasoning Beyond Language

Moonshot AI / Peking University

Published on: 2026-01-10 1 author
RigMo: Unifying Rig and Motion Learning for Generative Animation

Snap / University of Illinois Urbana-Champaign

Published on: 2026-01-10 1 author
AIE4ML: An End-to-End Framework for Compiling Neural Networks for the Next Generation of AMD AI Engines

AMD / Institute of Physics Belgrade

Published on: 2026-01-09 1 author
FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Tencent / Singapore University of Technology and Design

Published on: 2026-01-09 1 author
UniFinEval: Towards Unified Evaluation of Financial Multimodal Models across Text, Images and Videos

Tencent / Shanghai University of Finance and Economics

Published on: 2026-01-09 1 author
Rotate Your Character: Revisiting Video Diffusion Models for High-Quality 3D Character Generation

Tencent / The University of Hong Kong

Published on: 2026-01-09 1 author
One Language-Free Foundation Model Is Enough for Universal Vision Anomaly Detection

Tencent

Published on: 2026-01-09 1 author
GenCtrl -- A Formal Controllability Toolkit for Generative Models

Apple / Universitat Pompeu Fabra

Published on: 2026-01-09 1 author
Sprint: Sparse-Dense Residual Fusion for Efficient Diffusion Transformers

Snap / Korea University

Published on: 2026-01-09 1 author
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Z.ai / Tsinghua University

Published on: 2026-01-09 1 author
GR-Dexter Technical Report

ByteDance

Published on: 2026-01-09 1 author
Challenges and Research Directions for Large Language Model Inference Hardware

Google DeepMind / Laude Institute, University of California, Berkeley

Published on: 2026-01-08 2 authors
Pixel-Perfect Visual Geometry Estimation

Xiaomi

Published on: 2026-01-08 1 author
DocDancer: Towards Agentic Document-Grounded Information Seeking

Tencent / Peking University

Published on: 2026-01-08 1 author
Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing

Tencent / Institute of Information Engineering

Published on: 2026-01-08 1 author
InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

Microsoft / Peking University

Published on: 2026-01-08 1 author
Internal Representations as Indicators of Hallucinations in Agent Tool Selection

Amazon

Published on: 2026-01-08 1 author
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Meta Platforms

Published on: 2026-01-08 1 author
FOREVER: Forgetting Curve-Inspired Memory Replay for Language Model Continual Learning

Tencent / The Hong Kong Polytechnic University

Published on: 2026-01-07 1 author
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Kuaishou Technology / Nanjing University

Published on: 2026-01-07 1 author
Extracting books from production language models

Alibaba / School of Cyber Science and Engineering, Wuhan University

Published on: 2026-01-06 4 authors
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Tencent

Published on: 2026-01-06 1 author
A Versatile Multimodal Agent for Multimedia Content Generation

Tencent / University of Rochester

Published on: 2026-01-06 1 author
Efficient Context Scaling with LongCat ZigZag Attention

Meituan

Published on: 2026-01-06 1 author
Pearmut: Human Evaluation of Translation Made Trivial

Cohere

Published on: 2026-01-06 1 author
Listen to Rhythm, Choose Movements: Autoregressive Multimodal Dance Generation via Diffusion and Mamba with Decoupled Dance Dataset

Z.ai / University of China

Published on: 2026-01-06 1 author
Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents

Published on: 2026-01-05 7 authors
CD4LM: Consistency Distillation and aDaptive Decoding for Diffusion Language Models

AMD / Princeton University

Published on: 2026-01-05 1 author
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Tencent

Published on: 2026-01-05 1 author
RIMRULE: Improving Tool-Using Language Agents via MDL-Guided Rule Learning

Intuit / Temple University

Published on: 2026-01-05 1 author
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Z.ai / Tsinghua University

Published on: 2026-01-05 1 author
Thinking with Blueprints: Assisting Vision-Language Models in Spatial Reasoning via Structured Object Representation

Microsoft / National University of Singapore

Published on: 2026-01-05 1 author
NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

ByteDance / Tsinghua University

Published on: 2026-01-05 1 author
ELLA: Efficient Lifelong Learning for Adapters

Amazon / Purdue University

Published on: 2026-01-05 1 author
Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes

Amazon

Published on: 2026-01-05 1 author
mHC: Manifold-Constrained Hyper-Connections

DeepSeek

Published on: 2026-01-05 1 author
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

DeepSeek

Published on: 2026-01-04 1 author
AHA! Animating Human Avatars in Diverse Scenes with Gaussian Splatting

Snap / Tübingen AI Center, University of Tübingen,

Published on: 2026-01-04 1 author
Animated 3DGS Avatars in Diverse Scenes with Consistent Lighting and Shadows

Snap / Tübingen AI Center, University of Tübingen,

Published on: 2026-01-04 1 author
NarrativeTrack: Evaluating Video Language Models Beyond the Frame

Apple / University of Illinois Urbana-Champaign

Published on: 2026-01-03 1 author
DriveLaW:Unifying Planning and Video Generation in a Latent Driving World

Xiaomi / Huazhong University of Science and Technology

Published on: 2025-12-31 1 author
Delay-Tolerant Networking for Tsunami Evacuation on the Small Island of Hachijojima: A Study of Epidemic and Prophet Routing

Apple / The University of Nottingham

Published on: 2025-12-31 1 author
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection

Tencent / Singapore Management University

Published on: 2025-12-30 1 author
HY-MT1.5 Technical Report

Tencent

Published on: 2025-12-30 1 author
GARDO: Reinforcing Diffusion Models without Reward Hacking

Kuaishou Technology / Hong Kong University of Science and Technology

Published on: 2025-12-30 1 author
Flow2GAN: Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-step High-Fidelity Audio Generation

Xiaomi

Published on: 2025-12-29 1 author
ThinkGen: Generalized Thinking for Visual Generation

ByteDance / Beijing Jiaotong University

Published on: 2025-12-29 1 author

Prev 57 58 59 60 61 62 63 64 65 66 67 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: