Papers

Filter by company

Power Aware Dynamic Reallocation For Inference

AMD

Published on: 2026-01-18 1 author
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents

Intuit / Arizona State University

Published on: 2026-01-18 1 author
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Amazon / University of California

Published on: 2026-01-18 1 author
Agentic Reasoning for Large Language Models

Meta Platforms / Amazon Research, Google DeepMind, Meta, University of Illinois Urbana-Champaign

Published on: 2026-01-18 1 author
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Anthropic / Stanford University

Published on: 2026-01-17 1 author
FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Flash Labs

Published on: 2026-01-16 7 authors
VINO: A Unified Visual Generator with Interleaved OmniModal Context

Kuaishou Technology / Shanghai Jiao Tong University

Published on: 2026-01-16
KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta

Meta Platforms

Published on: 2026-01-16 1 author
OCTOBENCH: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding

MiniMax / Fudan University

Published on: 2026-01-16 1 author
TranslateGemma Technical Report

Google

Published on: 2026-01-16
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

Published on: 2026-01-16 13 authors
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

Anthropic / University of Oxford

Published on: 2026-01-15 1 author
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Microsoft / MIT

Published on: 2026-01-15 1 author
Reasoning Models Generate Societies of Thought

Google

Published on: 2026-01-15 1 author
Hardware Acceleration for Neural Networks: A Comprehensive Survey

Arizona State University

Published on: 2026-01-15 1 author
RPC-Bench: A Fine-grained Benchmark for Research Paper Comprehension

Z.ai / Xinjiang University

Published on: 2026-01-14 1 author
Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering

Meituan / Peking University

Published on: 2026-01-13 1 author
TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback

Amazon

Published on: 2026-01-13 1 author
Apollo: Unified Audio-Video Joint Generation

Kuaishou Technology

Published on: 2026-01-13 1 author
Controlled LLM Training on Spectral Sphere

Microsoft / Renmin University

Published on: 2026-01-13 1 author
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

ByteDance / Peking University

Published on: 2026-01-13 1 author
Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

The Hong Kong Polytechnic University

Published on: 2026-01-13 5 authors
AgriAgent: Contract-Driven Planning and Capability-Aware Tool Orchestration in Real-World Agriculture

Published on: 2026-01-13 7 authors
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

DeepSeek / Peking University

Published on: 2026-01-12 1 author
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL

Snowflake / University of Maryland

Published on: 2026-01-12 1 author
BabyVision: Visual Reasoning Beyond Language

Moonshot AI / Peking University

Published on: 2026-01-10 1 author
RigMo: Unifying Rig and Motion Learning for Generative Animation

Snap / University of Illinois Urbana-Champaign

Published on: 2026-01-10 1 author
AIE4ML: An End-to-End Framework for Compiling Neural Networks for the Next Generation of AMD AI Engines

AMD / Institute of Physics Belgrade

Published on: 2026-01-09 1 author
FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Tencent / Singapore University of Technology and Design

Published on: 2026-01-09 1 author
UniFinEval: Towards Unified Evaluation of Financial Multimodal Models across Text, Images and Videos

Tencent / Shanghai University of Finance and Economics

Published on: 2026-01-09 1 author
Rotate Your Character: Revisiting Video Diffusion Models for High-Quality 3D Character Generation

Tencent / The University of Hong Kong

Published on: 2026-01-09 1 author
One Language-Free Foundation Model Is Enough for Universal Vision Anomaly Detection

Tencent

Published on: 2026-01-09 1 author
GenCtrl -- A Formal Controllability Toolkit for Generative Models

Apple / Universitat Pompeu Fabra

Published on: 2026-01-09 1 author
Sprint: Sparse-Dense Residual Fusion for Efficient Diffusion Transformers

Snap / Korea University

Published on: 2026-01-09 1 author
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Z.ai / Tsinghua University

Published on: 2026-01-09 1 author
GR-Dexter Technical Report

ByteDance

Published on: 2026-01-09 1 author
Challenges and Research Directions for Large Language Model Inference Hardware

Google DeepMind / Laude Institute, University of California, Berkeley

Published on: 2026-01-08 2 authors
Pixel-Perfect Visual Geometry Estimation

Xiaomi

Published on: 2026-01-08 1 author
DocDancer: Towards Agentic Document-Grounded Information Seeking

Tencent / Peking University

Published on: 2026-01-08 1 author
Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing

Tencent / Institute of Information Engineering

Published on: 2026-01-08 1 author
InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

Microsoft / Peking University

Published on: 2026-01-08 1 author
Internal Representations as Indicators of Hallucinations in Agent Tool Selection

Amazon

Published on: 2026-01-08 1 author
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Meta Platforms

Published on: 2026-01-08 1 author
FOREVER: Forgetting Curve-Inspired Memory Replay for Language Model Continual Learning

Tencent / The Hong Kong Polytechnic University

Published on: 2026-01-07 1 author
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Kuaishou Technology / Nanjing University

Published on: 2026-01-07 1 author
Extracting books from production language models

Alibaba / School of Cyber Science and Engineering, Wuhan University

Published on: 2026-01-06 4 authors
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Tencent

Published on: 2026-01-06 1 author
A Versatile Multimodal Agent for Multimedia Content Generation

Tencent / University of Rochester

Published on: 2026-01-06 1 author
Efficient Context Scaling with LongCat ZigZag Attention

Meituan

Published on: 2026-01-06 1 author
Pearmut: Human Evaluation of Translation Made Trivial

Cohere

Published on: 2026-01-06 1 author

Prev 37 38 39 40 41 42 43 44 45 46 47 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: