Papers

Filter by company

Lost in Transmission: When and Why LLMs Fail to Reason Globally

Microsoft

Published on: 2026-01-30 1 author
PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

Published on: 2026-01-29 15 authors
PI-Light: Physics-Inspired Diffusion for Full-Image Relighting

Tencent / Nanyang Technological University

Published on: 2026-01-29 1 author
M2XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization

AMD / Shanghai Jiao Tong University

Published on: 2026-01-28 1 author
Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding

Tencent

Published on: 2026-01-28 1 author
Thought-Transfer: Indirect Targeted Poisoning Attacks on Chain-of-Thought Reasoning Models

Anthropic / Northeastern University, China

Published on: 2026-01-28 1 author
DeepSeek-OCR 2: Visual Causal Flow

DeepSeek

Published on: 2026-01-28 1 author
WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models

Moonshot AI

Published on: 2026-01-28 1 author
Efficient Autoregressive Video Diffusion with Dummy Head

Microsoft

Published on: 2026-01-28 1 author
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Google / University of Washington

Published on: 2026-01-28 Venue: NeurIPS 2022 8 authors
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision

Tencent

Published on: 2026-01-27 1 author
Astra: General Interactive World Model with Autoregressive Denoising

Kuaishou Technology / Tsinghua University

Published on: 2026-01-27 1 author
Towards Pixel-Level VLM Perception via Simple Points Prediction

Moonshot AI / Nanjing University

Published on: 2026-01-27 1 author
Differentiable Semantic ID for Generative Recommendation

Amazon / University of Glasgow

Published on: 2026-01-27 1 author
Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory

Intel / MIT, Tsinghua University,

Published on: 2026-01-26 1 author
Agentic Very Long Video Understanding

Meta Platforms / University of Wisconsin-Madison

Published on: 2026-01-26 1 author
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Meituan / Fudan University

Published on: 2026-01-23 1 author
AnyView: Synthesizing Any Novel View in Dynamic Scenes

Amazon / Toyota Research Institute

Published on: 2026-01-23 1 author
Latent Diffusion for Internet of Things Attack Data Generation in Intrusion Detection Systems

Universidad Rey Juan Carlos

Published on: 2026-01-23 2 authors
DSGym: A Holistic Framework for Evaluating and Training Data Science Agents

Together AI / Stanford University

Published on: 2026-01-22 1 author
CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback

Kuaishou Technology / Hong Kong University of Science and Technology

Published on: 2026-01-22 1 author
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Microsoft

Published on: 2026-01-22 1 author
DeepASMR: LLM-Based Zero-Shot ASMR Speech Generation for Anyone of Any Voice

Eleven Labs

Published on: 2026-01-22 1 author
RayRoPE: Projective Ray Positional Encoding for Multi-view Attention

Apple / Carnegie Mellon University

Published on: 2026-01-21 1 author
OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis

Snap / University of Washington

Published on: 2026-01-21 1 author
Unified Text-Image Generation with Weakness-Targeted Post-Training

Meta Platforms / University of Texas

Published on: 2026-01-21 1 author
Zebra-Llama: Towards Extremely Efficient Hybrid Models

AMD

Published on: 2026-01-20 1 author
From Chains to Graphs: Self-Structured Reasoning for General-Domain LLMs

Xiaomi / University of Tokyo

Published on: 2026-01-20 1 author
Learning Latent Action World Models In The Wild

Meta Platforms

Published on: 2026-01-20
SOFAI-LM: A Cognitive Architecture for Building Efficient and Reliable Reasoning Systems with LLMs

IBM

Published on: 2026-01-20 1 author
Small Models, Big Impact: Tool-Augmented AI Agents for Wireless Network Planning

King Abdullah University of Science and Technology (KAUST)

Published on: 2026-01-20 3 authors
Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time

Together AI / The University of Texas

Published on: 2026-01-19 1 author
RISER: Orchestrating Latent Reasoning Skills for Adaptive Activation Steering

Tencent / Tongji University

Published on: 2026-01-19 1 author
S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation

Snap / Northeastern University, China

Published on: 2026-01-19 1 author
Recurrent Confidence Chain: Temporal-Aware Uncertainty Quantification in Large Language Models

Published on: 2026-01-19 2 authors
Power Aware Dynamic Reallocation For Inference

AMD

Published on: 2026-01-18 1 author
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents

Intuit / Arizona State University

Published on: 2026-01-18 1 author
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Amazon / University of California

Published on: 2026-01-18 1 author
Agentic Reasoning for Large Language Models

Meta Platforms / Amazon Research, Google DeepMind, Meta, University of Illinois Urbana-Champaign

Published on: 2026-01-18 1 author
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Anthropic / Stanford University

Published on: 2026-01-17 1 author
FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Published on: 2026-01-16 7 authors
VINO: A Unified Visual Generator with Interleaved OmniModal Context

Kuaishou Technology / Shanghai Jiao Tong University

Published on: 2026-01-16
KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta

Meta Platforms

Published on: 2026-01-16 1 author
OCTOBENCH: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding

MiniMax / Fudan University

Published on: 2026-01-16 1 author
TranslateGemma Technical Report

Google

Published on: 2026-01-16
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

Published on: 2026-01-16 13 authors
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

Anthropic / University of Oxford

Published on: 2026-01-15 1 author
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Microsoft / MIT

Published on: 2026-01-15 1 author
Reasoning Models Generate Societies of Thought

Google

Published on: 2026-01-15 1 author
Hardware Acceleration for Neural Networks: A Comprehensive Survey

Arizona State University

Published on: 2026-01-15 1 author

Prev 16 17 18 19 20 21 22 23 24 25 26 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: