Papers

The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?

Anthropic / University of Edinburgh

Published on: 2026-01-30
Lost in Transmission: When and Why LLMs Fail to Reason Globally

Microsoft

Published on: 2026-01-30 1 author
Scaling Embeddings Outperforms Scaling Experts in Language Models

Published on: 2026-01-29 16 authors
PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

Baidu

Published on: 2026-01-29 15 authors
PI-Light: Physics-Inspired Diffusion for Full-Image Relighting

Tencent / Nanyang Technological University

Published on: 2026-01-29 1 author
How AI Impacts Skill Formation

Published on: 2026-01-28 2 authors
M2XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization

AMD / Shanghai Jiao Tong University

Published on: 2026-01-28 1 author
Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding

Tencent

Published on: 2026-01-28 1 author
Thought-Transfer: Indirect Targeted Poisoning Attacks on Chain-of-Thought Reasoning Models

Anthropic / Northeastern University, China

Published on: 2026-01-28 1 author
DeepSeek-OCR 2: Visual Causal Flow

DeepSeek

Published on: 2026-01-28 1 author
WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models

Moonshot AI

Published on: 2026-01-28 1 author
Efficient Autoregressive Video Diffusion with Dummy Head

Microsoft / ETH Zurich, Johns Hopkins University, Tsinghua University, University of Science and Technology of China

Published on: 2026-01-28 1 author
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Google / University of Washington

Published on: 2026-01-28 Venue: NeurIPS 2022 8 authors
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision

Tencent

Published on: 2026-01-27 1 author
Astra: General Interactive World Model with Autoregressive Denoising

Kuaishou Technology / Tsinghua University

Published on: 2026-01-27 1 author
Towards Pixel-Level VLM Perception via Simple Points Prediction

Moonshot AI / Nanjing University

Published on: 2026-01-27 1 author
Differentiable Semantic ID for Generative Recommendation

Amazon / University of Glasgow

Published on: 2026-01-27 1 author
Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory

Intel / Massachusetts Institute of Technology, Tsinghua University

Published on: 2026-01-26 1 author
Agentic Very Long Video Understanding

Meta Platforms / University of Wisconsin-Madison

Published on: 2026-01-26 1 author
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Meituan / Fudan University

Published on: 2026-01-23 1 author
AnyView: Synthesizing Any Novel View in Dynamic Scenes

Amazon / Toyota Research Institute

Published on: 2026-01-23 1 author
Latent Diffusion for Internet of Things Attack Data Generation in Intrusion Detection Systems

Universidad Rey Juan Carlos

Published on: 2026-01-23 2 authors
DSGym: A Holistic Framework for Evaluating and Training Data Science Agents

Together AI / Stanford University

Published on: 2026-01-22 1 author
CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback

Kuaishou Technology / Hong Kong University of Science and Technology

Published on: 2026-01-22 1 author
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Microsoft / Hong Kong University of Science and Technology, Massachusetts Institute of Technology, Shanghai Artificial Intelligence Laboratory, Shanghai Jiao Tong University, Tsinghua University

Published on: 2026-01-22 1 author
DeepASMR: LLM-Based Zero-Shot ASMR Speech Generation for Anyone of Any Voice

Eleven Labs / Shanghai Jiao Tong University

Published on: 2026-01-22 1 author
RayRoPE: Projective Ray Positional Encoding for Multi-view Attention

Apple / Carnegie Mellon University

Published on: 2026-01-21 1 author
OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis

Snap / University of Washington

Published on: 2026-01-21 1 author
Unified Text-Image Generation with Weakness-Targeted Post-Training

Meta Platforms / University of Texas

Published on: 2026-01-21 1 author
Zebra-Llama: Towards Extremely Efficient Hybrid Models

AMD

Published on: 2026-01-20 1 author
From Chains to Graphs: Self-Structured Reasoning for General-Domain LLMs

Xiaomi / University of Tokyo

Published on: 2026-01-20 1 author
Learning Latent Action World Models In The Wild

Meta Platforms / National Institute for Research in Digital Science and Technology, New York University

Published on: 2026-01-20
SOFAI-LM: A Cognitive Architecture for Building Efficient and Reliable Reasoning Systems with LLMs

IBM

Published on: 2026-01-20 1 author
Small Models, Big Impact: Tool-Augmented AI Agents for Wireless Network Planning

King Abdullah University of Science and Technology (KAUST)

Published on: 2026-01-20 3 authors
Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time

Together AI / University of Texas

Published on: 2026-01-19 1 author
RISER: Orchestrating Latent Reasoning Skills for Adaptive Activation Steering

Tencent / Tongji University

Published on: 2026-01-19 1 author
S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation

Snap / Northeastern University, China

Published on: 2026-01-19 1 author
Recurrent Confidence Chain: Temporal-Aware Uncertainty Quantification in Large Language Models

University of Florida

Published on: 2026-01-19 2 authors
Adaptive Rotary Steering with Joint Autoregression for Robust Extraction of Closely Moving Speakers in Dynamic Scenarios

Published on: 2026-01-18 2 authors
Power Aware Dynamic Reallocation For Inference

AMD

Published on: 2026-01-18 1 author
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents

Intuit / Arizona State University

Published on: 2026-01-18 1 author
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Amazon / University of California

Published on: 2026-01-18 1 author
Agentic Reasoning for Large Language Models

Meta Platforms / Amazon Research, Google DeepMind, Meta, University of Illinois Urbana-Champaign

Published on: 2026-01-18 1 author
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Anthropic / Stanford University

Published on: 2026-01-17 1 author
Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration

Published on: 2026-01-16 6 authors
FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Flash Labs

Published on: 2026-01-16 7 authors
VINO: A Unified Visual Generator with Interleaved OmniModal Context

Kuaishou Technology / Shanghai Jiao Tong University

Published on: 2026-01-16
KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta

Meta Platforms

Published on: 2026-01-16 1 author
OCTOBENCH: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding

MiniMax / Fudan University

Published on: 2026-01-16 1 author
TranslateGemma Technical Report

Google

Published on: 2026-01-16
