Papers

STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks

Amazon / University of Massachusetts Amherst

Published on: 2026-03-05 1 author
AI+HW 2035: Shaping the Next Decade

NVIDIA / University of Illinois Urbana-Champaign

Published on: 2026-03-05 1 author
FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling

Meta Platforms, NVIDIA, Google, Together AI / Princeton University

Published on: 2026-03-05 1 author
Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline

NVIDIA / Nanjing University

Published on: 2026-03-05 1 author
KARL: Knowledge Agents via Reinforcement Learning

Databricks

Published on: 2026-03-05 1 author
FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning

Adobe / University of California

Published on: 2026-03-05 1 author
AgentIR: Reasoning-Aware Retrieval for Deep Research Agents

Carnegie Mellon University, University of Queensland, University of Waterloo

Published on: 2026-03-04 6 authors
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration

Alibaba

Published on: 2026-03-04 5 authors
Adaptive Memory Admission Control for LLM Agents

Workday

Published on: 2026-03-04 1 author
ZipMap: Linear-Time Stateful 3D Reconstruction with Test-Time Training

Google / Cornell University, MIT

Published on: 2026-03-04 1 author
Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

Microsoft / The University of Texas

Published on: 2026-03-04 1 author
Single-minus graviton tree amplitudes are nonzero

OpenAI / Vanderbilt University

Published on: 2026-03-04 1 author
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning

Meta Platforms / Duke University

Published on: 2026-03-04 1 author
Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges & Faces Selection

DeepSeek / Beihang University

Published on: 2026-03-04 1 author
RoboCasa365: A Large-Scale Simulation Framework for Training and Benchmarking Generalist Robots

NVIDIA / The University of Texas

Published on: 2026-03-04 1 author
InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions

Meta Platforms, Adobe / University of Memphis, University of Oregon

Published on: 2026-03-04 1 author
ManipulationNet: An Infrastructure for Benchmarking Real-World Robot Manipulation with Physical Skill Challenges and Embodied Multimodal Reasoning

NVIDIA / Rice University

Published on: 2026-03-04 1 author
V1: Unifying Generation and Self-Verification for Parallel Reasoners

NVIDIA, Together AI / UC Berkeley

Published on: 2026-03-04 1 author
Phi-4-reasoning-vision-15B Technical Report

Microsoft

Published on: 2026-03-04 1 author
Helios: Real Real-Time Long Video Generation Model

ByteDance / Peking University

Published on: 2026-03-04 1 author
EvoSkill: Automated Skill Discovery for Multi-Agent Systems

Sentient / Virginia Tech

Published on: 2026-03-03 5 authors
Speculative Speculative Decoding

Together AI / Stanford University

Published on: 2026-03-03 1 author
OneRanker: Unified Generation and Ranking with One Model in Industrial Advertising Recommendation

Tencent

Published on: 2026-03-03 1 author
UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Microsoft / Shanghai Jiao Tong University

Published on: 2026-03-03 1 author
NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing

Tencent

Published on: 2026-03-03 1 author
Kling-MotionControl Technical Report

Kuaishou Technology

Published on: 2026-03-03 1 author
Architecting Trust in Artificial Epistemic Agents

Google

Published on: 2026-03-03 1 author
Utonia: Toward One Encoder for All Point Clouds

Xiaomi / The University of Hong Kong

Published on: 2026-03-03 1 author
LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Google / UC Berkeley

Published on: 2026-03-03 1 author
Beyond Pixel Histories: World Models with Persistent 3D State

Microsoft / University of Edinburgh

Published on: 2026-03-03 1 author
Heterogeneous Agent Collaborative Reinforcement Learning

ByteDance

Published on: 2026-03-03 1 author
Beyond Language Modeling: An Exploration of Multimodal Pretraining

Meta Platforms / New York University

Published on: 2026-03-03 1 author
Modular Memory is the Key to Continual Learning Agents

Microsoft / University of Bremen

Published on: 2026-03-02 1 author
Expanding LLM Agent Boundaries with Strategy-Guided Exploration

Apple

Published on: 2026-03-02 1 author
ROBOMETER: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons

NVIDIA / University of Southern California

Published on: 2026-03-02 2 authors
LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving

Xiaomi

Published on: 2026-03-02 1 author
Agentic Code Reasoning

Meta Platforms

Published on: 2026-03-02 1 author
CuTe Layout Representation and Algebra

NVIDIA

Published on: 2026-03-02 1 author
RubricBench: Aligning Model-Generated Rubrics with Human Standards

Tencent / University of Illinois Springfield

Published on: 2026-03-02 1 author
WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories

Tencent / Zhejiang University

Published on: 2026-03-02 1 author
How Well Does Agent Development Reflect Real-World Work?

Published on: 2026-03-01 10 authors
Learn Hard Problems During RL with Reference Guided Fine-tuning

ByteDance / UC Berkeley

Published on: 2026-03-01 1 author
SSKG Hub: An Expert-Guided Platform for LLM-Empowered Sustainability Standards Knowledge Graphs

Alibaba

Published on: 2026-02-28 1 author
Process-of-Thought Reasoning for Videos

Snap

1 author
Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models

Helmholtz Munich, Technical University of Munich, University of Tübingen

Published on: 2026-02-27 3 authors
EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language Models

Xiaomi / Wuhan University

Published on: 2026-02-27 1 author
Mode Seeking meets Mean Seeking for Fast Long Video Generation

NVIDIA

Published on: 2026-02-27 1 author
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

ByteDance / Tsinghua University

Published on: 2026-02-27 1 author
MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video Understanding

Xiaomi / Tongji University

Published on: 2026-02-26 1 author
ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding

Xiaomi / Huazhong University of Science and Technology

Published on: 2026-02-26 1 author
