Papers

Filter by company

World Guidance: World Modeling in Condition Space for Action Generatio

ByteDance / The University of Hong Kong

Published on: 2026-02-25 1 author
The Design Space of Tri-Modal Masked Diffusion Models

Apple / University of Cambridge

Published on: 2026-02-25 1 author
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

Microsoft / University of Illinois Urbana-Champaign

Published on: 2026-02-25 1 author
UFO: Unifying Feed-Forward and Optimization-based Methods for Large Driving Scene Modeling

Xiaomi / University of Illinois Urbana-Champaign

Published on: 2026-02-24 1 author
From Pairs to Sequences: Track-Aware Policy Gradients for Keypoint Detection

Xiaomi / Wuhan University

Published on: 2026-02-24 1 author
VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving

Xiaomi / Tianjin University

Published on: 2026-02-24 1 author
Test-Time Training with KV Binding Is Secretly Linear Attention

NVIDIA

Published on: 2026-02-24 1 author
S-PRESSO: Ultra Low Bitrate Sound Effect Compression With Diffusion Autoencoders And Offline Quantization

Sony Group Corporation (AIBO) / Institut Polytechnique de Paris

Published on: 2026-02-23 1 author
gQIR: Generative Quanta Image Reconstruction

Snap / University of Wisconsin-Madison

Published on: 2026-02-23 1 author
Compositional Planning with Jumpy World Models

Meta Platforms / McGill University

Published on: 2026-02-23 1 author
SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning

Amazon / The Pennsylvania State University

Published on: 2026-02-23 1 author
Toward the Thermodynamic Limit: Neural Operators for Non-equilibrium Dynamics of Mott Insulators

NVIDIA

Published on: 2026-02-23 1 author
Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

Apple / Stanford University

Published on: 2026-02-23 1 author
Haitao Lin

Tencent / Fudan University

Published on: 2026-02-23 1 author
How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1

Meituan

Published on: 2026-02-23 1 author
Event-Triggered Gossip for Distributed Learning

Intel

Published on: 2026-02-22 1 author
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning

Meituan

Published on: 2026-02-21 1 author
Discovering Multiagent Learning Algorithms with Large Language Models

Google

Published on: 2026-02-21 1 author
HyTRec: A Hybrid Temporal-Aware Attention Architecture for Long Behavior Sequential Recommendation

Tencent / Wuhan University

Published on: 2026-02-20 1 author
ASTRA-bench: Evaluating Tool-Use Agent Reasoning and Action Planning with Personal User Context

Apple

Published on: 2026-02-20 1 author
Wink: Recovering from Misbehaviors in Coding Agents

Meta Platforms

Published on: 2026-02-20 1 author
Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space

ByteDance

Published on: 2026-02-20 1 author
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control

Stanford University

Published on: 2026-02-20
The Geometry of Noise: Why Diffusion Models Don't Need Noise Conditioning

Google

Published on: 2026-02-20 1 author
SARAH: Spatially Aware Real-time Agentic Humans

Meta Platforms

Published on: 2026-02-20 1 author
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

MiniMax

1 author
Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment

Sony Group Corporation (AIBO) / Stanford University

Published on: 2026-02-19 1 author
El Agente Gráfico: Structured Execution Graphs for Scientific Agents

NVIDIA / University of Toronto

Published on: 2026-02-19 1 author
Unified Latents (UL): How to train your latents

Google / Google DeepMind

Published on: 2026-02-19 1 author
Multi-agent cooperation through in-context co-player inference

Published on: 2026-02-18 7 authors
Factored Latent Action World Models

Sony Group Corporation (AIBO)

Published on: 2026-02-18 1 author
Tuning-free Visual Effect Transfer across Videos

Snap / Carnegie Mellon University

Published on: 2026-02-18 1 author
EVMbench: Evaluating AI Agents on Smart Contract Security

OpenAI

Published on: 2026-02-18 1 author
EgoScale: Scaling Dexterous Manipulation with Diverse Egocentric Human Data

NVIDIA / University of California

Published on: 2026-02-18 1 author
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching

Amazon / UC Berkeley

Published on: 2026-02-17 1 author
jina-embeddings-v5-text: Task-Targeted Embedding Distillation

Jina AI

Published on: 2026-02-17 1 author
World Action Models are Zero-shot Policies

NVIDIA

Published on: 2026-02-17 1 author
On Surprising Effectiveness of Masking Updates in Adaptive Optimizers

Google / Northwestern University

Published on: 2026-02-17 1 author
GLM-5: from Vibe Coding to Agentic Engineering

Z.ai / Tsinghua University

Published on: 2026-02-17
Image Generation with a Sphere Encoder

Meta Platforms

Published on: 2026-02-16 1 author
GUI-GENESIS: Automated Synthesis of Efficient Environments with Verifiable Rewards for GUI Agent Post-Training

Tencent, AMD / Peking University

Published on: 2026-02-15 1 author
OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention

Tencent

Published on: 2026-02-15 1 author
BitDance: Scaling Autoregressive Generative Models with Binary Tokens

ByteDance / The Chinese University of Hong Kong

Published on: 2026-02-15 1 author
Experiential Reinforcement Learning

Microsoft / University of Southern California

Published on: 2026-02-15 1 author
Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety

OpenAI / Amazon, Anthropic, Meta

Published on: 2026-02-15 1 author
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

AMD

Published on: 2026-02-14 1 author
Hippocampus: An Efficient and Scalable Memory Module for Agentic AI

Intel

Published on: 2026-02-14 1 author
Joint Time Series Chain: Detecting Unusual Evolving Trend across Time Series

Intel

Published on: 2026-02-14 1 author
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation

Kuaishou Technology

Published on: 2026-02-14 1 author
Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

Apple / UC Berkeley

Published on: 2026-02-14 1 author

Prev 11 12 13 14 15 16 17 18 19 20 21 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: