Papers

Slim attention: cut your context memory in half without loss – K-cache is all you need for MHA

Openmachine

Published on: 2025-06-03 1 author
Process-of-Thought Reasoning for Videos

Snap

1 author
Mode Seeking meets Mean Seeking for Fast Long Video Generation

NVIDIA

Published on: 2026-02-27 1 author
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

ByteDance / Tsinghua University

Published on: 2026-02-27 1 author
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Microsoft

Published on: 2026-02-26 1 author
SceneTransporter: Optimal Transport-Guided Compositional Latent Diffusion for Single-Image Structured 3D Scene Generation

Tsinghua University

Published on: 2026-02-26 1 author
SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model

Published on: 2026-02-26 1 author
Generative Recommendation for Large-Scale Advertising

Kuaishou Technology

Published on: 2026-02-26 1 author
VGG-T3: Offline Feed-Forward 3D Reconstruction at Scale

NVIDIA / University of Toronto

Published on: 2026-02-26 1 author
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference

DeepSeek

Published on: 2026-02-26 1 author
TrajTok: Learning Trajectory Tokens enables better Video Understanding

Apple / University of Washington

Published on: 2026-02-26 1 author
World Guidance: World Modeling in Condition Space for Action Generation

ByteDance / The University of Hong Kong

Published on: 2026-02-25 1 author
The Design Space of Tri-Modal Masked Diffusion Models

Apple / University of Cambridge

Published on: 2026-02-25 1 author
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

Microsoft / University of Illinois Urbana-Champaign

Published on: 2026-02-25 1 author
Test-Time Training with KV Binding Is Secretly Linear Attention

NVIDIA

Published on: 2026-02-24 1 author
gQIR: Generative Quanta Image Reconstruction

Snap / University of Wisconsin-Madison

Published on: 2026-02-23 1 author
Compositional Planning with Jumpy World Models

Meta Platforms / McGill University

Published on: 2026-02-23 1 author
SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning

Amazon / The Pennsylvania State University

Published on: 2026-02-23 1 author
Toward the Thermodynamic Limit: Neural Operators for Non-equilibrium Dynamics of Mott Insulators

NVIDIA

Published on: 2026-02-23 1 author
Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

Apple / Stanford University

Published on: 2026-02-23 1 author
Haitao Lin

Tencent / Fudan University

Published on: 2026-02-23 1 author
How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1

Meituan

Published on: 2026-02-23 1 author
Discovering Multiagent Learning Algorithms with Large Language Models

Google

Published on: 2026-02-21 1 author
ASTRA-bench: Evaluating Tool-Use Agent Reasoning and Action Planning with Personal User Context

Apple

Published on: 2026-02-20 1 author
Wink: Recovering from Misbehaviors in Coding Agents

Meta Platforms

Published on: 2026-02-20 1 author
Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space

ByteDance

Published on: 2026-02-20 1 author
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control

Stanford University

Published on: 2026-02-20
The Geometry of Noise: Why Diffusion Models Don't Need Noise Conditioning

Google

Published on: 2026-02-20 1 author
SARAH: Spatially Aware Real-time Agentic Humans

Meta Platforms

Published on: 2026-02-20 1 author
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

MiniMax

1 author
El Agente Gráfico: Structured Execution Graphs for Scientific Agents

NVIDIA / University of Toronto

Published on: 2026-02-19 1 author
Unified Latents (UL): How to train your latents

Google / Google DeepMind

Published on: 2026-02-19 1 author
Tuning-free Visual Effect Transfer across Videos

Snap / Carnegie Mellon University

Published on: 2026-02-18 1 author
EVMbench: Evaluating AI Agents on Smart Contract Security

OpenAI

Published on: 2026-02-18 1 author
EgoScale: Scaling Dexterous Manipulation with Diverse Egocentric Human Data

NVIDIA / University of California

Published on: 2026-02-18 1 author
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching

Amazon / UC Berkeley

Published on: 2026-02-17 1 author
jina-embeddings-v5-text: Task-Targeted Embedding Distillation

Jina AI

Published on: 2026-02-17 1 author
World Action Models are Zero-shot Policies

NVIDIA

Published on: 2026-02-17 1 author
On Surprising Effectiveness of Masking Updates in Adaptive Optimizers

Google / Northwestern University

Published on: 2026-02-17 1 author
GLM-5: from Vibe Coding to Agentic Engineering

Z.ai / Tsinghua University

Published on: 2026-02-17
Image Generation with a Sphere Encoder

Meta Platforms

Published on: 2026-02-16 1 author
BitDance: Scaling Autoregressive Generative Models with Binary Tokens

ByteDance / The Chinese University of Hong Kong

Published on: 2026-02-15 1 author
Experiential Reinforcement Learning

Microsoft / University of Southern California

Published on: 2026-02-15 1 author
Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety

OpenAI / Amazon, Anthropic, Meta

Published on: 2026-02-15 1 author
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation

Kuaishou Technology

Published on: 2026-02-14 1 author
Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

Apple / UC Berkeley

Published on: 2026-02-14 1 author
Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning

NVIDIA

1 author
WizardLM: Empowering large pre-trained language models to follow complex instructions

Microsoft / Peking University

1 author
