Papers
-
Where can AI be used? Insights from a deep ontology of work activities (Featured)
-
Developments in Artificial Intelligence markets: New indicators based on model characteristics, prices and providers (Featured)
-
MemoryWAM: Efficient World Action Modeling with Persistent Memory
-
Reinforcement Learning Towards Broadly and Persistently Beneficial Models
-
S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence
-
Agentic Robot Policy Self-Improvement in the Real World
-
What Must Generalist Agents Remember?
-
Do as I Do: Dexterous Manipulation Data from Everyday Human Videos
-
MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction
-
EgoInfinity: A Web-Scale 4D Hand-Object Interaction Data Engine for Any-View Robot Retargeting and Video-to-Action Robot Learning
-
Looped World Models
-
Variable-Width Transformers
-
VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models
-
Latent Thought Flow: Efficient Latent Reasoning in Large Language Models
-
TopoRetarget: Interaction-Preserving Retargeting for Dexterous Manipulation
-
DreamX-World 1.0: A General-Purpose Interactive World Model
-
Human Universal Grasping
-
ART-Glove: Articulated Tactile Glove for Contact-Grounded Dexterous Interaction Capture
-
Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation
-
You Don't Need Strong Assumptions: Visual Representation Learning via Temporal Differences
-
SimWeaver: Zero-Shot RGB Sim-to-Real for Deformable Manipulation
-
Universal Manipulation Exoskeleton: Learning Compliant Whole-body Policies with Real-time Torque Feedback
-
Efficient On-Device Diffusion LLM Inference with Mobile NPU
-
VHDLSuite: Unified Pipeline for LLM VHDL Generation with Data Synthesis and Evaluation
-
$μ_0$: A Scalable 3D Interaction-Trace World Model
-
Surflo: Consistent 3D Surface Flow Model with Global State
-
MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling
-
MiniMax Sparse Attention
-
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement
-
Ambient Diffusion Policy: Imitation Learning from Suboptimal Data in Robotics
-
Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation
-
When is Your LLM Steerable?
-
Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling
-
RLCSD: Reinforcement Learning with Contrastive On-Policy Self-Distillation
-
From AGI to ASI
-
FACTR 2: Learning External Force Sensing for Commodity Robot Arms Improves Policy Learning
-
Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning
-
Recalling Too Well: Sycophancy Evaluation and Mitigation in Memory-Augmented Models
-
How AI Agents Reshape Knowledge Work: Autonomy, Efficiency, and Scope
-
Latent Reasoning with Normalizing Flows
-
Slim attention: cut your context memory in half without loss – K-cache is all you need for MHA
-
MAI-Thinking-1: Building a Hill-Climbing Machine
-
AFUN: Towards an Affordance Foundation Model for Functionality Understanding
-
MPMWorlds: Material-Point-Method Simulations for Inferring and Extrapolating Physical Dynamics
-
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses
-
RealityTest: How People Probe AI Identity and Whether Models Disclose It
-
Representation Forcing for Bottleneck-Free Unified Multimodal Models
-
mRNAutilus: Multi-Objective-Guided Discrete Generation of mRNA with Optimized Therapeutic Properties
