Papers
-
Scaling Laws for Agent Harnesses via Effective Feedback Compute
-
Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents
-
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments
-
AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation
-
Self-Improving Language Models with Bidirectional Evolutionary Search
-
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players
-
Elias in the Lighthouse, Again? Diagnosing Low Diversity in LLM Stories
-
Laguna M.1/XS.2 Technical Report
-
Learn from your own latents and not from tokens: A sample-complexity theory
-
MobileMoE: Scaling On-Device Mixture of Experts
-
Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini
-
The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence
-
When Does LeJEPA Learn a World Model?
-
Unified Neural Scaling Laws
-
Language Models Need Sleep
-
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence
-
Training-Free Looped Transformers
-
Polar: Agentic RL on Any Harness at Scale
-
GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction
-
SkillOpt: Executive Strategy for Self-Evolving Agent Skills
-
Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings
-
Forecasting Scientific Progress with Artificial Intelligence
-
Vector Policy Optimization: Training for Diversity Improves Test-Time Search
-
Advancing Mathematics Research with AI-Driven Formal Proof Search
-
HRM-Text: Efficient Pretraining Beyond Scaling
-
HRM-Text: Efficient Pretraining Beyond Scaling
-
OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under optimal Squared error quantization
-
Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation
-
CogOmniControl: Reasoning-Driven Controllable Video Generation via Creative Intent Cognition
-
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration
-
Spectral classification of brown dwarfs using machine learning
-
Generative Recursive Reasoning
-
WavFlow: Audio Generation in Waveform Space
-
Stable Audio 3
-
ASPI: Seeking Ambiguity Clarification Amplifies Prompt Injection Vulnerability in LLM Agents
-
Look Before You Leap: Autonomous Exploration for LLM Agents
-
ReactiveGWM: Steering NPC in Reactive Game World Models
-
Is Grep All You Need? How Agent Harnesses Reshape Agentic Search
-
FutureSim: Replaying World Events to Evaluate Adaptive Agents
-
Self-Distilled Agentic Reinforcement Learning
-
Useful Memories Become Faulty When Continuously Updated by LLMs
-
Targeted Neuron Modulation via Contrastive Pair Search
-
Slicing and Dicing: Configuring Optimal Mixtures of Experts
-
$δ$-mem: Efficient Online Memory for Large Language Models
-
Solve the Loop: Attractor Models for Language and Reasoning
-
ELF: Embedded Language Flows
-
Reinforce Adjoint Matching: Scaling RL Post-Training of Diffusion and Flow-Matching Models
-
The Truth Lies Somewhere in the Middle (of the Generated Tokens)
-
Qwen-Image-2.0 Technical Report
-
Quantifying the Utility of User Simulators for Building Collaborative LLM Assistants
MongoDB - Build AI That Scales
