Papers
-
GLiGuard: Schema-Conditioned Classification for LLM Safeguard
-
Fast Byte Latent Transformer
-
Long Context Pre-Training with Lighthouse Attention
-
Efficient Pre-Training with Token Superposition
-
Continuous Latent Diffusion Language Model
-
MiniMind-O Technical Report: An Open Small-Scale Speech-Native Omni Model
-
VLMaxxing through FrameMogging Training-Free Anti-Recomputation for Video Vision-Language Models
-
Sharpness-Aware Pretraining Mitigates Catastrophic Forgetting
-
HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness
-
Model Spec Midtraining: Improving How Alignment Training Generalizes
-
A Theory of Generalization in Deep Learning
-
Writing Code vs. Shipping Code: Productivity Effects Across Generations of AI Coding ToolsMicrosoft / Massachusetts Institute of Technology, National Bureau of Economic Research (NBER), University of Pennsylvania
-
Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs
-
Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning
-
Map2World: Segment Map Conditioned Text to 3D World Generation
-
Let ViT Speak: Generative Language-Image Pre-training
-
Contextual Agentic Memory is a Memo, Not True Memory
-
From Context to Skills: Can Language Models Learn from Context Skillfully?
-
Synthetic Computers at Scale for Long-Horizon Productivity Simulation
-
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation
-
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
-
DORA: A Scalable Asynchronous Reinforcement Learning System for Language Model Training
-
Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding
-
The Price of Agreement: Measuring LLM Sycophancy in Agentic Financial Applications
-
Representational Curvature Modulates Behavioral Uncertainty in Large Language Models
-
Frontier Coding Agents Can Now Implement an AlphaZero Self-Play Machine Learning Pipeline For Connect Four That Performs Comparably to an External Solver
-
The Last Human-Written Paper: Agent-Native Research Artifacts
-
Long-Context Aware Upcycling: A New Frontier for Hybrid LLM Scaling
-
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
-
Kwai Summary Attention Technical Report
-
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
-
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company
-
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond
-
Video Analysis and Generation via a Semantic Progress Function
-
The Recurrent Transformer: Greater Effective Depth and Efficient Decoding
-
There Will Be a Scientific Theory of Deep Learning
-
Hyperloop Transformers
-
AgenticQwen: Training Small Agentic Language Models with Dual Data Flywheels for Industrial-Scale Tool Use
-
Building a Precise Video Language with Human-AI Oversight
-
SWE-chat: Coding Agent Interactions From Real Users in the Wild
-
Image Generators are Generalist Vision Learners
-
Synthesizing Multi-Agent Harnesses for Vulnerability Discovery
-
Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence
-
OpenGame: Open Agentic Coding for Games
-
Why Fine-Tuning Encourages Hallucinations and How to Fix It
-
Discovering Novel LLM Experts via Task-Capability Coevolution
-
Autonomous Evolution of EDA Tools: Multi-Agent Self-Evolved ABC
-
Language models transmit behavioural traits through hidden signals in dataAnthropic / Alignment Research Center, Anthropic, Truthful AI, UC Berkeley, Warsaw University of Technology
-
Accelerating Speculative Decoding with Block Diffusion Draft TreesTechnion – Israel Institute of Technology
-
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems
MongoDB - Build AI That Scales
