Papers

Filter by company

GLiGuard: Schema-Conditioned Classification for LLM Safeguard

Published on: 2026-05-08 4 authors
Fast Byte Latent Transformer

Published on: 2026-05-08 8 authors
Long Context Pre-Training with Lighthouse Attention

Published on: 2026-05-07 3 authors
Efficient Pre-Training with Token Superposition

Published on: 2026-05-07 3 authors
Continuous Latent Diffusion Language Model

Published on: 2026-05-07 11 authors
MiniMind-O Technical Report: An Open Small-Scale Speech-Native Omni Model

Published on: 2026-05-05 1 author
VLMaxxing through FrameMogging Training-Free Anti-Recomputation for Video Vision-Language Models

Published on: 2026-05-05 2 authors
Sharpness-Aware Pretraining Mitigates Catastrophic Forgetting

Published on: 2026-05-04 5 authors
HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness

Published on: 2026-05-04 11 authors
Model Spec Midtraining: Improving How Alignment Training Generalizes

Published on: 2026-05-03 4 authors
A Theory of Generalization in Deep Learning

Published on: 2026-05-02 2 authors
Writing Code vs. Shipping Code: Productivity Effects Across Generations of AI Coding Tools

Microsoft / Massachusetts Institute of Technology, National Bureau of Economic Research (NBER), University of Pennsylvania

Published on: 2026-05-01 Venue: NBER Working Paper Series, No. 35275 3 authors
Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs

Published on: 2026-05-01 9 authors
Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning

Published on: 2026-05-01 13 authors
Map2World: Segment Map Conditioned Text to 3D World Generation

Published on: 2026-05-01 5 authors
Let ViT Speak: Generative Language-Image Pre-training

Published on: 2026-05-01 10 authors
Contextual Agentic Memory is a Memo, Not True Memory

Published on: 2026-04-30 3 authors
From Context to Skills: Can Language Models Learn from Context Skillfully?

Published on: 2026-04-30 13 authors
Synthetic Computers at Scale for Long-Horizon Productivity Simulation

Published on: 2026-04-30 4 authors
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation

Published on: 2026-04-29 3 authors
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Published on: 2026-04-29 98 authors
DORA: A Scalable Asynchronous Reinforcement Learning System for Language Model Training

Published on: 2026-04-29 18 authors
Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding

Published on: 2026-04-29 18 authors
The Price of Agreement: Measuring LLM Sycophancy in Agentic Financial Applications

Published on: 2026-04-27 6 authors
Representational Curvature Modulates Behavioral Uncertainty in Large Language Models

Published on: 2026-04-27 3 authors
Frontier Coding Agents Can Now Implement an AlphaZero Self-Play Machine Learning Pipeline For Connect Four That Performs Comparably to an External Solver

Published on: 2026-04-27 3 authors
The Last Human-Written Paper: Agent-Native Research Artifacts

Published on: 2026-04-27 37 authors
Long-Context Aware Upcycling: A New Frontier for Hybrid LLM Scaling

Published on: 2026-04-27 10 authors
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Published on: 2026-04-27 12 authors
Kwai Summary Attention Technical Report

Published on: 2026-04-27 38 authors
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Published on: 2026-04-27 15 authors
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company

Published on: 2026-04-24 8 authors
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Published on: 2026-04-24 42 authors
Video Analysis and Generation via a Semantic Progress Function

Published on: 2026-04-24 5 authors
The Recurrent Transformer: Greater Effective Depth and Efficient Decoding

Published on: 2026-04-23 6 authors
There Will Be a Scientific Theory of Deep Learning

Published on: 2026-04-23 14 authors
Hyperloop Transformers

Published on: 2026-04-23 3 authors
AgenticQwen: Training Small Agentic Language Models with Dual Data Flywheels for Industrial-Scale Tool Use

Published on: 2026-04-23 7 authors
Building a Precise Video Language with Human-AI Oversight

Published on: 2026-04-22 16 authors
SWE-chat: Coding Agent Interactions From Real Users in the Wild

Published on: 2026-04-22 6 authors
Image Generators are Generalist Vision Learners

Published on: 2026-04-22 25 authors
Synthesizing Multi-Agent Harnesses for Vulnerability Discovery

Published on: 2026-04-22 7 authors
Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Published on: 2026-04-20 20 authors
OpenGame: Open Agentic Coding for Games

Published on: 2026-04-20 11 authors
Why Fine-Tuning Encourages Hallucinations and How to Fix It

Published on: 2026-04-16 8 authors
Discovering Novel LLM Experts via Task-Capability Coevolution

Published on: 2026-04-16 5 authors
Autonomous Evolution of EDA Tools: Multi-Agent Self-Evolved ABC

Published on: 2026-04-16 2 authors
Language models transmit behavioural traits through hidden signals in data

Anthropic / Alignment Research Center, Anthropic, Truthful AI, UC Berkeley, Warsaw University of Technology

Published on: 2026-04-15 Venue: Nature (Volume 652, Pages 615–621) 9 authors
Accelerating Speculative Decoding with Block Diffusion Draft Trees

Technion – Israel Institute of Technology

Published on: 2026-04-15 2 authors
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems

Published on: 2026-04-14 4 authors

Prev 1 2 3 4 5 6 7 8 Next

Go to section

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: