Papers
-
HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing
-
VirtualEnv: A Platform for Embodied AI Research
-
Intelligence Explosion
-
DriveWorld-VLA: Unified Latent-Space World Modeling with Vision-Language-Action for Autonomous Driving
-
ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training
-
CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs
-
CineScene: Implicit 3D as Effective Scene Representation for Cinematic Video Generation
-
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos
-
Can Post-Training Transform LLMs into Causal Reasoners? (Fudan University, Shanghai Artificial Intelligence Laboratory)
-
Learning a Generative Meta-Model of LLM Activations (UC Berkeley)
-
Self-Consistency Improves Chain of Thought Reasoning in Language Models
-
Large Language Model Reasoning Failures (Carleton College, Stanford University)
-
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations (TikTok / Hong Kong University of Science and Technology, Nanyang Technological University, The Chinese University of Hong Kong)
-
Learning to Discover at Test Time
-
MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
-
Is Your Paper Being Reviewed by an LLM? Benchmarking AI Text Detection in Peer Review
-
RISE-Video: Can Video Generators Decode Implicit World Rules?
-
Vector Quantization using Gaussian Variational Autoencoder
-
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning
-
Using a GPT-5-driven autonomous lab to optimize the cost and titer of cell-free protein synthesis
-
Knowledge-Intensive Agents (Northeastern University, China)
-
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers
-
Asynchronous Reasoning: Training-Free Interactive Thinking LLMs
-
OpenOneRec Technical Report
-
Learning to Reason in 13 Parameters
-
A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces
-
LLMs as Orchestrators: Constraint-Compliant Multi-Agent Optimization for Recommendation Systems
-
IRIS: Implicit Reward-Guided Internal Sifting for Mitigating Multimodal Hallucination
-
BlossomRec: Block-level Fused Sparse Attention Mechanism for Sequential Recommendations
-
ReMiT: RL-Guided Mid-Training for Iterative LLM Evolution
-
HY3D-Bench: Generation of 3D Assets
-
Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory
-
CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability
-
LIVE: Long-horizon Interactive Video World Modeling
-
AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent
-
Generative Engine Optimization: A VLM and Agent Framework for Pinterest Acquisition Growth
-
Accelerating Scientific Research with Gemini: Case Studies and Common Techniques
-
Closing the Loop: Universal Repository Representation with RPG-Encoder
-
Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity
-
AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations
-
Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems
-
Generative AI for Enzyme Design and Biocatalysis
-
Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models
-
HunyuanImage 3.0 Technical Report
-
MAIN-VLA: Modeling Abstraction of Intention and eNvironment for Vision-Language-Action Models
-
OneMall: One Architecture, More Scenarios -- End-to-End Generative Recommender Family at Kuaishou E-Commerce
-
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
-
An Empirical Study on Noisy Data and LLM Pretraining Loss Divergence
-
Interpretable Tabular Foundation Models via In-Context Kernel Regression
-
RFS: Reinforcement Learning with Residual Flow Steering for Dexterous Manipulation
