Papers
-
CineScene: Implicit 3D as Effective Scene Representation for Cinematic Video Generation
-
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos
-
Can Post-Training Transform LLMs into Causal Reasoners?Fudan University, Shanghai Artificial Intelligence Laboratory
-
Learning a Generative Meta-Model of LLM ActivationsUC Berkeley
-
Self-Consistency Improves Chain of Thought Reasoning in Language Models
-
Large Language Model Reasoning Failures
-
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
-
Learning to Discover at Test Time
-
MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
-
Is Your Paper Being Reviewed by an LLM? Benchmarking AI Text Detection in Peer Review
-
RISE-Video: Can Video Generators Decode Implicit World Rules?
-
Vector Quantization using Gaussian Variational Autoencoder
-
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning
-
Using a GPT-5-driven autonomous lab to optimize the cost and titer of cell-free protein synthesis
-
Knowledge-Intensive AgentsNortheastern University, China
-
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers
-
Asynchronous Reasoning: Training-Free Interactive Thinking LLMs
-
OpenOneRec Technical Report
-
Learning to Reason in 13 Parameters
-
A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces
-
LLMs as Orchestrators: Constraint-Compliant Multi-Agent Optimization for Recommendation Systems
-
IRIS: Implicit Reward-Guided Internal Sifting for Mitigating Multimodal Hallucination
-
BlossomRec: Block-level Fused Sparse Attention Mechanism for Sequential Recommendations
-
ReMiT: RL-Guided Mid-Training for Iterative LLM Evolution
-
HY3D-Bench: Generation of 3D Assets
-
Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory
-
CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability
-
LIVE: Long-horizon Interactive Video World Modeling
-
AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent
-
Generative Engine Optimization: A VLM and Agent Framework for Pinterest Acquisition Growth
-
Accelerating Scientific Research with Gemini: Case Studies and Common Techniques
-
Closing the Loop: Universal Repository Representation with RPG-Encoder
-
Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity
-
AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations
-
Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems
-
Generative AI for Enzyme Design and Biocatalysis
-
Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models
-
HunyuanImage 3.0 Technical Report
-
MAIN-VLA: Modeling Abstraction of Intention and eNvironment for Vision-Language-Action Models
-
OneMall: One Architecture, More Scenarios -- End-to-End Generative Recommender Family at Kuaishou E-Commerce
-
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
-
An Empirical Study on Noisy Data and LLM Pretraining Loss Divergence
-
Interpretable Tabular Foundation Models via In-Context Kernel Regression
-
RFS: Reinforcement Learning with Residual Flow Steering for Dexterous Manipulation
-
CUA-Skill: Develop Skills for Computer Using Agent
-
ReasonCACHE: Teaching LLMs To Reason Without Weight Updates
-
SimMerge: Learning to Select Merge Operators from Similarity Signals
-
Argument Rarity-based Originality Assessment for AI-Assisted WritingRitsumeikan Global Innovation Research Organization
-
AgentRx: Diagnosing AI Agent Failures from Execution Trajectories
-
X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests
