Papers
-
Abstractive Red-Teaming of Language Model Character
-
Kelix Technical Report
-
DSO: Direct Steering Optimization for Bias Mitigation
-
LLM-in-Sandbox Elicits General Agentic Intelligence
-
Visual Foresight for Robotic Stow: A Diffusion-Based World Model from Sparse Snapshots
-
On-Policy Context Distillation for Language Models
-
Iterative Reranking as a Compute-Scaling Method for LLM-based Rankers
-
KG-CRAFT: Knowledge graph-based contrastive reasoning with LLMs for enhancing automated fact-checking
-
Pattern Discovery with Wide-Lens Analysis and Sharp-Focus Validation
-
ChatLLM network: More brains, more intelligence (Beijing Institute of Technology)
-
CODE-SHARP: Continuous Open-ended Discovery and Evolution of Skills as Hierarchical Reward Programs
-
Gradients Must Earn Their Influence: Unifying SFT with Generalized Entropic Objectives
-
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
-
SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices
-
BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation
-
Voxtral Realtime
-
Accelerating Mathematical and Scientific Discovery with Gemini Deep Think
-
OmniSapiens: A Foundation Model for Social Behavior Processing via HARPO (MIT, National University of Singapore)
-
GameDevBench: Evaluating Agentic Capabilities Through Game Development (Wayne Chi, Yixiong Fang, Arnav Yayavaram, Siddharth Yayavaram, Seth Karten; Carnegie Mellon University, Princeton University)
-
ArtifactLens: Hundreds of Labels Are Enough for Artifact Detection with VLMs
-
EchoJEPA: A Latent Predictive Foundation Model for Echocardiography
-
Federated Balanced Learning
-
MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE
-
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning
-
Autoregressive Image Generation with Masked Bit Modeling
-
iGRPO: Self-Feedback–Driven LLM Reasoning
-
ML-DCN: Masked Low-Rank Deep Crossing Network Towards Scalable Ads Click-through Rate Prediction at Pinterest
-
Agentic LLMs as Powerful Deanonymizers: Re-identification of Participants in the Anthropic Interviewer Dataset
-
HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing
-
VirtualEnv: A Platform for Embodied AI Research
-
Intelligence Explosion
-
DriveWorld-VLA: Unified Latent-Space World Modeling with Vision-Language-Action for Autonomous Driving
-
ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training
-
CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs
-
CineScene: Implicit 3D as Effective Scene Representation for Cinematic Video Generation
-
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos
-
Can Post-Training Transform LLMs into Causal Reasoners? (Fudan University, Shanghai Artificial Intelligence Laboratory)
-
Learning a Generative Meta-Model of LLM Activations (UC Berkeley)
-
Self-Consistency Improves Chain of Thought Reasoning in Language Models
-
Large Language Model Reasoning Failures (Carleton College, Stanford University)
-
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations (TikTok / Hong Kong University of Science and Technology, Nanyang Technological University, The Chinese University of Hong Kong)
-
Learning to Discover at Test Time
-
MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
-
Is Your Paper Being Reviewed by an LLM? Benchmarking AI Text Detection in Peer Review
-
RISE-Video: Can Video Generators Decode Implicit World Rules?
-
Vector Quantization using Gaussian Variational Autoencoder
-
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning
-
Using a GPT-5-driven autonomous lab to optimize the cost and titer of cell-free protein synthesis
-
Knowledge-Intensive Agents (Northeastern University, China)
-
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers
