Papers
-
Statistical approximation is not general intelligenceNew York University, Sapienza University of Rome, University of Milan-Bicocca
-
GLM-5: from Vibe Coding to Agentic Engineering
-
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching
-
jina-embeddings-v5-text: Task-Targeted Embedding Distillation
-
World Action Models are Zero-shot Policies
-
On Surprising Effectiveness of Masking Updates in Adaptive Optimizers
-
GLM-5: from Vibe Coding to Agentic Engineering
-
Image Generation with a Sphere Encoder
-
GUI-GENESIS: Automated Synthesis of Efficient Environments with Verifiable Rewards for GUI Agent Post-Training
-
OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality AttentionTencent / Hunan University, National University of Singapore, The Chinese University of Hong Kong, Tsinghua University, Xi'an Jiaotong University
-
BitDance: Scaling Autoregressive Generative Models with Binary Tokens
-
Experiential Reinforcement Learning
-
Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety
-
Speculative Decoding with a Speculative Vocabulary
-
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
-
Hippocampus: An Efficient and Scalable Memory Module for Agentic AI
-
Joint Time Series Chain: Detecting Unusual Evolving Trend across Time Series
-
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation
-
Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment
-
Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot LearningNVIDIA / Georgia Institute of TechnologyUniversity of Texas, Massachusetts Institute of Technology, Robotics and AI Institute, Swiss Federal Institute of Technology in Zurich, University of California, University of Southern California, University of Texas, University of Toronto
-
WizardLM: Empowering large pre-trained language models to follow complex instructions
-
Florence: A New Foundation Model for Computer Vision
-
Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution
-
GISA: A Benchmark for General Information-Seeking Assistant
-
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse TasksAmazon / Boston University, Carnegie Mellon University, Columbia University, Dartmouth College, Duke University, Michigan State University, Princeton University, Stanford University, The Ohio State University, University of California, University of Oxford, University of Southern California, University of Texas
-
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning
-
Think like a Scientist: Physics-guided LLM Agent for Equation DiscoveryUniversity of California
-
Intelligent AI Delegation
-
AdaptEvolve: Improving Efficiency of Evolutionary AI Agents through Adaptive Model Selection
-
HAIC: Humanoid Agile Object Interaction Control via Dynamics-Aware World Model
-
LUVE : Latent-Cascaded Ultra-High-Resolution Video Generation with Dual Frequency Experts
-
Abstractive Red-Teaming of Language Model Character
-
Kelix Technical Report
-
DSO: Direct Steering Optimization for Bias Mitigation
-
LLM-in-Sandbox Elicits General Agentic Intelligence
-
Visual Foresight for Robotic Stow: A Diffusion-Based World Model from Sparse Snapshots
-
On-Policy Context Distillation for Language Models
-
Iterative Reranking as a Compute-Scaling Method for LLM-based Rankers
-
KG-CRAFT: Knowledge graph-based contrastive reasoning with LLMs for enhancing automated fact-checking
-
Pattern Discovery with Wide-Lens Analysis and Sharp-Focus Validation
-
ChatLLM network: More brains, more intelligenceBeijing Institute of Technology
-
CODE-SHARP: Continuous Open-ended Discovery and Evolution of Skills as Hierarchical Reward Programs
-
Gradients Must Earn Their Influence: Unifying SFT with Generalized Entropic Objectives
-
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
-
SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices
-
BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation
-
Voxtral Realtime
-
Accelerating Mathematical and Scientific Discovery with Gemini Deep Think
-
OmniSapiens: A Foundation Model for Social Behavior Processing via HARPOMassachusetts Institute of Technology, Nanyang Technological University, National University of Singapore, Qatar Computing Research Institute, University of Rochester
-
GameDevBench Evaluating Agentic Capabilities Through Game Development Wayne Chi1 , Yixiong Fang1 , Arnav Yayavaram1 , Siddharth Yayavaram1 , Seth Karten2Carnegie Mellon University, Princeton University
MongoDB - Build AI That Scales
