Papers
-
Factored Latent Action World Models
-
Tuning-free Visual Effect Transfer across Videos
-
EVMbench: Evaluating AI Agents on Smart Contract Security
-
EgoScale: Scaling Dexterous Manipulation with Diverse Egocentric Human Data
-
Statistical approximation is not general intelligenceNew York University, Sapienza University of Rome, University of Milan-Bicocca
-
GLM-5: from Vibe Coding to Agentic Engineering
-
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching
-
jina-embeddings-v5-text: Task-Targeted Embedding Distillation
-
World Action Models are Zero-shot Policies
-
On Surprising Effectiveness of Masking Updates in Adaptive Optimizers
-
GLM-5: from Vibe Coding to Agentic Engineering
-
Image Generation with a Sphere Encoder
-
GUI-GENESIS: Automated Synthesis of Efficient Environments with Verifiable Rewards for GUI Agent Post-Training
-
OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality AttentionTencent / Hunan University, National University of Singapore, The Chinese University of Hong Kong, Tsinghua University, Xi'an Jiaotong University
-
BitDance: Scaling Autoregressive Generative Models with Binary Tokens
-
Experiential Reinforcement Learning
-
Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety
-
Speculative Decoding with a Speculative Vocabulary
-
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
-
Hippocampus: An Efficient and Scalable Memory Module for Agentic AI
-
Joint Time Series Chain: Detecting Unusual Evolving Trend across Time Series
-
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation
-
Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment
-
Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot LearningNVIDIA / Georgia Institute of TechnologyUniversity of Texas, Massachusetts Institute of Technology, Robotics and AI Institute, Swiss Federal Institute of Technology in Zurich, University of California, University of Southern California, University of Texas, University of Toronto
-
WizardLM: Empowering large pre-trained language models to follow complex instructions
-
Florence: A New Foundation Model for Computer Vision
-
Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution
-
GISA: A Benchmark for General Information-Seeking Assistant
-
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse TasksAmazon / Boston University, Carnegie Mellon University, Columbia University, Dartmouth College, Duke University, Michigan State University, Princeton University, Stanford University, The Ohio State University, University of California, University of Oxford, University of Southern California, University of Texas
-
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning
-
Think like a Scientist: Physics-guided LLM Agent for Equation DiscoveryUniversity of California
-
Intelligent AI Delegation
-
AdaptEvolve: Improving Efficiency of Evolutionary AI Agents through Adaptive Model Selection
-
HAIC: Humanoid Agile Object Interaction Control via Dynamics-Aware World Model
-
LUVE : Latent-Cascaded Ultra-High-Resolution Video Generation with Dual Frequency Experts
-
Abstractive Red-Teaming of Language Model Character
-
Kelix Technical Report
-
DSO: Direct Steering Optimization for Bias Mitigation
-
LLM-in-Sandbox Elicits General Agentic Intelligence
-
Visual Foresight for Robotic Stow: A Diffusion-Based World Model from Sparse Snapshots
-
On-Policy Context Distillation for Language Models
-
Iterative Reranking as a Compute-Scaling Method for LLM-based Rankers
-
KG-CRAFT: Knowledge graph-based contrastive reasoning with LLMs for enhancing automated fact-checking
-
Pattern Discovery with Wide-Lens Analysis and Sharp-Focus Validation
-
ChatLLM network: More brains, more intelligenceBeijing Institute of Technology
-
RISE: Self-Improving Robot Policy with Compositional World Model
-
CODE-SHARP: Continuous Open-ended Discovery and Evolution of Skills as Hierarchical Reward Programs
-
Gradients Must Earn Their Influence: Unifying SFT with Generalized Entropic Objectives
-
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
-
SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices
MongoDB - Build AI That Scales
