Papers

Filter by company

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

MiniMax / Fudan University

Published on: 2026-02-08 1 author
VirtualEnv: A Platform for Embodied AI Research

Sony Group Corporation (AIBO) / MIT

Published on: 2026-02-07 1 author
Intelligence Explosion

1 author
DriveWorld-VLA: Unified Latent-Space World Modeling with Vision-Language-Action for Autonomous Driving

Xiaomi

Published on: 2026-02-06 1 author
ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

Meituan

Published on: 2026-02-06 1 author
CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Meituan

Published on: 2026-02-06 1 author
CineScene: Implicit 3D as Effective Scene Representation for Cinematic Video Generation

Microsoft

Published on: 2026-02-06 1 author
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

NVIDIA

Published on: 2026-02-06 1 author
Can Post-Training Transform LLMs into Causal Reasoners?

Fudan University, Shanghai Artificial Intelligence Laboratory

Published on: 2026-02-06 1 author
Learning a Generative Meta-Model of LLM Activations

UC Berkeley

Published on: 2026-02-06 5 authors
Self-Consistency Improves Chain of Thought Reasoning in Language Models

Google / University of Washington

Venue: ICLR 2023 6 authors
Large Language Model Reasoning Failures

Carleton College, Stanford University

Published on: 2026-02-05 3 authors
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

TikTok / Hong Kong University of Science and Technology, Nanyang Technological University, The Chinese University of Hong Kong

Published on: 2026-02-05 7 authors
Learning to Discover at Test Time

Together AI, NVIDIA / Stanford University, UC San Diego

Published on: 2026-02-05 1 author
MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning

Xiaomi / Huazhong University of Science and Technology

Published on: 2026-02-05 1 author
Is Your Paper Being Reviewed by an LLM? Benchmarking AI Text Detection in Peer Review

Intel

Published on: 2026-02-05 1 author
RISE-Video: Can Video Generators Decode Implicit World Rules?

Tencent / Shanghai Jiao Tong University

Published on: 2026-02-05 1 author
Vector Quantization using Gaussian Variational Autoencoder

Z.ai

Published on: 2026-02-05 1 author
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

Microsoft / Tsinghua University

Published on: 2026-02-05 1 author
Using a GPT-5-driven autonomous lab to optimize the cost and titer of cell-free protein synthesis

OpenAI

Published on: 2026-02-05 1 author
Knowledge-Intensive Agents

Northeastern University, China

Published on: 2026-02-05 1 author
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers

AMD

Published on: 2026-02-04 1 author
Asynchronous Reasoning: Training-Free Interactive Thinking LLMs

Together AI / The University of Tokyo

Published on: 2026-02-04 1 author
OpenOneRec Technical Report

Kuaishou Technology

Published on: 2026-02-04 1 author
Learning to Reason in 13 Parameters

Meta Platforms / Cornell University

Published on: 2026-02-04 1 author
A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces

Metastone Technology / University of Science and Technology of China

Published on: 2026-02-03 7 authors
LLMs as Orchestrators: Constraint-Compliant Multi-Agent Optimization for Recommendation Systems

Workday

Published on: 2026-02-03 1 author
IRIS: Implicit Reward-Guided Internal Sifting for Mitigating Multimodal Hallucination

Sony Group Corporation (AIBO)

Published on: 2026-02-03 1 author
BlossomRec: Block-level Fused Sparse Attention Mechanism for Sequential Recommendations

Tencent / City University of Hong Kong

Published on: 2026-02-03 1 author
ReMiT: RL-Guided Mid-Training for Iterative LLM Evolution

Tencent / Shanghai Jiao Tong University

Published on: 2026-02-03 1 author
HY3D-Bench: Generation of 3D Assets

Tencent

Published on: 2026-02-03 1 author
Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Meituan

Published on: 2026-02-03 1 author
CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability

Kuaishou Technology

Published on: 2026-02-03 1 author
LIVE: Long-horizon Interactive Video World Modeling

Microsoft / The Chinese University of Hong Kong

Published on: 2026-02-03 1 author
AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent

Amazon / Carnegie Mellon University

Published on: 2026-02-03 1 author
Generative Engine Optimization: A VLM and Agent Framework for Pinterest Acquisition Growth

Pinterest / Stanford University

Published on: 2026-02-03 1 author
Accelerating Scientific Research with Gemini: Case Studies and Common Techniques

Google

Published on: 2026-02-03 1 author
Closing the Loop: Universal Repository Representation with RPG-Encoder

Microsoft

Published on: 2026-02-03 1 author
Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity

Published on: 2026-02-03 8 authors
AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations

Published on: 2026-02-03 9 authors
Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems

Published on: 2026-02-03 5 authors
Generative AI for Enzyme Design and Biocatalysis

Published on: 2026-02-03 2 authors
Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models

Tencent / Tsinghua University

Published on: 2026-02-02 11 authors
HunyuanImage 3.0 Technical Report

Tencent

Published on: 2026-02-02 1 author
MAIN-VLA: Modeling Abstraction of Intention and eNvironment for Vision-Language-Action Models

Tencent

Published on: 2026-02-02 1 author
OneMall: One Architecture, More Scenarios -- End-to-End Generative Recommender Family at Kuaishou E-Commerce

Kuaishou Technology

Published on: 2026-02-02 1 author
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Z.ai / Tsinghua University

Published on: 2026-02-02 1 author
An Empirical Study on Noisy Data and LLM Pretraining Loss Divergence

Meta Platforms

Published on: 2026-02-02 1 author
Interpretable Tabular Foundation Models via In-Context Kernel Regression

Amazon

Published on: 2026-02-02 1 author
RFS: Reinforcement Learning with Residual Flow Steering for Dexterous Manipulation

Amazon / University of Washington

Published on: 2026-02-02 1 author

Prev 35 36 37 38 39 40 41 42 43 44 45 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: