Papers
-
Slim attention: cut your context memory in half without loss – K-cache is all you need for MHA
-
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
-
Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety
-
Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning
-
WizardLM: Empowering large pre-trained language models to follow complex instructions
-
Florence: A New Foundation Model for Computer Vision
-
Iterative Reranking as a Compute-Scaling Method for LLM-based Rankers
-
KG-CRAFT: Knowledge graph-based contrastive reasoning with LLMs for enhancing automated fact-checking
-
Pattern Discovery with Wide-Lens Analysis and Sharp-Focus Validation
-
ChatLLM network: More brains, more intelligence (Beijing Institute of Technology)
-
Voxtral Realtime
-
Accelerating Mathematical and Scientific Discovery with Gemini Deep Think
-
OmniSapiens: A Foundation Model for Social Behavior Processing via HARPO (MIT, National University of Singapore)
-
GameDevBench: Evaluating Agentic Capabilities Through Game Development (Wayne Chi, Yixiong Fang, Arnav Yayavaram, Siddharth Yayavaram, Seth Karten; Carnegie Mellon University, Princeton University)
-
EchoJEPA: A Latent Predictive Foundation Model for Echocardiography
-
ML-DCN: Masked Low-Rank Deep Crossing Network Towards Scalable Ads Click-through Rate Prediction at Pinterest
-
Agentic LLMs as Powerful Deanonymizers: Re-identification of Participants in the Anthropic Interviewer Dataset
-
HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing
-
Intelligence Explosion
-
Can Post-Training Transform LLMs into Causal Reasoners? (Fudan University, Shanghai Artificial Intelligence Laboratory)
-
Learning a Generative Meta-Model of LLM Activations (UC Berkeley)
-
Self-Consistency Improves Chain of Thought Reasoning in Language Models
-
Using a GPT-5-driven autonomous lab to optimize the cost and titer of cell-free protein synthesis
-
Knowledge-Intensive Agents (Northeastern University, China)
-
Generative Engine Optimization: A VLM and Agent Framework for Pinterest Acquisition Growth
-
Accelerating Scientific Research with Gemini: Case Studies and Common Techniques
-
Closing the Loop: Universal Repository Representation with RPG-Encoder
-
Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity
-
AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations
-
Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems
-
Generative AI for Enzyme Design and Biocatalysis
-
SimMerge: Learning to Select Merge Operators from Similarity Signals
-
Argument Rarity-based Originality Assessment for AI-Assisted Writing (Ritsumeikan Global Innovation Research Organization)
-
AgentRx: Diagnosing AI Agent Failures from Execution Trajectories
-
What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-Zoom
-
DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Complex Real-World Tasks
-
The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?
-
Lost in Transmission: When and Why LLMs Fail to Reason Globally
-
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
-
Latent Diffusion for Internet of Things Attack Data Generation in Intrusion Detection Systems (Universidad Rey Juan Carlos)
-
DeepASMR: LLM-Based Zero-Shot ASMR Speech Generation for Anyone of Any Voice
-
SOFAI-LM: A Cognitive Architecture for Building Efficient and Reliable Reasoning Systems with LLMs
-
Small Models, Big Impact: Tool-Augmented AI Agents for Wireless Network Planning (King Abdullah University of Science and Technology (KAUST))
-
Recurrent Confidence Chain: Temporal-Aware Uncertainty Quantification in Large Language Models
-
OCTOBENCH: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding
-
TranslateGemma Technical Report
-
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts
-
Reasoning Models Generate Societies of Thought
-
Hardware Acceleration for Neural Networks: A Comprehensive Survey (Arizona State University)
-
Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models (The Hong Kong Polytechnic University)
