Papers

Slim attention: cut your context memory in half without loss – K-cache is all you need for MHA

Openmachine

Published on: 2025-06-03 1 author
Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety

OpenAI / Amazon, Anthropic, Meta

Published on: 2026-02-15 1 author
Qwen3 Technical Report

Alibaba

1 author
Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning

NVIDIA

1 author
WizardLM: Empowering large pre-trained language models to follow complex instructions

Microsoft / Peking University

1 author
Florence: A New Foundation Model for Computer Vision

Microsoft

1 author
Iterative Reranking as a Compute-Scaling Method for LLM-based Rankers

Amazon / Sapienza University

Published on: 2026-02-12 1 author
KG-CRAFT: Knowledge graph-based contrastive reasoning with LLMs for enhancing automated fact-checking

Amazon / Universidade Federal Fluminense

Published on: 2026-02-12 1 author
Pattern Discovery with Wide-Lens Analysis and Sharp-Focus Validation

Amazon / UC Santa Cruz

Published on: 2026-02-12 1 author
ChatLLM network: More brains, more intelligence

Beijing Institute of Technology

Published on: 2026-02-12 1 author
OmniSapiens: A Foundation Model for Social Behavior Processing via HARPO

MIT, National University of Singapore

Published on: 2026-02-11 1 author
GameDevBench Evaluating Agentic Capabilities Through Game Development Wayne Chi1 , Yixiong Fang1 , Arnav Yayavaram1 , Siddharth Yayavaram1 , Seth Karten2

Carniege Mellon University, Princeton University

Published on: 2026-02-11 1 author
Agentic LLMs as Powerful Deanonymizers: Re-identification of Participants in the Anthropic Interviewer Dataset

Anthropic / Northeastern University, China

Published on: 2026-02-09 1 author
Intelligence Explosion

1 author
Can Post-Training Transform LLMs into Causal Reasoners?

Fudan University, Shanghai Artificial Intelligence Laboratory

Published on: 2026-02-06 1 author
Learning a Generative Meta-Model of LLM Activations

UC Berkeley

Published on: 2026-02-06 5 authors
Self-Consistency Improves Chain of Thought Reasoning in Language Models

Google / University of Washington

Venue: ICLR 2023 6 authors
Using a GPT-5-driven autonomous lab to optimize the cost and titer of cell-free protein synthesis

OpenAI

Published on: 2026-02-05 1 author
Knowledge-Intensive Agents

Northeastern University, China

Published on: 2026-02-05 1 author
Accelerating Scientific Research with Gemini: Case Studies and Common Techniques

Google

Published on: 2026-02-03 1 author
Closing the Loop: Universal Repository Representation with RPG-Encoder

Microsoft

Published on: 2026-02-03 1 author
Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity

Published on: 2026-02-03 8 authors
AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations

Published on: 2026-02-03 9 authors
Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems

Published on: 2026-02-03 5 authors
Generative AI for Enzyme Design and Biocatalysis

Published on: 2026-02-03 2 authors
Argument Rarity-based Originality Assessment for AI-Assisted Writing

Ritsumeikan Global Innovation Research Organization

Published on: 2026-02-02 1 author
AgentRx: Diagnosing AI Agent Failures from Execution Trajectories

Microsoft

Published on: 2026-02-02 6 authors
DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Complex Real-World Tasks

Published on: 2026-02-01 9 authors
Lost in Transmission: When and Why LLMs Fail to Reason Globally

Microsoft

Published on: 2026-01-30 1 author
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Google / University of Washington

Published on: 2026-01-28 Venue: NeurIPS 2022 8 authors
Latent Diffusion for Internet of Things Attack Data Generation in Intrusion Detection Systems

Universidad Rey Juan Carlos

Published on: 2026-01-23 2 authors
SOFAI-LM: A Cognitive Architecture for Building Efficient and Reliable Reasoning Systems with LLMs

IBM

Published on: 2026-01-20 1 author
Small Models, Big Impact: Tool-Augmented AI Agents for Wireless Network Planning

King Abdullah University of Science and Technology (KAUST)

Published on: 2026-01-20 3 authors
Recurrent Confidence Chain: Temporal-Aware Uncertainty Quantification in Large Language Models

Published on: 2026-01-19 2 authors
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

Published on: 2026-01-16 13 authors
Hardware Acceleration for Neural Networks: A Comprehensive Survey

Arizona State University

Published on: 2026-01-15 1 author
Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

The Hong Kong Polytechnic University

Published on: 2026-01-13 5 authors
AgriAgent: Contract-Driven Planning and Capability-Aware Tool Orchestration in Real-World Agriculture

Published on: 2026-01-13 7 authors
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities.

Google

Published on: 2025-12-19
Addendum to GPT-5.2 System Card: GPT-5.2-Codex

OpenAI

Published on: 2025-12-18 1 author
Monitoring Monitorability

OpenAI

Published on: 2025-12-18 1 author
Evaluating AI’s ability to perform scientific research tasks

OpenAI

Published on: 2025-12-16 1 author
On Learning-Curve Monotonicity for Maximum Likelihood Estimators

OpenAI

Published on: 2025-12-11 1 author
Training LLMs for Honesty via Confessions

OpenAI

Published on: 2025-12-03
Early Science Acceleration Experiments with GPT-5

OpenAI

Published on: 2025-11-20 1 author
Weight-Sparse Transformers Have Interpretable Circuits

OpenAI

Published on: 2025-11-17 1 author
SageServe: Optimizing LLM Serving on Cloud Data Centers with Forecast Aware Auto-Scaling

Microsoft / University of Illinois Urbana-Champaign

Published on: 2025-11-12 1 author
Signs of introspection in large language models

Anthropic

Published on: 2025-10-29 1 author
Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency

Apple

Published on: 2025-10-28 1 author
Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient Clipping

Apple / Purdue University

Published on: 2025-10-25 1 author

1 2 3 4 5 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: