Papers

Filter by company

Slim attention: cut your context memory in half without loss – K-cache is all you need for MHA

Openmachine

Published on: 2025-06-03 1 author
Attention Residuals

Published on: 2026-03-16 37 authors
Representation Learning for Spatiotemporal Physical Systems

Published on: 2026-03-13 7 authors
Flowcean - Model Learning for Cyber-Physical Systems

Published on: 2026-03-12 7 authors
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights

Published on: 2026-03-12 2 authors
Temporal Straightening for Latent Planning

Published on: 2026-03-12 7 authors
Copula-ResLogit: A Deep-Copula Framework for Unobserved Confounding Effects

Published on: 2026-03-11 2 authors
Conversational AI-Enhanced Exploration System to Query Large-Scale Digitised Collections of Natural History Museums

Published on: 2026-03-11 5 authors
MultiwayPAM: Multiway Partitioning Around Medoids for LLM-as-a-Judge Score Analysis

Published on: 2026-03-11 2 authors
Quantum entanglement provides a competitive advantage in adversarial games

Published on: 2026-03-11 3 authors
Hybrid Self-evolving Structured Memory for GUI Agents

Published on: 2026-03-11 5 authors
Simulation-in-the-Reasoning (SiR): A Conceptual Framework for Empirically Grounded AI in Autonomous Transportation

Published on: 2026-03-11 1 author
GaLoRA: Parameter-Efficient Graph-Aware LLMs for Node Classification

Published on: 2026-03-11 3 authors
Regime-aware financial volatility forecasting via in-context learning

Published on: 2026-03-11 3 authors
From Imitation to Intuition: Intrinsic Reasoning for Open-Instance Video Classification

Published on: 2026-03-11 6 authors
What do near-optimal learning rate schedules look like?

Published on: 2026-03-11 4 authors
How to make the most of your masked language model for protein engineering

Published on: 2026-03-11 4 authors
Is this Idea Novel? An Automated Benchmark for Judgment of Research Ideas

Published on: 2026-03-11 2 authors
Data-Driven Integration Kernels for Interpretable Nonlocal Operator Learning

Published on: 2026-03-11 6 authors
Large language models can disambiguate opioid slang on social media

Published on: 2026-03-11 7 authors
The Orthogonal Vulnerabilities of Generative AI Watermarks: A Comparative Empirical Benchmark of Spatial and Latent Provenance

Published on: 2026-03-11 2 authors
NasoVoce: A Nose-Mounted Low-Audibility Speech Interface for Always-Available Speech Interaction

Published on: 2026-03-11 3 authors
PC-Diffuser: Path-Consistent Capsule CBF Safety Filtering for Diffusion-Based Trajectory Planner

Published on: 2026-03-11 2 authors
Does Reasoning Make Search More Fair? Comparing Fairness in Reasoning and Non-Reasoning Rerankers

Published on: 2026-03-11 3 authors
Fuel Gauge: Estimating Chain-of-Thought Length Ahead of Time in Large Multimodal Models

Published on: 2026-03-11 4 authors
Overcoming Visual Clutter in Vision Language Action Models via Concept-Gated Visual Distillation

Published on: 2026-03-11 4 authors
Federated Active Learning Under Extreme Non-IID and Global Class Imbalance

Published on: 2026-03-11 2 authors
On The Complexity of Best-Arm Identification in Non-Stationary Linear Bandits

Published on: 2026-03-11 4 authors
EmoStory: Emotion-Aware Story Generation

Published on: 2026-03-11 3 authors
Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck

Published on: 2026-03-11 7 authors
StyleGallery: Training-free and Semantic-aware Personalized Style Transfer from Arbitrary Image References

Published on: 2026-03-11 6 authors
Utility Function is All You Need: LLM-based Congestion Control

Published on: 2026-03-11 3 authors
HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation

Published on: 2026-03-11 10 authors
One Token, Two Fates: A Unified Framework via Vision Token Manipulation Against MLLMs Hallucination

Published on: 2026-03-11 5 authors
Geometric Autoencoder for Diffusion Models

Published on: 2026-03-11 3 authors
Dynamic Knowledge Fusion for Multi-Domain Dialogue State Tracking

Published on: 2026-03-11 5 authors
Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems

Published on: 2026-03-11 1 author
GeoSense: Internalizing Geometric Necessity Perception for Multimodal Reasoning

Published on: 2026-03-11 7 authors
Speech Codec Probing from Semantic and Phonetic Perspectives

Published on: 2026-03-11 6 authors
Edge-Assisted Multi-Robot Visual-Inertial SLAM with Efficient Communication

Published on: 2026-03-11 5 authors
Few-Shot Adaptation to Non-Stationary Environments via Latent Trend Embedding for Robotics

Published on: 2026-03-11 6 authors
Reactive Writers: How Co-Writing with AI Changes How We Engage with Ideas

Published on: 2026-03-11 4 authors
Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning

Published on: 2026-03-11 3 authors
Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design

Published on: 2026-03-11 6 authors
Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability

Published on: 2026-03-11 4 authors
Variance-Aware Adaptive Weighting for Diffusion Model Training

Published on: 2026-03-11 2 authors
Safe Probabilistic Planning for Human-Robot Interaction using Conformal Risk Control

Published on: 2026-03-11 4 authors
Graph-GRPO: Training Graph Flow Models with Reinforcement Learning

Published on: 2026-03-11 4 authors
Verbalizing LLM's Higher-order Uncertainty via Imprecise Probabilities

Published on: 2026-03-11 5 authors
On the Learning Dynamics of Two-layer Linear Networks with Label Noise SGD

Published on: 2026-03-11 7 authors

1 2 3 4 5 6 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: