Papers
- Slim attention: cut your context memory in half without loss – K-cache is all you need for MHA
- Attention Residuals
- Representation Learning for Spatiotemporal Physical Systems
- Flowcean - Model Learning for Cyber-Physical Systems
- Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights
- Temporal Straightening for Latent Planning
- Copula-ResLogit: A Deep-Copula Framework for Unobserved Confounding Effects
- Conversational AI-Enhanced Exploration System to Query Large-Scale Digitised Collections of Natural History Museums
- MultiwayPAM: Multiway Partitioning Around Medoids for LLM-as-a-Judge Score Analysis
- Quantum entanglement provides a competitive advantage in adversarial games
- Hybrid Self-evolving Structured Memory for GUI Agents
- Simulation-in-the-Reasoning (SiR): A Conceptual Framework for Empirically Grounded AI in Autonomous Transportation
- GaLoRA: Parameter-Efficient Graph-Aware LLMs for Node Classification
- Regime-aware financial volatility forecasting via in-context learning
- From Imitation to Intuition: Intrinsic Reasoning for Open-Instance Video Classification
- What do near-optimal learning rate schedules look like?
- How to make the most of your masked language model for protein engineering
- Is this Idea Novel? An Automated Benchmark for Judgment of Research Ideas
- Data-Driven Integration Kernels for Interpretable Nonlocal Operator Learning
- Large language models can disambiguate opioid slang on social media
- The Orthogonal Vulnerabilities of Generative AI Watermarks: A Comparative Empirical Benchmark of Spatial and Latent Provenance
- NasoVoce: A Nose-Mounted Low-Audibility Speech Interface for Always-Available Speech Interaction
- PC-Diffuser: Path-Consistent Capsule CBF Safety Filtering for Diffusion-Based Trajectory Planner
- Does Reasoning Make Search More Fair? Comparing Fairness in Reasoning and Non-Reasoning Rerankers
- Fuel Gauge: Estimating Chain-of-Thought Length Ahead of Time in Large Multimodal Models
- Overcoming Visual Clutter in Vision Language Action Models via Concept-Gated Visual Distillation
- Federated Active Learning Under Extreme Non-IID and Global Class Imbalance
- On The Complexity of Best-Arm Identification in Non-Stationary Linear Bandits
- EmoStory: Emotion-Aware Story Generation
- Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck
- StyleGallery: Training-free and Semantic-aware Personalized Style Transfer from Arbitrary Image References
- Utility Function is All You Need: LLM-based Congestion Control
- HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation
- One Token, Two Fates: A Unified Framework via Vision Token Manipulation Against MLLMs Hallucination
- Geometric Autoencoder for Diffusion Models
- Dynamic Knowledge Fusion for Multi-Domain Dialogue State Tracking
- Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems
- GeoSense: Internalizing Geometric Necessity Perception for Multimodal Reasoning
- Speech Codec Probing from Semantic and Phonetic Perspectives
- Edge-Assisted Multi-Robot Visual-Inertial SLAM with Efficient Communication
- Few-Shot Adaptation to Non-Stationary Environments via Latent Trend Embedding for Robotics
- Reactive Writers: How Co-Writing with AI Changes How We Engage with Ideas
- Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning
- Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design
- Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability
- Variance-Aware Adaptive Weighting for Diffusion Model Training
- Safe Probabilistic Planning for Human-Robot Interaction using Conformal Risk Control
- Graph-GRPO: Training Graph Flow Models with Reinforcement Learning
- Verbalizing LLM's Higher-order Uncertainty via Imprecise Probabilities
- On the Learning Dynamics of Two-layer Linear Networks with Label Noise SGD
