Papers
- SciTaRC: Benchmarking QA on Scientific Tabular Data that Requires Language Reasoning and Complex Computation
- FedLECC: Cluster- and Loss-Guided Client Selection for Federated Learning under Non-IID Data
- Quantifying Memorization and Privacy Risks in Genomic Language Models
- Uncovering a Winning Lottery Ticket with Continuously Relaxed Bernoulli Gates
- Vision-Language Models Encode Clinical Guidelines for Concept-Based Medical Reasoning
- Tool Receipts, Not Zero-Knowledge Proofs: Practical Hallucination Detection for AI Agents
- Quantifying Uncertainty in AI Visibility: A Statistical Framework for Generative Search Measurement
- MEGC2026: Micro-Expression Grand Challenge on Visual Question Answering
- TIDE: Text-Informed Dynamic Extrapolation with Step-Aware Temperature Control for Diffusion Transformers
- Using Vision Language Foundation Models to Generate Plant Simulation Configurations via In-Context Learning
- Optimizing Reinforcement Learning Training over Digital Twin Enabled Multi-fidelity Networks
- Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality Assurance
- PathoScribe: Transforming Pathology Data into a Living Library with a Unified LLM-Driven Framework for Semantic Retrieval and Clinical Integration
- VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMs
- Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead
- BiCLIP: Domain Canonicalization via Structured Geometric Transformation
- Kernel Debiased Plug-in Estimation based on the Universal Least Favorable Submodel
- Towards Reliable Simulation-based Inference
- A Consensus-Driven Multi-LLM Pipeline for Missing-Person Investigations
- A Survey of Reinforcement Learning For Economics
- Automated Tensor-Relational Decomposition for Large-Scale Sparse Tensor Computation
- The $qs$ Inequality: Quantifying the Double Penalty of Mixture-of-Experts at Inference
- The FABRIC Strategy for Verifying Neural Feedback Systems
- Semantic Level of Detail: Multi-Scale Knowledge Representation via Heat Kernel Diffusion on Hyperbolic Manifolds
- Can You Hear, Localize, and Segment Continually? An Exemplar-Free Continual Learning Benchmark for Audio-Visual Segmentation
- MAcPNN: Mutual Assisted Learning on Data Streams with Temporal Dependence
- Data-driven robust Markov decision processes on Borel spaces: performance guarantees via an axiomatic approach
- SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing
- SurgCalib: Gaussian Splatting-Based Hand-Eye Calibration for Robot-Assisted Minimally Invasive Surgery
- MAPLE: Elevating Medical Reasoning from Statistical Consensus to Process-Led Alignment
- Automated Thematic Analysis for Clinical Qualitative Data: Iterative Codebook Refinement with Full Provenance
- Arbiter: Detecting Interference in LLM Agent System Prompts
- SkipGS: Post-Densification Backward Skipping for Efficient 3DGS Training
- Diffusion-Based Authentication of Copy Detection Patterns: A Multimodal Framework with Printer Signature Conditioning
- Learning When to Sample: Confidence-Aware Self-Consistency for Efficient LLM Chain-of-Thought Reasoning
- Security Considerations for Multi-agent Systems
- Gender Fairness in Audio Deepfake Detection: Performance and Disparity Analysis
- Statistical Inference via Generative Models: Flow Matching and Causal Inference
- Improving through Interaction: Searching Behavioral Representation Spaces with CMA-ES-IG
- The Coupling Within: Flow Matching via Distilled Normalizing Flows
- An accurate flatness measure to estimate the generalization performance of CNN models
- Meissa: Multi-modal Medical Agentic Intelligence
- AI Phenomenology for Understanding Human-AI Experiences Across Eras
- MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games
- The Missing Memory Hierarchy: Demand Paging for LLM Context Windows
- When to Retrain after Drift: A Data-Only Test of Post-Drift Data Size Sufficiency
- Automating Detection and Root-Cause Analysis of Flaky Tests in Quantum Software
- SynPlanResearch-R1: Encouraging Tool Exploration for Deep Research with Synthetic Plans
- Guess & Guide: Gradient-Free Zero-Shot Diffusion Guidance
- An Interpretable Generative Framework for Anomaly Detection in High-Dimensional Financial Time Series
