Papers
-
From Pixels to Privacy: Temporally Consistent Video Anonymization via Token Pruning for Privacy Preserving Action Recognition
-
Curvature-aware Expected Free Energy as an Acquisition Function for Bayesian Optimization
-
HINT: Composed Image Retrieval with Dual-path Compositional Contextualized Network
-
A Power-Weighted Noncentral Complex Gaussian Distribution
-
Reflect to Inform: Boosting Multimodal Reasoning via Information-Gain-Driven Verification
-
Generative Score Inference for Multimodal Data
-
DuSCN-FusionNet: An Interpretable Dual-Channel Structural Covariance Fusion Framework for ADHD Classification Using Structural MRI
-
Only Whats Necessary: Pareto Optimal Data Minimization for Privacy Preserving Video Anomaly Detection
-
From Pen to Pixel: Translating Hand-Drawn Plots into Graphical APIs via a Novel Benchmark and Efficient Adapter
-
MPDiT: Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model
-
Automated near-term quantum algorithm discovery for molecular ground states
-
HandVQA: Diagnosing and Improving Fine-Grained Spatial Reasoning about Hands in Vision-Language Models
-
A Formal Framework for Uncertainty Analysis of Text Generation with Large Language Models
-
Dynamic Token Compression for Efficient Video Understanding through Reinforcement Learning
-
Generative Modeling in Protein Design: Neural Representations, Conditional Generation, and Evaluation Standards
-
Switch Attention: Towards Dynamic and Fine-grained Hybrid Transformers
-
Restore, Assess, Repeat: A Unified Framework for Iterative Image Restoration
-
Maintaining Difficulty: A Margin Scheduler for Triplet Loss in Siamese Networks Training
-
Adapting Frozen Mono-modal Backbones for Multi-modal Registration via Contrast-Agnostic Instance Optimization
-
SHANDS: A Multi-View Dataset and Benchmark for Surgical Hand-Gesture and Error Recognition Toward Medical Training
-
Word Alignment-Based Evaluation of Uniform Meaning Representations
-
AFSS: Artifact-Focused Self-Synthesis for Mitigating Bias in Audio Deepfake Detection
-
Why Models Know But Don't Say: Chain-of-Thought Faithfulness Divergence Between Thinking Tokens and Answers in Open-Weight Reasoning Models
-
KMM-CP: Practical Conformal Prediction under Covariate Shift via Selective Kernel Mean Matching
-
Kantorovich--Kernel Neural Operators: Approximation Theory, Asymptotics, and Neural Network Interpretation
-
A Hierarchical Sheaf Spectral Embedding Framework for Single-Cell RNA-seq Analysis
-
CPUBone: Efficient Vision Backbone Design for Devices with Low Parallelization Capabilities
-
Analysing Calls to Order in German Parliamentary Debates
-
Reconstructing Quantum Dot Charge Stability Diagrams with Diffusion Models
-
Automating Clinical Information Retrieval from Finnish Electronic Health Records Using Large Language Models
-
Interpretable long-term traffic modelling on national road networks using theory-informed deep learning
-
Image-based Quantification of Postural Deviations on Patients with Cervical Dystonia: A Machine Learning Approach Using Synthetic Training Data
-
Meta-Learned Adaptive Optimization for Robust Human Mesh Recovery with Uncertainty-Aware Parameter Updates
-
ClimateCheck 2026: Scientific Fact-Checking and Disinformation Narrative Classification of Climate-related Claims
-
Fair Data Pre-Processing with Imperfect Attribute Space
-
Can AI Models Direct Each Other? Organizational Structure as a Probe into Training Limitations
-
Neuro-Symbolic Process Anomaly Detection
-
Automatic feature identification in least-squares policy iteration using the Koopman operator framework
-
A Boltzmann-machine-enhanced Transformer For DNA Sequence Classification
-
HyVIC: A Metric-Driven Spatio-Spectral Hyperspectral Image Compression Architecture Based on Variational Autoencoders
-
UNIFERENCE: A Discrete Event Simulation Framework for Developing Distributed AI Models
-
Foundation Model for Cardiac Time Series via Masked Latent Attention
-
Shapley meets Rawls: an integrated framework for measuring and explaining unfairness
-
SparseCam4D: Spatio-Temporally Consistent 4D Reconstruction from Sparse Cameras
-
SPECTRA: An Efficient Spectral-Informed Neural Network for Sensor-Based Activity Recognition
-
EcoFair: Trustworthy and Energy-Aware Routing for Privacy-Preserving Vertically Partitioned Medical Inference
-
ClipTTT: CLIP-Guided Test-Time Training Helps LVLMs See Better
-
Conditional Neural Bayes Ratio Estimation for Experimental Design Optimisation
-
Entanglement as Memory: Mechanistic Interpretability of Quantum Language Models
-
Rocks, Pebbles and Sand: Modality-aware Scheduling for Multimodal Large Language Model Inference
MongoDB - Build AI That Scales
