Papers
- Seeing Through Smoke: Surgical Desmoking for Improved Visual Perception
- Learning to Recorrupt: Noise Distribution Agnostic Self-Supervised Image Denoising
- PiCSRL: Physics-Informed Contextual Spectral Reinforcement Learning
- Speech-Synchronized Whiteboard Generation via VLM-Driven Structured Drawing Representations
- DRiffusion: Draft-and-Refine Process Parallelizes Diffusion Models with Ease
- Spectral Coherence Index: A Model-Free Metric for Protein Structural Ensemble Quality Assessment
- Automated Quality Assessment of Blind Sweep Obstetric Ultrasound for Improved Diagnosis
- World Reasoning Arena
- Polarization-Based Eye Tracking with Personalized Siamese Architectures
- Few Shots Text to Image Retrieval: New Benchmarking Dataset and Optimization Methods
- THFM: A Unified Video Foundation Model for 4D Human Perception and Beyond
- Data-Driven Plasticity Modeling via Acoustic Profiling
- On Integrating Resilience and Human Oversight into LLM-Assisted Modeling Workflows for Digital Twins
- Decoding Defensive Coverage Responsibilities in American Football Using Factorized Attention Based Transformer Models
- Shared Representation for 3D Pose Estimation, Action Classification, and Progress Prediction from Tactile Signals
- Parameter-Free Dynamic Regret for Unconstrained Linear Bandits
- Preventing Data Leakage in EEG-Based Survival Prediction: A Two-Stage Embedding and Transformer Framework
- Good Scores, Bad Data: A Metric for Multimodal Coherence
- Personalizing Mathematical Game-based Learning for Children: A Preliminary Study
- Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio
- DiReCT: Disentangled Regularization of Contrastive Trajectories for Physics-Refined Video Generation
- DenseSwinV2: Channel Attentive Dual Branch CNN Transformer Learning for Cassava Leaf Disease Classification
- Can Vision Foundation Models Navigate? Zero-Shot Real-World Evaluation and Lessons Learned
- Reinforcing Structured Chain-of-Thought for Video Understanding
- Can Small Models Reason About Legal Documents? A Comparative Study
- Adapting Segment Anything Model 3 for Concept-Driven Lesion Segmentation in Medical Images: An Experimental Study
- Collision-Aware Vision-Language Learning for End-to-End Driving with Multimodal Infraction Datasets
- Toward Actionable Digital Twins for Radiation-Based Imaging and Therapy: Mathematical Formulation, Modular Workflow, and an OpenKBP-Based Dose-Surrogate Prototype
- Globalized Adversarial Regret Optimization: Robust Decisions with Uncalibrated Predictions
- Low-Rank-Modulated Functa: Exploring the Latent Space of Implicit Neural Representations for Interpretable Ultrasound Video Analysis
- Online Learning for Dynamic Constellation Topologies
- EngineAD: A Real-World Vehicle Engine Anomaly Detection Dataset
- Adversarial-Robust Multivariate Time-Series Anomaly Detection via Joint Information Retention
- On the Objective and Feature Weights of Minkowski Weighted k-Means
- When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models
- BEVMAPMATCH: Multimodal BEV Neural Map Matching for Robust Re-Localization of Autonomous Vehicles
- Neuro-Cognitive Reward Modeling for Human-Centered Autonomous Vehicle Control
- MemoryCD: Benchmarking Long-Context User Memory of LLM Agents for Lifelong Cross-Domain Personalization
- Do Neurons Dream of Primitive Operators? Wake-Sleep Compression Rediscovers Schank's Event Semantics
- Second-Order, First-Class: A Composable Stack for Curvature-Aware Training
- Diffusion MRI Transformer with a Diffusion Space Rotary Positional Embedding (D-RoPE)
- A Priori Sampling of Transition States with Guided Diffusion
- Policy-Guided World Model Planning for Language-Conditioned Visual Navigation
- Epileptic Seizure Prediction Using Patient-Adaptive Transformer Networks
- Natural-Language Agent Harnesses
- Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
- MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data
- The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More
- Stochastic Dimension-Free Zeroth-Order Estimator for High-Dimensional and High-Order PINNs
- Thinking with Tables: Enhancing Multi-Modal Tabular Understanding via Neuro-Symbolic Reasoning
