Papers
-
Pareto-Optimal Anytime Algorithms via Bayesian Racing
-
First-Order Geometry, Spectral Compression, and Structural Compatibility under Bounded Computation
-
Efficient Credal Prediction through Decalibration
-
Reading $\neq$ Seeing: Diagnosing and Closing the Typography Gap in Vision-Language Models
-
All Vehicles Can Lie: Efficient Adversarial Defense in Fully Untrusted-Vehicle Collaborative Perception via Pseudo-Random Bayesian Inference
-
Improving Continual Learning for Gaussian Splatting based Environments Reconstruction on Commercial Off-the-Shelf Edge Devices
-
Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA
-
Spherical-GOF: Geometry-Aware Panoramic Gaussian Opacity Fields for 3D Scene Reconstruction
-
Echo2ECG: Enhancing ECG Representations with Cardiac Morphology from Multi-View Echos
-
Oracle-Guided Soft Shielding for Safe Move Prediction in Chess
-
Beyond Hungarian: Match-Free Supervision for End-to-End Object Detection
-
Breaking the Bias Barrier in Concave Multi-Objective Reinforcement Learning
-
OccTrack360: 4D Panoptic Occupancy Tracking from Surround-View Fisheye Cameras
-
BuildMamba: A Visual State-Space Based Model for Multi-Task Building Segmentation and Height Estimation from Satellite Images
-
Towards Effective and Efficient Graph Alignment without Supervision
-
SecAgent: Efficient Mobile GUI Agent with Semantic Context
-
SWIFT: Sliding Window Reconstruction for Few-Shot Training-Free Generated Video Attribution
-
PCFEx: Point Cloud Feature Extraction for Graph Neural Networks
-
The Neural Compass: Probabilistic Relative Feature Fields for Robotic Search
-
Interactive World Simulator for Robot Policy Training and Evaluation
-
mmGAT: Pose Estimation by Graph Attention with Mutual Features from mmWave Radar Point Cloud
-
Generative Adversarial Regression (GAR): Learning Conditional Risk Scenarios
-
Impact of Connectivity on Laplacian Representations in Reinforcement Learning
-
BioGait-VLM: A Tri-Modal Vision-Language-Biomechanics Framework for Interpretable Clinical Gait Assessment
-
OSS-CRS: Liberating AIxCC Cyber Reasoning Systems for Real-World Open-Source Security
-
MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-Manipulation
-
Trust via Reputation of Conviction
-
Drift-to-Action Controllers: Budgeted Interventions with Online Risk Certificates
-
Online Sparse Synthetic Aperture Radar Imaging
-
DualFlexKAN: Dual-stage Kolmogorov-Arnold Networks with Independent Function Control
-
Towards Batch-to-Streaming Deep Reinforcement Learning for Continuous Control
-
CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing
-
PRISM: Streaming Human Motion Generation with Per-Joint Latent Decomposition
-
Boosting MLLM Spatial Reasoning with Geometrically Referenced 3D Scene Representations
-
Don't Look Back in Anger: MAGIC Net for Streaming Continual Learning with Temporal Dependence
-
Weakly Supervised Teacher-Student Framework with Progressive Pseudo-mask Refinement for Gland Segmentation
-
FOMO-3D: Using Vision Foundation Models for Long-Tailed 3D Object Detection
-
Micro-Diffusion Compression - Binary Tree Tweedie Denoising for Online Probability Estimation
-
StreamReady: Learning What to Answer and When in Long Streaming Videos
-
Integral Formulas for Vector Spherical Tensor Products
-
UNBOX: Unveiling Black-box visual models with Natural-language
-
OmniGuide: Universal Guidance Fields for Enhancing Generalist Robot Policies
-
Retrieval-Augmented Gaussian Avatars: Improving Expression Generalization
-
Grow, Don't Overwrite: Fine-tuning Without Forgetting
-
CAST: Modeling Visual State Transitions for Consistent Video Retrieval
-
Divide and Predict: An Architecture for Input Space Partitioning and Enhanced Accuracy
-
Group Entropies and Mirror Duality: A Class of Flexible Mirror Descent Updates for Machine Learning
-
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation
-
Cluster-Aware Attention-Based Deep Reinforcement Learning for Pickup and Delivery Problems
-
OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning
