Papers
-
Uncertainty-aware Prototype Learning with Variational Inference for Few-shot Point Cloud Segmentation
-
GEM: A Native Graph-based Index for Multi-Vector Retrieval
-
Growing Networks with Autonomous Pruning
-
PCSTracker: Long-Term Scene Flow Estimation for Point Cloud Sequences
-
Adapting a Pre-trained Single-Cell Foundation Model to Spatial Gene Expression Generation from Histology Images
-
FREAK: A Fine-grained Hallucination Evaluation Benchmark for Advanced MLLMs
-
High-fidelity Multi-view Normal Integration with Scale-encoded Neural Surface Representation
-
FlashCap: Millisecond-Accurate Human Motion Capture via Flashing LEDs and Event-Based Vision
-
Low-pass Personalized Subgraph Federated Recommendation
-
Neither Here Nor There: Cross-Lingual Representation Dynamics of Code-Mixed Text in Multilingual Encoders
-
Template-based Object Detection Using a Foundation Model
-
Evaluating Image Editing with LLMs: A Comprehensive Benchmark and Intermediate-Layer Probing Approach
-
ReManNet: A Riemannian Manifold Network for Monocular 3D Lane Detection
-
Graph-Aware Text-Only Backdoor Poisoning for Text-Attributed Graphs
-
One Model, Two Minds: Task-Conditioned Reasoning for Unified Image Quality and Aesthetic Assessment
-
Decoupled Sensitivity-Consistency Learning for Weakly Supervised Video Anomaly Detection
-
Embodied Science: Closing the Discovery Loop with Agentic Embodied AI
-
Learning Hierarchical Orthogonal Prototypes for Generalized Few-Shot 3D Point Cloud Segmentation
-
ContractSkill: Repairable Contract-Based Skills for Multimodal Web Agents
-
From Plausibility to Verifiability: Risk-Controlled Generative OCR for Vision-Language Models
-
Scalable Learning of Multivariate Distributions via Coresets
-
Interpretable Multiple Myeloma Prognosis with Observational Medical Outcomes Partnership Data
-
Controllable Text-to-Motion Generation via Modular Body-Part Phase Control
-
Borderless Long Speech Synthesis
-
Offshore oil and gas platform dynamics in the North Sea, Gulf of Mexico, and Persian Gulf: Exploiting the Sentinel-1 archive
-
Evaluating Vision Foundation Models for Pixel and Object Classification in Microscopy
-
Quantifying Gate Contribution in Quantum Feature Maps for Scalable Circuit Optimization
-
Enhancing Alignment for Unified Multimodal Models via Semantically-Grounded Supervision
-
A graph neural network based chemical mechanism reduction method for combustion applications
-
Two-Time-Scale Learning Dynamics: A Population View of Neural Network Training
-
Eye Gaze-Informed and Context-Aware Pedestrian Trajectory Prediction in Shared Spaces with Automated Shuttles: A Virtual Reality Study
-
GDEGAN: Gaussian Dynamic Equivariant Graph Attention Network for Ligand Binding Site Prediction
-
Sparsely-Supervised Data Assimilation via Physics-Informed Schrödinger Bridge
-
HUGE-Bench: A Benchmark for High-Level UAV Vision-Language-Action Tasks
-
FrameNet Semantic Role Classification by Analogy
-
FormalEvolve: Neuro-Symbolic Evolutionary Search for Diverse and Prover-Effective Autoformalization
-
Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech?
-
Fourier Splatting: Generalized Fourier encoded primitives for scalable radiance fields
-
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
-
Explainable cluster analysis: a bagging approach
-
Modeling subgrid scale production rates on complex meshes using graph neural networks
-
Overreliance on AI in Information-seeking from Video Content
-
Bridging the Gap Between Climate Science and Machine Learning in Climate Model Emulation
-
Hyper-Connections for Adaptive Multi-Modal MRI Brain Tumor Segmentation
-
G2DR: A Genotype-First Framework for Genetics-Informed Target Prioritization and Drug Repurposing
-
Semantic Delta: An Interpretable Signal Differentiating Human and LLMs Dialogue
-
Failure Modes for Deep Learning-Based Online Mapping: How to Measure and Address Them
-
FoleyDirector: Fine-Grained Temporal Steering for Video-to-Audio Generation via Structured Scripts
-
IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment
-
MedQ-Engine: A Closed-Loop Data Engine for Evolving MLLMs in Medical Image Quality Assessment
