Papers
-
Magic Words or Methodical Work? Challenging Conventional Wisdom in LLM-Based Political Text Annotation
-
Computer Vision with a Superpixelation Camera
-
Are LLMs Good For Quantum Software, Architecture, and System Design?
-
FusionAgent: A Multimodal Agent with Dynamic Model Selection for Human Recognition
-
Comparing Physics-Informed and Neural ODE Approaches for Modeling Nonlinear Biological Systems: A Case Study Based on the Morris-Lecar Model
-
Mimetic Alignment with ASPECT: Evaluation of AI-inferred Personal Profiles
-
Koopman Operator Identification of Model Parameter Trajectories for Temporal Domain Generalization (KOMET)
-
Live Interactive Training for Video Segmentation
-
In your own words: computationally identifying interpretable themes in free-text survey data
-
Tunable Domain Adaptation Using Unfolding
-
Leveraging Avatar Fingerprinting: A Multi-Generator Photorealistic Talking-Head Public Database and Benchmark
-
From 3D Pose to Prose: Biomechanics-Grounded Vision--Language Coaching
-
Multilingual Stutter Event Detection for English, German, and Mandarin Speech
-
Static and Dynamic Approaches to Computing Barycenters of Probability Measures on Graphs
-
Neuro-Symbolic Learning for Predictive Process Monitoring via Two-Stage Logic Tensor Networks with Rule Pruning
-
Real-time Appearance-based Gaze Estimation for Open Domains
-
Compliance-Aware Predictive Process Monitoring: A Neuro-Symbolic Approach
-
Multimodal Deep Learning for Diabetic Foot Ulcer Staging Using Integrated RGB and Thermal Imaging
-
ASTER -- Agentic Science Toolkit for Exoplanet Research
-
High dimensional theory of two-phase optimizers
-
On the Optimal Number of Grids for Differentially Private Non-Interactive $K$-Means Clustering
-
Neural Approximation of Generalized Voronoi Diagrams
-
Graph Attention Network-Based Detection of Autism Spectrum Disorder
-
Probabilistic Forecasting of Localized Wildfire Spread Based on Conditional Flow Matching
-
Beyond Mortality: Advancements in Post-Mortem Iris Recognition through Data Collection and Computer-Aided Forensic Examination
-
Online Statistical Inference of Constant Sample-averaged Q-Learning
-
Transparency as Architecture: Structural Compliance Gaps in EU AI Act Article 50 II
-
A Provable Energy-Guided Test-Time Defense Boosting Adversarial Robustness of Large Vision-Language Models
-
A large corpus of lucid and non-lucid dream reports
-
On the Reliability Limits of LLM-Based Multi-Agent Planning
-
ImmSET: Sequence-Based Predictor of TCR-pMHC Specificity at Scale
-
FormalProofBench: Can Models Write Graduate Level Math Proofs That Are Formally Verified?
-
Beyond Freshness and Semantics: A Coupon-Collector Framework for Effective Status Updates
-
AutoSiMP: Autonomous Topology Optimization from Natural Language via LLM-Driven Problem Configuration and Adaptive Solver Control
-
PHONOS: PHOnetic Neutralization for Online Streaming Applications
-
The Last Fingerprint: How Markdown Training Shapes LLM Prose
-
RASPRef: Retrieval-Augmented Self-Supervised Prompt Refinement for Large Reasoning Models
-
UMI-Underwater: Learning Underwater Manipulation without Underwater Teleoperation
-
GUIDED: Granular Understanding via Identification, Detection, and Discrimination for Fine-Grained Open-Vocabulary Object Detection
-
Generative Shape Reconstruction with Geometry-Guided Langevin Dynamics
-
On-Device Super Resolution Imaging Using Low-Cost SPAD Array and Embedded Lightweight Deep Learning
-
Parameter Estimation in Stochastic Differential Equations via Wiener Chaos Expansion and Stochastic Gradient Descent
-
Pashto Common Voice: Building the First Open Speech Corpus for a 60-Million-Speaker Low-Resource Language
-
TAPS: Task Aware Proposal Distributions for Speculative Sampling
-
YOLO Object Detectors for Robotics -- a Comparative Study
-
RealBirdID: Benchmarking Bird Species Identification in the Era of MLLMs
-
Material Identification using Multi-Modal Intrinsic Radiation and Radiography
-
Unified Number-Free Text-to-Motion Generation Via Flow Matching
-
CREST: Constraint-Release Execution for Multi-Robot Warehouse Shelf Rearrangement
-
Introducing MELI: the Mandarin-English Language Interview Corpus
