Papers
-
What and When to Learn: CURriculum Ranking Loss for Large-Scale Speaker Verification
-
The Gait Signature of Frailty: Transfer Learning based Deep Gait Models for Scalable Frailty Assessment
-
Enes Causal Discovery
-
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents
-
Integrating Causal Machine Learning into Clinical Decision Support Systems: Insights from Literature and Practice
-
Unleashing Vision-Language Semantics for Deepfake Video Detection
-
OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning
-
Mechanic: Sorrifier-Driven Formal Decomposition Workflow for Automated Theorem Proving
-
Counting Without Numbers & Finding Without Words
-
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?
-
Conformalized Transfer Learning for Li-ion Battery State of Health Forecasting under Manufacturing and Usage Variability
-
Composer 2 Technical Report
-
Positive-First Most Ambiguous: A Simple Active Learning Criterion for Interactive Retrieval of Rare Categories
-
Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA
-
Video-Only ToM: Enhancing Theory of Mind in Multimodal Large Language Models
-
Uniform Laws of Large Numbers in Product Spaces
-
Project and Generate: Divergence-Free Neural Operators for Incompressible Flows
-
CRISP: Characterizing Relative Impact of Scholarly Publications
-
Towards Safe Learning-Based Non-Linear Model Predictive Control through Recurrent Neural Network Modeling
-
Toward Physically Consistent Driving Video World Models under Challenging Trajectories
-
A Firefly Algorithm for Mixed-Variable Optimization Based on Hybrid Distance Modeling
-
AVO: Agentic Variation Operators for Autonomous Evolutionary Search
-
TuneShift-KD: Knowledge Distillation and Transfer for Fine-tuned Models
-
No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertainty Attributions
-
From Liar Paradox to Incongruent Sets: A Normal Form for Self-Reference
-
Cross-Modal Prototype Alignment and Mixing for Training-Free Few-Shot Classification
-
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
-
Representation Learning to Study Temporal Dynamics in Tutorial Scaffolding
-
Robust Multilingual Text-to-Pictogram Mapping for Scalable Reading Rehabilitation
-
CliPPER: Contextual Video-Language Pretraining on Long-form Intraoperative Surgical Procedures for Event Recognition
-
SEGAR: Selective Enhancement for Generative Augmented Reality
-
Analysing the Safety Pitfalls of Steering Vectors
-
A Sociolinguistic Analysis of Automatic Speech Recognition Bias in Newcastle English
-
The role of spatial context and multitask learning in the detection of organic and conventional farming systems based on Sentinel-2 time series
-
Can LLMs Beat Classical Hyperparameter Optimization Algorithms? A Study on autoresearch
-
Energy-Efficient Hierarchical Federated Anomaly Detection for the Internet of Underwater Things via Selective Cooperative Aggregation
-
MedOpenClaw: Auditable Medical Imaging Agents Reasoning over Uncurated Full Studies
-
Evaluating Chunking Strategies For Retrieval-Augmented Generation in Oil and Gas Enterprise Documents
-
LensWalk: Agentic Video Understanding by Planning How You See in Videos
-
The Free-Market Algorithm: Self-Organizing Optimization for Open-Ended Complex Systems
-
Scaling Recurrence-aware Foundation Models for Clinical Records via Next-Visit Prediction
-
Trust Region Constrained Bayesian Optimization with Penalized Constraint Handling
-
POLY-SIM: Polyglot Speaker Identification with Missing Modality Grand Challenge 2026 Evaluation Plan
-
Anti-I2V: Safeguarding your photos from malicious image-to-video generation
-
Completeness of Unbounded Best-First Minimax and Descent Minimax
-
Towards Training-Free Scene Text Editing
-
VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models
-
Chameleon: Episodic Memory for Long-Horizon Robotic Manipulation
-
EndoVGGT: GNN-Enhanced Depth Estimation for Surgical 3D Reconstruction
-
Vision-Language Models vs Human: Perceptual Image Quality Assessment
