Papers
- Beyond the Golden Data: Resolving the Motion-Vision Quality Dilemma via Timestep Selective Training
- Synchronous Signal Temporal Logic for Decidable Verification of Cyber-Physical Systems
- BFMD: A Full-Match Badminton Dense Dataset for Dense Shot Captioning
- Insights on back marking for the automated identification of animals
- Humans vs Vision-Language Models: A Unified Measure of Narrative Coherence
- Missing-Aware Multimodal Fusion for Unified Microservice Incident Management
- PAWS: Perception of Articulation in the Wild at Scale from Egocentric Videos
- Voxtral TTS
- Towards Comprehensive Real-Time Scene Understanding in Ophthalmic Surgery through Multimodal Image Fusion
- An Integrative Genome-Scale Metabolic Modeling and Machine Learning Framework for Predicting and Optimizing Biofuel-Relevant Biomass Production in Saccharomyces cerevisiae
- Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes
- GeoHeight-Bench: Towards Height-Aware Multimodal Reasoning in Remote Sensing
- Are LLMs Overkill for Databases?: A Study on the Finiteness of SQL
- TAAC: A gate into Trustable Audio Affective Computing
- Implicit neural representations for larval zebrafish brain microscopy: a reproducible benchmark on the MapZebrain atlas
- Cooperative Deep Reinforcement Learning for Fair RIS Allocation
- Hierarchy-Guided Multimodal Representation Learning for Taxonomic Inference
- The Rules-and-Facts Model for Simultaneous Generalization and Memorization in Neural Networks
- UNIC: Neural Garment Deformation Field for Real-time Clothed Character Animation
- Pure and Physics-Guided Deep Learning Solutions for Spatio-Temporal Groundwater Level Prediction at Arbitrary Locations
- A Judge Agent Closes the Reliability Gap in AI-Generated Scientific Simulation
- Spatiotemporal System Forecasting with Irregular Time Steps via Masked Autoencoder
- DeepFAN, a transformer-based deep learning model for human-artificial intelligence collaborative assessment of incidental pulmonary nodules in CT scans: a multi-reader, multi-case trial
- Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification
- Social Hippocampus Memory Learning
- PICon: A Multi-Turn Interrogation Framework for Evaluating Persona Agent Consistency
- The Geometry of Efficient Nonconvex Sampling
- Visual or Textual: Effects of Explanation Format and Personal Characteristics on the Perception of Explanations in an Educational Recommender System
- LanteRn: Latent Visual Structured Reasoning
- Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance?
- Anchored-Branched Steady-state WInd Flow Transformer (AB-SWIFT): a metamodel for 3D atmospheric flow in urban environments
- Designing Any Imaging System from Natural Language: Agent-Constrained Composition over a Finite Primitive Basis
- Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers
- RenoBench: A Citation Parsing Benchmark
- Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos
- A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots
- Calorimeter Shower Superresolution with Conditional Normalizing Flows: Implementation and Statistical Evaluation
- arg-VU: Affordance Reasoning with Physics-Aware 3D Geometry for Visual Understanding in Robotic Surgery
- Fast-dVLA: Accelerating Discrete Diffusion VLA to Real-Time Performance
- Uncertainty-Guided Label Rebalancing for CPS Safety Monitoring
- Can Users Specify Driving Speed? Bench2Drive-Speed: Benchmark and Baselines for Desired-Speed Conditioned Autonomous Driving
- Longitudinal Digital Phenotyping for Early Cognitive-Motor Screening
- Measuring What Matters -- or What's Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors
- Self-Improvement of Large Language Models: A Technical Overview and Future Outlook
- Persistent Robot World Models: Stabilizing Multi-Step Rollouts via Reinforcement Learning
- Just Zoom In: Cross-View Geo-Localization via Autoregressive Zooming
- On Neural Scaling Laws for Weather Emulation through Continual Training
- LEMMA: Laplacian pyramids for Efficient Marine SeMAntic Segmentation
- A Unified Memory Perspective for Probabilistic Trustworthy AI
- The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase
