Papers
- Coordinate Encoding on Linear Grids for Physics-Informed Neural Networks
- TimeWeaver: Age-Consistent Reference-Based Face Restoration with Identity Preservation
- Synthetic or Authentic? Building Mental Patient Simulators from Longitudinal Evidence
- How Far Can VLMs Go for Visual Bug Detection? Studying 19,738 Keyframes from 41 Hours of Gameplay Videos
- Detecting Non-Membership in LLM Training Data via Rank Correlations
- Who Spoke What When? Evaluating Spoken Language Models for Conversational ASR with Semantic and Overlap-Aware Metrics
- Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints
- PopResume: Causal Fairness Evaluation of LLM/VLM Resume Screeners with Population-Representative Dataset
- Labeled Compression Schemes for Concept Classes of Finite Functions
- HyFI: Hyperbolic Feature Interpolation for Brain-Vision Alignment
- Double Coupling Architecture and Training Method for Optimization Problems of Differential Algebraic Equations with Parameters
- Spiking Personalized Federated Learning for Brain-Computer Interface-Enabled Immersive Communication
- Behavioral Heterogeneity as Quantum-Inspired Representation
- How Utilitarian Are OpenAI's Models Really? Replicating and Reinterpreting Pfeffer, Krügel, and Uhl (2025)
- SOUPLE: Enhancing Audio-Visual Localization and Segmentation with Learnable Prompt Contexts
- Explanation Generation for Contradiction Reconciliation with LLMs
- Multitask-Informed Prior for In-Context Learning on Tabular Data: Application to Steel Property Prediction
- Algorithmic warm starts for Hamiltonian Monte Carlo
- Beyond Binary Correctness: Scaling Evaluation of Long-Horizon Agents on Subjective Enterprise Tasks
- REALITrees: Rashomon Ensemble Active Learning for Interpretable Trees
- CLiGNet: Clinical Label-Interaction Graph Network for Medical Specialty Classification from Clinical Transcriptions
- PRISM: A Dual View of LLM Reasoning through Semantic Flow and Latent Computation
- KALAVAI: Predicting When Independent Specialist Fusion Works -- A Quantitative Model for Post-Hoc Cooperative LLM Training
- MVPBench: A Multi-Video Perception Evaluation Benchmark for Multi-Modal Video Understanding
- Multimodal Industrial Anomaly Detection via Geometric Prior
- Reconstruction-Guided Slot Curriculum: Addressing Object Over-Fragmentation in Video Object-Centric Learning
- ENC-Bench: A Benchmark for Evaluating Multimodal Large Language Models in Electronic Navigational Chart Understanding
- DALDALL: Data Augmentation for Lexical and Semantic Diverse in Legal Domain by leveraging LLM-Persona
- From Overload to Convergence: Supporting Multi-Issue Human-AI Negotiation with Bayesian Visualization
- Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases
- From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery
- From Arithmetic to Logic: The Resilience of Logic and Lookup-Based Neural Networks Under Parameter Bit-Flips
- Explainable Threat Attribution for IoT Networks Using Conditional SHAP and Flow Behavior Modelling
- Viewport-based Neural 360° Image Compression
- AgriPestDatabase-v1.0: A Structured Insect Dataset for Training Agricultural Large Language Model
- KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao
- Typography-Based Monocular Distance Estimation Framework for Vehicle Safety Systems
- Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models
- Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models
- Exposure-Normalized Bed and Chair Fall Rates via Continuous AI Monitoring
- Predictive Photometric Uncertainty in Gaussian Splatting for Novel View Synthesis
- Quantum Random Forest for the Regression Problem
- ABSTRAL: Automatic Design of Multi-Agent Systems Through Iterative Refinement and Topology Optimization
- Reliable Classroom AI via Neuro-Symbolic Multimodal Reasoning
- It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal
- Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG
- PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding
- Span Modeling for Idiomaticity and Figurative Language Detection with Span Contrastive Loss
- Transformers Trained via Gradient Descent Can Provably Learn a Class of Teacher Models
- Combinatorial Privacy: Private Multi-Party Bitstream Grand Sum by Hiding in Birkhoff Polytopes
