Papers
-
DBAutoDoc: Automated Discovery and Documentation of Undocumented Database Schemas via Statistical Analysis and Iterative LLM Refinement
-
Post-Selection Distributional Model Evaluation
-
Prompt Amplification and Zero-Shot Late Fusion in Audio-Language Models for Speech Emotion Recognition
-
Minibal: Balanced Game-Playing Without Opponent Modeling
-
Machine Learning Models for the Early Detection of Burnout in Software Engineering: a Systematic Literature Review
-
Mind Your HEARTBEAT! Claw Background Execution Inherently Enables Silent Memory Pollution
-
StateLinFormer: Stateful Training Enhancing Long-term Memory in Navigation
-
MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding
-
AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing
-
PolarAPP: Beyond Polarization Demosaicking for Polarimetric Applications
-
Generalization Bounds for Physics-Informed Neural Networks for the Incompressible Navier-Stokes Equations
-
Can an LLM Detect Instances of Microservice Infrastructure Patterns?
-
MsFormer: Enabling Robust Predictive Maintenance Services for Industrial Devices
-
MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models
-
Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards
-
A Synchronized Audio-Visual Multi-View Capture System
-
When Language Models Lose Their Mind: The Consequences of Brain Misalignment
-
SpecXMaster Technical Report
-
NeuroSeg Meets DINOv3: Transferring 2D Self-Supervised Visual Priors to 3D Neuron Segmentation via DINOv3 Initialization
-
High-Resolution Tensor-Network Fourier Methods for Exponentially Compressed Non-Gaussian Aggregate Distributions
-
Dual-Criterion Curriculum Learning: Application to Temporal Data
-
Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment
-
AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection
-
Automatic Segmentation of 3D CT scans with SAM2 using a zero-shot approach
-
SMSP: A Plug-and-Play Strategy of Multi-Scale Perception for MLLMs to Perceive Visual Illusions
-
PiCo: Active Manifold Canonicalization for Robust Robotic Visual Anomaly Detection
-
3rd Place of MeViS-Audio Track of the 5th PVUW: VIRST-Audio
-
Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair
-
InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance
-
A Bayesian Learning Approach for Drone Coverage Network: A Case Study on Cardiac Arrest in Scotland
-
HGNet: Scalable Foundation Model for Automated Knowledge Graph Generation from Scientific Literature
-
DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models
-
Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy
-
Describe-Then-Act: Proactive Agent Steering via Distilled Language-Action World Models
-
VoDaSuRe: A Large-Scale Dataset Revealing Domain Shift in Volumetric Super-Resolution
-
Conformal Cross-Modal Active Learning
-
UniDial-EvalKit: A Unified Toolkit for Evaluating Multi-Faceted Conversational Abilities
-
Dual Contrastive Network for Few-Shot Remote Sensing Image Scene Classification
-
PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning
-
GSwap: Realistic Head Swapping with Dynamic Neural Gaussian Field
-
Robust Safety Monitoring of Language Models via Activation Watermarking
-
From Synthetic to Native: Benchmarking Multilingual Intent Classification in Logistics Customer Service
-
A Schrödinger Eigenfunction Method for Long-Horizon Stochastic Optimal Control
-
SAiW: Source-Attributable Invisible Watermarking for Proactive Deepfake Defense
-
APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs
-
Gimbal360: Differentiable Auto-Leveling for Canonicalized $360^\circ$ Panoramic Image Completion
-
Reasoning over Semantic IDs Enhances Generative Recommendation
-
ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment
-
ViKey: Enhancing Temporal Understanding in Videos via Visual Prompting
-
Gaze-Regularized VLMs for Ego-Centric Behavior Understanding
MongoDB - Build AI That Scales
