Papers
-
Efficient Hallucination Detection: Adaptive Bayesian Estimation of Semantic Entropy with Guided Semantic Exploration
-
Learning What Matters Now: Dynamic Preference Inference under Contextual Shifts
-
Focus, Don't Prune: Identifying Instruction-Relevant Regions for Information-Rich Image Understanding
-
When AI Shows Its Work, Is It Actually Working? Step-Level Evaluation Reveals Frontier Language Models Frequently Bypass Their Own Reasoning
-
TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment
-
RadTimeline: Timeline Summarization for Longitudinal Radiological Lung Findings
-
Cross-Slice Knowledge Transfer via Masked Multi-Modal Heterogeneous Graph Contrastive Learning for Spatial Gene Expression Inference
-
Empirical Comparison of Agent Communication Protocols for Task Orchestration
-
Towards The Implicit Bias on Multiclass Separable Data Under Norm Constraints
-
MVRD-Bench: Multi-View Learning and Benchmarking for Dynamic Remote Photoplethysmography under Occlusion
-
Improving Safety Alignment via Balanced Direct Preference Optimization
-
Analysing LLM Persona Generation and Fairness Interpretation in Polarised Geopolitical Contexts
-
MultiCam: On-the-fly Multi-Camera Pose Estimation Using Spatiotemporal Overlaps of Known Objects
-
URA-Net: Uncertainty-Integrated Anomaly Perception and Restoration Attention Network for Unsupervised Anomaly Detection
-
UAV-DETR: DETR for Anti-Drone Target Detection
-
L-UNet: An LSTM Network for Remote Sensing Image Change Detection
-
PhySe-RPO: Physics and Semantics Guided Relative Policy Optimization for Diffusion-Based Surgical Smoke Removal
-
CoMaTrack: Competitive Multi-Agent Game-Theoretic Tracking with Vision-Language-Action Models
-
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought
-
UniQueR: Unified Query-based Feedforward 3D Reconstruction
-
Gau-Occ: Geometry-Completed Gaussians for Multi-Modal 3D Occupancy Prediction
-
Agent Audit: A Security Analysis System for LLM Agent Applications
-
Avoiding Over-smoothing in Social Media Rumor Detection with Pre-trained Propagation Tree Transformer
-
TorR: Towards Brain-Inspired Task-Oriented Reasoning via Cache-Oriented Algorithm-Architecture Co-design
-
The Coordinate System Problem in Persistent Structural Memory for Neural Architectures
-
A Feature Shuffling and Restoration Strategy for Universal Unsupervised Anomaly Detection
-
The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration
-
Agent-Sentry: Bounding LLM Agents via Execution Provenance
-
Chain-of-Authorization: Internalizing Authorization into Large Language Models via Reasoning Trajectories
-
Designing to Forget: Deep Semi-parametric Models for Unlearning
-
Dynamical Systems Theory Behind a Hierarchical Reasoning Model
-
ForeSea: AI Forensic Search with Multi-modal Queries for Video Surveillance
-
Template-Based Feature Aggregation Network for Industrial Anomaly Detection
-
Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models
-
Continuous Optimization for Satisfiability Modulo Theories on Linear Real Arithmetic
-
Confidence Calibration under Ambiguous Ground Truth
-
TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration
-
Group Editing: Edit Multiple Images in One Go
-
A Heterogeneous Long-Micro Scale Cascading Architecture for General Aviation Health Management
-
Conditionally Identifiable Latent Representation for Multivariate Time Series with Structural Dynamics
-
VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents
-
SLARM: Streaming and Language-Aligned Reconstruction Model for Dynamic Scenes
-
Off-Policy Evaluation and Learning for Survival Outcomes under Censoring
-
Separating Diagnosis from Control: Auditable Policy Adaptation in Agent-Based Simulations with LLM-Based Diagnostics
-
Dual-Teacher Distillation with Subnetwork Rectification for Black-Box Domain Adaptation
-
EchoKV: Efficient KV Cache Compression via Similarity-Based Reconstruction
-
ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling
-
From the AI Act to a European AI Agency: Completing the Union's Regulatory Architecture
-
Multilingual KokoroChat: A Multi-LLM Ensemble Translation Method for Creating a Multilingual Counseling Dialogue Dataset
-
When AVSR Meets Video Conferencing: Dataset, Degradation, and the Hidden Mechanism Behind Performance Collapse
MongoDB - Build AI That Scales
