Papers
- ENC-Bench: A Benchmark for Evaluating Multimodal Large Language Models in Electronic Navigational Chart Understanding
- DALDALL: Data Augmentation for Lexical and Semantic Diverse in Legal Domain by leveraging LLM-Persona
- From Overload to Convergence: Supporting Multi-Issue Human-AI Negotiation with Bayesian Visualization
- Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases
- From Pixels to Semantics: A Multi-Stage AI Framework for Structural Damage Detection in Satellite Imagery
- From Arithmetic to Logic: The Resilience of Logic and Lookup-Based Neural Networks Under Parameter Bit-Flips
- Explainable Threat Attribution for IoT Networks Using Conditional SHAP and Flow Behavior Modelling
- Viewport-based Neural 360° Image Compression
- AgriPestDatabase-v1.0: A Structured Insect Dataset for Training Agricultural Large Language Model
- KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao
- Typography-Based Monocular Distance Estimation Framework for Vehicle Safety Systems
- Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models
- Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models
- Exposure-Normalized Bed and Chair Fall Rates via Continuous AI Monitoring
- Predictive Photometric Uncertainty in Gaussian Splatting for Novel View Synthesis
- Quantum Random Forest for the Regression Problem
- ABSTRAL: Automatic Design of Multi-Agent Systems Through Iterative Refinement and Topology Optimization
- Reliable Classroom AI via Neuro-Symbolic Multimodal Reasoning
- It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal
- Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG
- PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding
- Span Modeling for Idiomaticity and Figurative Language Detection with Span Contrastive Loss
- Transformers Trained via Gradient Descent Can Provably Learn a Class of Teacher Models
- Combinatorial Privacy: Private Multi-Party Bitstream Grand Sum by Hiding in Birkhoff Polytopes
- Universal and efficient graph neural networks with dynamic attention for machine learning interatomic potentials
- Efficient Hallucination Detection: Adaptive Bayesian Estimation of Semantic Entropy with Guided Semantic Exploration
- Learning What Matters Now: Dynamic Preference Inference under Contextual Shifts
- Focus, Don't Prune: Identifying Instruction-Relevant Regions for Information-Rich Image Understanding
- When AI Shows Its Work, Is It Actually Working? Step-Level Evaluation Reveals Frontier Language Models Frequently Bypass Their Own Reasoning
- TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment
- RadTimeline: Timeline Summarization for Longitudinal Radiological Lung Findings
- Cross-Slice Knowledge Transfer via Masked Multi-Modal Heterogeneous Graph Contrastive Learning for Spatial Gene Expression Inference
- Empirical Comparison of Agent Communication Protocols for Task Orchestration
- Towards The Implicit Bias on Multiclass Separable Data Under Norm Constraints
- MVRD-Bench: Multi-View Learning and Benchmarking for Dynamic Remote Photoplethysmography under Occlusion
- Improving Safety Alignment via Balanced Direct Preference Optimization
- Analysing LLM Persona Generation and Fairness Interpretation in Polarised Geopolitical Contexts
- MultiCam: On-the-fly Multi-Camera Pose Estimation Using Spatiotemporal Overlaps of Known Objects
- URA-Net: Uncertainty-Integrated Anomaly Perception and Restoration Attention Network for Unsupervised Anomaly Detection
- UAV-DETR: DETR for Anti-Drone Target Detection
- L-UNet: An LSTM Network for Remote Sensing Image Change Detection
- PhySe-RPO: Physics and Semantics Guided Relative Policy Optimization for Diffusion-Based Surgical Smoke Removal
- CoMaTrack: Competitive Multi-Agent Game-Theoretic Tracking with Vision-Language-Action Models
- Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought
- UniQueR: Unified Query-based Feedforward 3D Reconstruction
- Gau-Occ: Geometry-Completed Gaussians for Multi-Modal 3D Occupancy Prediction
- Agent Audit: A Security Analysis System for LLM Agent Applications
- Avoiding Over-smoothing in Social Media Rumor Detection with Pre-trained Propagation Tree Transformer
- TorR: Towards Brain-Inspired Task-Oriented Reasoning via Cache-Oriented Algorithm-Architecture Co-design
- The Coordinate System Problem in Persistent Structural Memory for Neural Architectures
