Papers
-
DB SwinT: A Dual-Branch Swin Transformer Network for Road Extraction in Optical Remote Sensing Imagery
-
UW-VOS: A Large-Scale Dataset for Underwater Video Object Segmentation
-
CVPD at QIAS 2026: RAG-Guided LLM Reasoning for Al-Mawarith Share Computation and Heir Allocation
-
Language-Grounded Multi-Agent Planning for Personalized and Fair Participatory Urban Sensing
-
COVTrack++: Learning Open-Vocabulary Multi-Object Tracking from Continuous Videos via a Synergistic Paradigm
-
ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents
-
Schema on the Inside: A Two-Phase Fine-Tuning Method for High-Efficiency Text-to-SQL at Scale
-
i-IF-Learn: Iterative Feature Selection and Unsupervised Learning for High-Dimensional Complex Data
-
Decompose and Transfer: CoT-Prompting Enhanced Alignment for Open-Vocabulary Temporal Action Detection
-
Lagrangian Relaxation Score-based Generation for Mixed Integer linear Programming
-
From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs
-
SpectralSplats: Robust Differentiable Tracking via Spectral Moment Supervision
-
A$^3$: Towards Advertising Aesthetic Assessment
-
SemLayer: Semantic-aware Generative Segmentation and Layer Construction for Abstract Icons
-
Minimal Sufficient Representations for Self-interpretable Deep Neural Networks
-
HAM: A Training-Free Style Transfer Approach via Heterogeneous Attention Modulation for Diffusion Models
-
MoE-Sieve: Routing-Guided LoRA for Efficient MoE Fine-Tuning
-
LGEST: Dynamic Spatial-Spectral Expert Routing for Hyperspectral Image Classification
-
FinToolSyn: A forward synthesis Framework for Financial Tool-Use Dialogue Data with Dynamic Tool Retrieval
-
Hierarchical Spatial-Temporal Graph-Enhanced Model for Map-Matching
-
Beyond Semantic Priors: Mitigating Optimization Collapse for Generalizable Visual Forensics
-
Mitigating Object Hallucinations in LVLMs via Attention Imbalance Rectification
-
AD-Reasoning: Multimodal Guideline-Guided Reasoning for Alzheimer's Disease Diagnosis
-
A Step Toward Federated Pretraining of Multimodal Large Language Models
-
Enhanced Mycelium of Thought (EMoT): A Bio-Inspired Hierarchical Reasoning Architecture with Strategic Dormancy and Mnemonic Encoding
-
ConceptKT: A Benchmark for Concept-Level Deficiency Prediction in Knowledge Tracing
-
The impact of sensor placement on graph-neural-network-based leakage detection
-
PosterIQ: A Design Perspective Benchmark for Poster Understanding and Generation
-
When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm
-
LLMpedia: A Transparent Framework to Materialize an LLM's Encyclopedic Knowledge at Scale
-
Brain-Inspired Multimodal Spiking Neural Network for Image-Text Retrieval
-
Knowledge-Guided Manipulation Using Multi-Task Reinforcement Learning
-
Bridging the Evaluation Gap: Standardized Benchmarks for Multi-Objective Search
-
LGTM: Training-Free Light-Guided Text-to-Image Diffusion Model via Initial Noise Manipulation
-
Dual-Graph Multi-Agent Reinforcement Learning for Handover Optimization
-
Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization
-
LaDy: Lagrangian-Dynamic Informed Network for Skeleton-based Action Segmentation via Spatial-Temporal Modulation
-
KCLNet: Electrically Equivalence-Oriented Graph Representation Learning for Analog Circuits
-
ReMemNav: A Rethinking and Memory-Augmented Framework for Zero-Shot Object Navigation
-
Causality-Driven Disentangled Representation Learning in Multiplex Graphs
-
Granular Ball Guided Stable Latent Domain Discovery for Domain-General Crowd Counting
-
Comparative analysis of dual-form networks for live land monitoring using multi-modal satellite image time series
-
Toward a Multi-Layer ML-Based Security Framework for Industrial IoT
-
Mixed-signal implementation of feedback-control optimizer for single-layer Spiking Neural Networks
-
Retinal Layer Segmentation in OCT Images With 2.5D Cross-slice Feature Fusion Module for Glaucoma Assessment
-
Combi-CAM: A Novel Multi-Layer Approach for Explainable Image Geolocalization
-
The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation
-
Alignment Reduces Expressed but Not Encoded Gender Bias: A Unified Framework and Study
-
Likelihood hacking in probabilistic program synthesis
-
On Gossip Algorithms for Machine Learning with Pairwise Objectives
