Papers
-
Unpaired Cross-Domain Calibration of DMSP to VIIRS Nighttime Light Data Based on CUT Network
-
DermaFlux: Synthetic Skin Lesion Generation with Rectified Flows for Enhanced Image Classification
-
Robust Physics-Guided Diffusion for Full-Waveform Inversion
-
Fanar 2.0: Arabic Generative AI Stack
-
Deep Reinforcement Learning-Assisted Automated Operator Portfolio for Constrained Multi-objective Optimization
-
Near-light Photometric Stereo with Symmetric Lights
-
Over-the-air White-box Attack on the Wav2Vec Speech Recognition Neural Network
-
Who Benchmarks the Benchmarks? A Case Study of LLM Evaluation in Icelandic
-
PlotTwist: A Creative Plot Generation Framework with Small Language Models
-
RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery
-
Trained Persistent Memory for Frozen Encoder--Decoder LLMs: Six Architectural Methods
-
IndexRAG: Bridging Facts for Cross-Document Reasoning at Index Time
-
Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences
-
HGP-Mamba: Integrating Histology and Generated Protein Features for Mamba-based Multimodal Survival Risk Prediction
-
SF-Mamba: Rethinking State Space Model for Vision
-
3D Fourier-based Global Feature Extraction for Hyperspectral Image Classification
-
Cross-modal learning for plankton recognition
-
An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU
-
LenghuSky-8: An 8-Year All-Sky Cloud Dataset with Star-Aware Masks and Alt-Az Calibration for Segmentation and Nowcasting
-
EngGPT2: Sovereign, Efficient and Open Intelligence
-
IRIS: A Real-World Benchmark for Inverse Recovery and Identification of Physical Dynamic Systems from Monocular Video
-
From Natural Language to Executable Option Strategies via Large Language Models
-
VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization
-
DISCOVER: A Solver for Distributional Counterfactual Explanations
-
CD-FKD: Cross-Domain Feature Knowledge Distillation for Robust Single-Domain Generalization in Object Detection
-
Capability-Guided Compression: Toward Interpretability-Aware Budget Allocation for Large Language Models
-
Fast-HaMeR: Boosting Hand Mesh Reconstruction using Knowledge Distillation
-
Hybrid Classical-Quantum Transfer Learning with Noisy Quantum Circuits
-
Visual Distraction Undermines Moral Reasoning in Vision-Language Models
-
Unified Removal of Raindrops and Reflections: A New Benchmark and A Novel Pipeline
-
ProgressiveAvatars: Progressive Animatable 3D Gaussian Avatars
-
TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas
-
TinyGLASS: Real-Time Self-Supervised In-Sensor Anomaly Detection
-
RetailBench: Evaluating Long-Horizon Autonomous Decision-Making and Strategy Stability of LLM Agents in Realistic Retail Environments
-
Evo-Retriever: LLM-Guided Curriculum Evolution with Viewpoint-Pathway Collaboration for Multimodal Document Retrieval
-
DynHD: Hallucination Detection for Diffusion Large Language Models via Denoising Dynamics Deviation Learning
-
Are a Thousand Words Better Than a Single Picture? Beyond Images -- A Framework for Multi-Modal Knowledge Graph Dataset Enrichment
-
GAP-MLLM: Geometry-Aligned Pre-training for Activating 3D Spatial Perception in Multimodal Large Language Models
-
Linearized Bregman Iterations for Sparse Spiking Neural Networks
-
Follow the Clues, Frame the Truth: Hybrid-evidential Deductive Reasoning in Open-Vocabulary Multimodal Emotion Recognition
-
Multi-Agent Reinforcement Learning Counteracts Delayed CSI in Multi-Satellite Systems
-
The State of Generative AI in Software Development: Insights from Literature and a Developer Survey
-
Breaking the Chain: A Causal Analysis of LLM Faithfulness to Intermediate Structures
-
Optimal uncertainty bounds for multivariate kernel regression under bounded noise: A Gaussian process-based dual function
-
DST-Net: A Dual-Stream Transformer with Illumination-Independent Feature Guidance and Multi-Scale Spatial Convolution for Low-Light Image Enhancement
-
On the Emotion Understanding of Synthesized Speech
-
Implementation of tangent linear and adjoint models for neural networks based on a compiler library tool
-
Unlearning for One-Step Generative Models via Unbalanced Optimal Transport
-
ExpressMind: A Multimodal Pretrained Large Language Model for Expressway Operation
-
AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents
MongoDB - Build AI That Scales
