Papers
- Beyond Quadratic: Linear-Time Change Detection with RWKV
- Enhancing AI-Based Tropical Cyclone Track and Intensity Forecasting via Systematic Bias Correction
- Physion-Eval: Evaluating Physical Realism in Generated Video via Human Reasoning
- FB-CLIP: Fine-Grained Zero-Shot Anomaly Detection with Foreground-Background Disentanglement
- LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment
- ParallelVLM: Lossless Video-LLM Acceleration with Visual Alignment Aware Parallel Speculative Decoding
- Emergency Preemption Without Online Exploration: A Decision Transformer Approach
- Demonstrations, CoT, and Prompting: A Theoretical Analysis of ICL
- OrbitNVS: Harnessing Video Diffusion Priors for Novel View Synthesis
- CAF-Score: Calibrating CLAP with LALMs for Reference-free Audio Captioning Evaluation
- UniPR: Unified Object-level Real-to-Sim Perception and Reconstruction from a Single Stereo Pair
- On Performance Guarantees for Federated Learning with Personalized Constraints
- DeepStock: Reinforcement Learning with Policy Regularizations for Inventory Management
- Forward and inverse problems for measure flows in Bayes Hilbert spaces
- Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement
- Continual Learning for Food Category Classification Dataset: Enhancing Model Adaptability and Performance
- IUP-Pose: Decoupled Iterative Uncertainty Propagation for Real-time Relative Pose Regression via Implicit Dense Alignment v1
- ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography
- Dual Prompt-Driven Feature Encoding for Nighttime UAV Tracking
- On the role of memorization in learned priors for geophysical inverse problems
- Alternating Diffusion for Proximal Sampling with Zeroth Order Queries
- MetaCues: Enabling Critical Engagement with Generative AI for Information Seeking and Sensemaking
- BEAVER: A Training-Free Hierarchical Prompt Compression Method via Structure-Aware Page Selection
- RiboSphere: Learning Unified and Efficient Representations of RNA Structures
- UniBioTransfer: A Unified Framework for Multiple Biometrics Transfer
- HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning
- OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework
- Heavy-Tailed and Long-Range Dependent Noise in Stochastic Approximation: A Finite-Time Analysis
- PolicySim: An LLM-Based Agent Social Simulation Sandbox for Proactive Policy Optimization
- Ensembles-based Feature Guided Analysis
- GravCal: Single-Image Calibration of IMU Gravity Priors with Per-Sample Confidence
- Model Selection and Parameter Estimation of Multi-dimensional Gaussian Mixture Model
- CS-MUNet: A Channel-Spatial Dual-Stream Mamba Network for Multi-Organ Segmentation
- Semantic Audio-Visual Navigation in Continuous Environments
- The Residual Stream Is All You Need: On the Redundancy of the KV Cache in Transformer Inference
- Toward High-Fidelity Visual Reconstruction: From EEG-Based Conditioned Generation to Joint-Modal Guided Rebuilding
- Structured Prompting for Arabic Essay Proficiency: A Trait-Centric Evaluation Approach
- Scale-Dependent Radial Geometry and Metric Mismatch in Wasserstein Propagation for Reverse Diffusion
- Geometric Mixture-of-Experts with Curvature-Guided Adaptive Routing for Graph Representation Learning
- Making Video Models Adhere to User Intent with Minor Adjustments
- DynFlowDrive: Flow-Based Dynamic World Modeling for Autonomous Driving
- ATHENA: Adaptive Test-Time Steering for Improving Count Fidelity in Diffusion Models
- GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems
- Vision-Language Attribute Disentanglement and Reinforcement for Lifelong Person Re-Identification
- Unbiased Dynamic Multimodal Fusion
- 3D Gaussian Splatting with Self-Constrained Priors for High Fidelity Surface Reconstruction
- Ontology-Based Knowledge Modeling and Uncertainty-Aware Outdoor Air Quality Assessment Using Weighted Interval Type-2 Fuzzy Logic
- TSegAgent: Zero-Shot Tooth Segmentation via Geometry-Aware Vision-Language Agents
- A Subgoal-driven Framework for Improving Long-Horizon LLM Agents
- Diminishing Returns in Expanding Generative Models and Godel-Tarski-Lob Limits
