Papers
-
Deep Reinforcement Learning and The Tale of Two Temporal Difference Errors
-
Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support
-
Instruction-Tuned, but Not More Verifiable Instruction-Following: A Cross-Task Diagnosis for LoRA Adapters
-
The Golden Subspace: Where Efficiency Meets Generalization in Continual Test-Time Adaptation
-
SatGeo-NeRF: Geometrically Regularized NeRF for Satellite Imagery
-
Camera-Agnostic Pruning of 3D Gaussian Splats via Descriptor-Based Beta Evidence
-
Chronological Contrastive Learning: Few-Shot Progression Assessment in Irreversible Diseases
-
Cross-Instance Gaussian Splatting Registration via Geometry-Aware Feature-Guided Alignment
-
MultiBind: A Benchmark for Attribute Misbinding in Multi-Subject Generation
-
FeatDistill: A Feature Distillation Enhanced Multi-Expert Ensemble Framework for Robust AI-generated Image Detection
-
SLURP-TN : Resource for Tunisian Dialect Spoken Language Understanding
-
Suiren-1.0 Technical Report: A Family of Molecular Foundation Models
-
GeoFlow: Real-Time Fine-Grained Cross-View Geolocalization via Iterative Flow Prediction
-
Group3D: MLLM-Driven Semantic Grouping for Open-Vocabulary 3D Object Detection
-
Unified Spatiotemporal Token Compression for Video-LLMs at Ultra-Low Retention
-
BHDD: A Burmese Handwritten Digit Dataset
-
Parameter-Efficient Fine-Tuning for Medical Text Summarization: A Comparative Study of Lora, Prompt Tuning, and Full Fine-Tuning
-
Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe
-
SecureBreak -- A dataset towards safe and secure models
-
BOOST-RPF: Boosted Sequential Trees for Radial Power Flow
-
GeoFusion-CAD: Structure-Aware Diffusion with Geometric State Space for Parametric 3D Design
-
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model
-
LRC-WeatherNet: LiDAR, RADAR, and Camera Fusion Network for Real-time Weather-type Classification in Autonomous Driving
-
Symbolic Graph Networks for Robust PDE Discovery from Noisy Sparse Data
-
TREX: Trajectory Explanations for Multi-Objective Reinforcement Learning
-
λ-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks
-
STENet: Superpixel Token Enhancing Network for RGB-D Salient Object Detection
-
CRPS-Optimal Binning for Conformal Regression
-
SegMaFormer: A Hybrid State-Space and Transformer Model for Efficient Segmentation
-
A plug-and-play approach with fast uncertainty quantification for weak lensing mass mapping
-
On the Challenges and Opportunities of Learned Sparse Retrieval for Code
-
6D Robotic OCT Scanning of Curved Tissue Surfaces
-
Retrieving Climate Change Disinformation by Narrative
-
ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention
-
AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing
-
Do Papers Match Code? A Benchmark and Framework for Paper-Code Consistency Detection in Bioinformatics Software
-
Tuning Real-World Image Restoration at Inference: A Test-Time Scaling Paradigm for Flow Matching Models
-
On the Interplay of Priors and Overparametrization in Bayesian Neural Network Posteriors
-
Future-Interactions-Aware Trajectory Prediction via Braid Theory
-
GTSR: Subsurface Scattering Awared 3D Gaussians for Translucent Surface Reconstruction
-
RAFL: Generalizable Sim-to-Real of Soft Robots with Residual Acceleration Field Learning
-
DTVI: Dual-Stage Textual and Visual Intervention for Safe Text-to-Image Generation
-
Uncertainty-guided Compositional Alignment with Part-to-Whole Semantic Representativeness in Hyperbolic Vision-Language Models
-
MAGPI: Multifidelity-Augmented Gaussian Process Inputs for Surrogate Modeling from Scarce Data
-
AnimalCLAP: Taxonomy-Aware Language-Audio Pretraining for Species Recognition and Trait Inference
-
FontCrafter: High-Fidelity Element-Driven Artistic Font Creation with Visual In-Context Generation
-
Dual-Space Knowledge Distillation with Key-Query Matching for Large Language Models with Vocabulary Mismatch
-
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning
-
On the Failure of Topic-Matched Contrast Baselines in Multi-Directional Refusal Abliteration
-
Adapting Point Cloud Analysis via Multimodal Bayesian Distribution Learning
MongoDB - Build AI That Scales
