Papers
- VGS-Decoding: Visual Grounding Score Guided Decoding for Hallucination Mitigation in Medical VLMs
- Bridging Conformal Prediction and Scenario Optimization: Discarded Constraints and Modular Risk Allocation
- Optimizing Resource-Constrained Non-Pharmaceutical Interventions for Multi-Cluster Outbreak Control Using Hierarchical Reinforcement Learning
- Rolling-Origin Validation Reverses Model Rankings in Multi-Step PM10 Forecasting: XGBoost, SARIMA, and Persistence
- Scalable Prompt Routing via Fine-Grained Latent Task Discovery
- Investigating In-Context Privacy Learning by Integrating User-Facing Privacy Tools into Conversational Agents
- Pseudo-Labeling for Unsupervised Domain Adaptation with Kernel GLMs
- The Autonomy Tax: Defense Training Breaks LLM Agents
- Is Evaluation Awareness Just Format Sensitivity? Limitations of Probe-Based Evidence under Controlled Prompt Structure
- Vocabulary shapes cross-lingual variation of word-order learnability in language models
- When both Grounding and not Grounding are Bad -- A Partially Grounded Encoding of Planning into SAT (Extended Version)
- Subspace Projection Methods for Fast Spectral Embeddings of Evolving Graphs
- Near-Equivalent Q-learning Policies for Dynamic Treatment Regimes
- LoFi: Location-Aware Fine-Grained Representation Learning for Chest X-ray
- TrustFlow: Topic-Aware Vector Reputation Propagation for Multi-Agent Ecosystems
- Cooperation and Exploitation in LLM Policy Synthesis for Sequential Social Dilemmas
- In-the-Wild Camouflage Attack on Vehicle Detectors through Controllable Image Editing
- GeoLAN: Geometric Learning of Latent Explanatory Directions in Large Language Models
- Deep Hilbert--Galerkin Methods for Infinite-Dimensional PDEs and Optimal Control
- Global Convergence of Multiplicative Updates for the Matrix Mechanism: A Collaborative Proof with Gemini 3
- ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models
- A Framework for Formalizing LLM Agent Security
- Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL
- Reinforcement-guided generative protein language models enable de novo design of highly diverse AAV capsids
- TRACE: Trajectory Recovery with State Propagation Diffusion for Urban Mobility
- Narrative Aligned Long Form Video Question Answering
- Instruction-Free Tuning of Large Vision Language Models for Medical Instruction Following
- Any-Subgroup Equivariant Networks via Symmetry Breaking
- VeloxNet: Efficient Spatial Gating for Lightweight Embedded Image Classification
- ICLAD: In-Context Learning for Unified Tabular Anomaly Detection Across Supervision Regimes
- Bypassing Document Ingestion: An MCP Approach to Financial Q&A
- Teaching an Agent to Sketch One Part at a Time
- Stochastic Sequential Decision Making over Expanding Networks with Graph Filtering
- Vision Tiny Recursion Model (ViTRM): Parameter-Efficient Image Classification via Recursive State Refinement
- Beyond the Desk: Barriers and Future Opportunities for AI to Assist Scientists in Embodied Physical Tasks
- Which Workloads Belong in Orbit? A Workload-First Framework for Orbital Data Centers Using Semantic Abstraction
- Linear Social Choice with Few Queries: A Moment-Based Approach
- FedAgain: A Trust-Based and Robust Federated Learning Strategy for an Automated Kidney Stone Identification in Ureteroscopy
- Learning to Disprove: Formal Counterexample Generation with Large Language Models
- ItinBench: Benchmarking Planning Across Multiple Cognitive Dimensions with Large Language Models
- Gastric-X: A Multimodal Multi-Phase Benchmark Dataset for Advancing Vision-Language Models in Gastric Cancer Analysis
- ReXInTheWild: A Unified Benchmark for Medical Photograph Understanding
- Inducing Sustained Creativity and Diversity in Large Language Models
- Recognising BSL Fingerspelling in Continuous Signing Sequences
- The Causal Impact of Tool Affordance on Safety Alignment in LLM Agents
- GIP-RAG: An Evidence-Grounded Retrieval-Augmented Framework for Interpretable Gene Interaction and Pathway Impact Analysis
- Depictions of Depression in Generative AI Video Models: A Preliminary Study of OpenAI's Sora 2
- SurfaceXR: Fusing Smartwatch IMUs and Egocentric Hand Pose for Seamless Surface Interactions
- The Efficiency Attenuation Phenomenon: A Computational Challenge to the Language of Thought Hypothesis
- dinov3.seg: Open-Vocabulary Semantic Segmentation with DINOv3
