Papers
-
Generating and Evaluating Sustainable Procurement Criteria for the Swiss Public Sector using In-Context Prompting with Large Language Models
-
High Resolution Flood Extent Detection Using Deep Learning with Random Forest Derived Training Labels
-
LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface
-
Adversarial Vulnerabilities in Neural Operator Digital Twins: Gradient-Free Attacks on Nuclear Thermal-Hydraulic Surrogates
-
Learning Sidewalk Autopilot from Multi-Scale Imitation with Corrective Behavior Expansion
-
GraphRAG for Engineering Diagrams: ChatP&ID Enables LLM Interaction with P&IDs
-
Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos
-
Multimodal Training to Unimodal Deployment: Leveraging Unstructured Data During Training to Optimize Structured Data Only Deployment
-
UrbanVGGT: Scalable Sidewalk Width Estimation from Street View Images
-
Generalized multi-object classification and tracking with sparse feature resonator networks
-
Maximum Entropy Relaxation of Multi-Way Cardinality Constraints for Synthetic Population Generation
-
AI Mental Models: Learned Intuition and Deliberation in a Bounded Neural Architecture
-
Privacy-Preserving Reinforcement Learning from Human Feedback via Decoupled Reward Modeling
-
MIOFlow 2.0: A unified framework for inferring cellular stochastic dynamics from single cell and spatial transcriptomics data
-
Reddit After Roe: A Computational Analysis of Abortion Narratives and Barriers in the Wake of Dobbs
-
CanViT: Toward Active-Vision Foundation Models
-
FullCircle: Effortless 3D Reconstruction from Casual 360$^\circ$ Captures
-
CAPITU: A Benchmark for Evaluating Instruction-Following in Brazilian Portuguese with Literary Context
-
STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving
-
Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?
-
A vision-language model and platform for temporally mapping surgery from video
-
A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks
-
flexvec: SQL Vector Retrieval with Programmatic Embedding Modulation
-
Precision-Varying Prediction (PVP): Robustifying ASR systems against adversarial attacks
-
Language Models Can Explain Visual Features via Steering
-
TrajLoom: Dense Future Trajectory Generation from Video
-
Dress-ED: Instruction-Guided Editing for Virtual Try-On and Try-Off
-
Understanding LLM Performance Degradation in Multi-Instance Processing: The Roles of Instance Count and Context Length
-
Do Consumers Accept AIs as Moral Compliance Agents?
-
Bridging the Know-Act Gap via Task-Level Autoregressive Reasoning
-
Causal Discovery in Action: Learning Chain-Reaction Mechanisms from Interventions
-
Transfer learning via interpolating structures
-
A Vision Language Model for Generating Procedural Plant Architecture Representations from Simulated Images
-
To Agree or To Be Right? The Grounding-Sycophancy Tradeoff in Medical Vision-Language Models
-
Toward Faithful Segmentation Attribution via Benchmarking and Dual-Evidence Fusion
-
Upper Entropy for 2-Monotone Lower Probabilities
-
PIVM: Diffusion-Based Prior-Integrated Variation Modeling for Anatomically Precise Abdominal CT Synthesis
-
Single-Subject Multi-View MRI Super-Resolution via Implicit Neural Representations
-
LGSE: Lexically Grounded Subword Embedding Initialization for Low-Resource Language Adaptation
-
CAM3R: Camera-Agnostic Model for 3D Reconstruction
-
Graph-Aware Late Chunking for Retrieval-Augmented Generation in Biomedical Literature
-
Learning to Trust: How Humans Mentally Recalibrate AI Confidence Signals
-
Q-Tacit: Image Quality Assessment via Latent Visual Reasoning
-
Multi-Method Validation of Large Language Model Medical Translation Across High- and Low-Resource Languages
-
CAPTCHA Solving for Native GUI Agents: Automated Reasoning-Action Data Generation and Self-Corrective Training
-
Overfitting and Generalizing with (PAC) Bayesian Prediction in Noisy Binary Classification
-
AwesomeLit: Towards Hypothesis Generation with Agent-Supported Literature Research
-
Pretext Matters: An Empirical Study of SSL Methods in Medical Imaging
-
MAGICIAN: Efficient Long-Term Planning with Imagined Gaussians for Active Mapping
-
Cross-Context Verification: Hierarchical Detection of Benchmark Contamination through Session-Isolated Analysis
MongoDB - Build AI That Scales
