Papers
-
System-Anchored Knee Estimation for Low-Cost Context Window Selection in PDE Forecasting
-
CARE: Training-Free Controllable Restoration for Medical Images via Dual-Latent Steering
-
Optimal High-Probability Regret for Online Convex Optimization with Two-Point Bandit Feedback
-
From Stateless to Situated: Building a Psychological World for LLM-Based Emotional Support
-
Epistemic Compression: The Case for Deliberate Ignorance in High-Stakes AI
-
Mechanistically Interpreting Compression in Vision-Language Models
-
GeoNDC: A Queryable Neural Data Cube for Planetary-Scale Earth Observation
-
Uncertainty Quantification for Quantum Computing
-
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale
-
MoRGS: Efficient Per-Gaussian Motion Reasoning for Streamable Dynamic 3D Scenes
-
MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting
-
Approaches to Analysing Historical Newspapers Using LLMs
-
Closing the Confidence-Faithfulness Gap in Large Language Models
-
GaussFusion: Improving 3D Reconstruction in the Wild with A Geometry-Informed Video Generator
-
Synergistic Event-SVE Imaging for Quantitative Propellant Combustion Diagnostics
-
The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities
-
Learning Explicit Continuous Motion Representation for Dynamic Gaussian Splatting from Monocular Videos
-
SIGMA: Structure-Invariant Generative Molecular Alignment for Chemical Language Models via Autoregressive Contrastive Learning
-
TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visualization
-
Evaluating Synthetic Images as Effective Substitutes for Experimental Data in Surface Roughness Classification
-
Ultra-fast Traffic Nowcasting and Control via Differentiable Agent-based Simulation
-
An Explainable Ensemble Learning Framework for Crop Classification with Optimized Feature Pyramids and Deep Networks
-
GIFT: Global Irreplaceability Frame Targeting for Efficient Video Understanding
-
Z-Erase: Enabling Concept Erasure in Single-Stream Diffusion Transformers
-
Sparse Visual Thought Circuits in Vision-Language Models
-
Bridging Perception and Reasoning: Token Reweighting for RLVR in Multimodal LLMs
-
Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization
-
Visual Attention Drifts,but Anchors Hold:Mitigating Hallucination in Multimodal Large Language Models via Cross-Layer Visual Anchors
-
THEMIS: Towards Holistic Evaluation of MLLMs for Scientific Paper Fraud Forensics
-
ETA-VLA: Efficient Token Adaptation via Temporal Fusion and Intra-LLM Sparsification for Vision-Language-Action Models
-
Pixelis: Reasoning in Pixels, from Seeing to Acting
-
Process-Aware AI for Rainfall-Runoff Modeling: A Mass-Conserving Neural Framework with Hydrological Process Constraints
-
ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents
-
Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization
-
From Logic Monopoly to Social Contract: Separation of Power and the Institutional Foundations for Autonomous Agent Economies
-
Unlocking Strong Supervision: A Data-Centric Study of General-Purpose Audio Pre-Training Methods
-
UCAgent: An End-to-End Agent for Block-Level Functional Verification
-
Layer-Specific Lipschitz Modulation for Fault-Tolerant Multimodal Representation Learning
-
OMIND: Framework for Knowledge Grounded Finetuning and Multi-Turn Dialogue Benchmark for Mental Health LLMs
-
Label What Matters: Modality-Balanced and Difficulty-Aware Multimodal Active Learning
-
MSRL: Scaling Generative Multimodal Reward Modeling via Multi-Stage Reinforcement Learning
-
MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness
-
SEVerA: Verified Synthesis of Self-Evolving Agents
-
Deep Learning Aided Vision System for Planetary Rovers
-
Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
-
A Comparative Investigation of Thermodynamic Structure-Informed Neural Networks
-
When Sensing Varies with Contexts: Context-as-Transform for Tactile Few-Shot Class-Incremental Learning
-
AnyDoc: Enhancing Document Generation via Large-Scale HTML/CSS Data Synthesis and Height-Aware Reinforcement Optimization
-
The Language of Touch: Translating Vibrations into Text with Dual-Branch Learning
-
MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation
