Papers
-
Aligning What EEG Can See: Structural Representations for Brain-Vision Matching
-
CoTJudger: A Graph-Driven Framework for Automatic Evaluation of Chain-of-Thought Efficiency and Redundancy in LRMs
-
Entropy-Aware On-Policy Distillation of Language Models
-
VLN-Cache: Enabling Token Caching for VLN Models with Visual/Semantic Dynamics Awareness
-
Dreamer-CDP: Improving Reconstruction-free World Models Via Continuous Deterministic Representation Prediction
-
Countdown-Code: A Testbed for Studying The Emergence and Generalization of Reward Hacking in RLVR
-
mAVE: A Watermark for Joint Audio-Visual Generation Models
-
Statistical Contraction for Chance-Constrained Trajectory Optimization of Non-Gaussian Stochastic Systems
-
Facial Expression Generation Aligned with Human Preference for Natural Dyadic Interaction
-
NuNext: Reframing Nucleus Detection as Next-Point Detection
-
Grounding Machine Creativity in Game Design Knowledge Representations: Empirical Probing of LLM-Based Executable Synthesis of Goal Playable Patterns under Structural Constraints
-
Efficient Personalized Reranking with Semi-Autoregressive Generation and Online Knowledge Distillation
-
Deep Generative Spatiotemporal Engression for Probabilistic Forecasting of Epidemics
-
Vision Language Models Cannot Reason About Physical Transformation
-
Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information
-
Efficient Chest X-ray Representation Learning via Semantic-Partitioned Contrastive Learning
-
aCAPTCHA: Verifying That an Entity Is a Capable Agent via Asymmetric Hardness
-
Turn: A Language for Agentic Computation
-
TIQA: Human-Aligned Text Quality Assessment in Generated Images
-
Inter-Image Pixel Shuffling for Multi-focus Image Fusion
-
Combining Adam and its Inverse Counterpart to Enhance Generalization of Deep Learning Optimizers
-
Deep Expert Injection for Anchoring Retinal VLMs with Domain-Specific Knowledge
-
The Model Knows Which Tokens Matter: Automatic Token Selection via Noise Gating
-
Emotion Transcription in Conversation: A Benchmark for Capturing Subtle and Complex Emotional States through Natural Language
-
PDD: Manifold-Prior Diverse Distillation for Medical Anomaly Detection
-
CanoVerse: 3D Object Scalable Canonicalization and Dataset for Generation and Pose
-
LiveWorld: Simulating Out-of-Sight Dynamics in Generative Video World Models
-
Fine-Grained Table Retrieval Through the Lens of Complex Queries
-
Agentic Planning with Reasoning for Image Styling via Offline RL
-
AMB-DSGDN: Adaptive Modality-Balanced Dynamic Semantic Graph Differential Network for Multimodal Emotion Recognition
-
Improving reasoning at inference time via uncertainty minimisation
-
Spectral Conditioning of Attention Improves Transformer Performance
-
PromptGate Client Adaptive Vision Language Gating for Open Set Federated Active Learning
-
ACD-U: Asymmetric co-teaching with machine unlearning for robust learning with noisy labels
-
Making LLMs Optimize Multi-Scenario CUDA Kernels Like Experts
-
Class Visualizations and Activation Atlases for Enhancing Interpretability in Deep Learning-Based Computational Pathology
-
Learning to Rank the Initial Branching Order of SAT Solvers
-
FreeFly-Thinking : Aligning Chain-of-Thought Reasoning with Continuous UAV Navigation
-
From State Changes to Creative Decisions: Documenting and Interpreting Traces Across Creative Domains
-
Governance Architecture for Autonomous Agent Systems: Threats, Framework, and Engineering Practice
-
FastSTAR: Spatiotemporal Token Pruning for Efficient Autoregressive Video Synthesis
-
Shaping Parameter Contribution Patterns for Out-of-Distribution Detection
-
$\textbf{Re}^{2}$: Unlocking LLM Reasoning via Reinforcement Learning with Re-solving
-
A Dual-Graph Spatiotemporal GNN Surrogate for Nonlinear Response Prediction of Reinforced Concrete Beams under Four-Point Bending
-
Lying to Win: Assessing LLM Deception through Human-AI Games and Parallel-World Probing
-
wDPO: Winsorized Direct Preference Optimization for Robust LLM Alignment
-
Towards Objective Gastrointestinal Auscultation: Automated Segmentation and Annotation of Bowel Sound Patterns
-
A Miniature Brain Transformer: Thalamic Gating, Hippocampal Lateralization, Amygdaloid Salience, and Prefrontal Working Memory in Attention-Coupled Latent Memory
-
Margin in Abstract Spaces
-
VINO: Video-driven Invariance for Non-contextual Objects via Structural Prior Guided De-contextualization
