Papers
-
TEA-Time: Transporting Effects Across Time
Duke University, Yale University
-
AutoChecklist: Composable Pipelines for Checklist Generation and Scoring with LLM-as-a-Judge
University of Chicago
-
RESCHED: Rethinking Flexible Job Shop Scheduling from a Transformer-based Architecture with Simplified States
Nanyang Technological University, Shandong University, Singapore Management University
-
OV-DEIM: Real-time DETR-Style Open-Vocabulary Object Detection with GridSynthetic Augmentation
Carleton College, Guangdong Laboratory of Artificial Intelligence and Digital Economy, Institute for Research in Biomedicine, Shenzhen University
-
Hit-RAG: Learning to Reason with Long Contexts via Preference Alignment
Huazhong University of Science and Technology, Shenzhen University of Advanced Technology, The City University of New York, Tongji University, University of Technology Sydney
-
Enhancing Web Agents with a Hierarchical Memory Tree
Beijing Institute of Technology
-
Language-Aware Distillation for Multilingual Instruction-Following Speech LLMs with ASR-Only Supervision
Agency for Science, Technology and Research, Singapore, Institute for Infocomm Research, Nanyang Technological University, National University of Singapore
-
Permutation-Equivariant 2D State Space Models: Theory and Canonical Architecture for Multivariate Time Series
Korea University
-
Resource-Adaptive Federated Text Generation with Differential Privacy
Oak Ridge National Laboratory
-
Two Frames Matter: A Temporal Attack for Text-to-Video Model Jailbreaking
Beihang University, Wenzhou-Kean University
-
Targeted Bit-Flip Attacks on LLM-Based Agents
Huazhong University of Science and Technology, National University of Singapore, Quan Cheng Laboratory, Tsinghua University
-
Self-Supervised Multi-Modal World Model with 4D Space-Time Embedding
Allen Institute for AI, Arizona State University, Georgia Institute of Technology, Stanford University, University of Florida, University of Houston, University of Illinois Urbana-Champaign
-
Fine-Grained 3D Facial Reconstruction for Micro-Expressions
-
Looking Back and Forth: Cross-Image Attention Calibration and Attentive Preference Learning for Multi-Image Hallucination Mitigation
Beijing Institute of Technology, Harbin Institute of Technology, Tsinghua University
-
Hindsight Credit Assignment for Long-Horizon LLM Agents
City University of Hong Kong, Nanjing University
-
Animating Petascale Time-varying Data on Commodity Hardware with LLM-assisted Scripting
University of Utah, Vanderbilt University
-
Bi-directional digital twin prototype anchoring with multi-periodicity learning for few-shot fault diagnosis
Shanghai Jiao Tong University
-
SODA: Sensitivity-Oriented Dynamic Acceleration for Diffusion Transformer
Harbin Institute of Technology
-
MedSteer: Counterfactual Endoscopic Synthesis via Training-Free Activation Steering
-
User Review Writing via Interview with Dialogue Systems
The University of Electro-Communications
-
VirtueBench: Evaluating Trustworthiness under Uncertainty in Long Video Understanding
Peking University
-
The Talking Robot: Distortion-Robust Acoustic Models for Robot-Robot Communication
Georgia Institute of Technology, Institute of Science Tokyo
-
Interpretable Maximum Margin Deep Anomaly Detection
Capital Normal University, Yunnan University
-
Physics-Guided VLM Priors for All-Cloud Removal
Wuhan University
-
Retinex Meets Language: A Physics-Semantics-Guided Underwater Image Enhancement Network
Ocean University of China
-
Aligning What EEG Can See: Structural Representations for Brain-Vision Matching
Beijing University of Posts and Telecommunications
-
CoTJudger: A Graph-Driven Framework for Automatic Evaluation of Chain-of-Thought Efficiency and Redundancy in LRMs
Beihang University, Chinese Academy of Sciences, Nanjing University, Shenzhen Institutes of Advanced Technology, Shenzhen University of Advanced Technology, Southeast University, The University of Manchester, The University of New South Wales, University of Science and Technology of China
-
Entropy-Aware On-Policy Distillation of Language Models
Korea Advanced Institute of Science & Technology, University of Toronto, Vector Institute
-
VLN-Cache: Enabling Token Caching for VLN Models with Visual/Semantic Dynamics Awareness
China University of Geosciences, Huazhong University of Science and Technology, Peking University
-
Dreamer-CDP: Improving Reconstruction-free World Models Via Continuous Deterministic Representation Prediction
Friedrich Miescher Institute for Biomedical Research
-
Countdown-Code: A Testbed for Studying The Emergence and Generalization of Reward Hacking in RLVR
University of Illinois Urbana-Champaign, University of Michigan
-
mAVE: A Watermark for Joint Audio-Visual Generation Models
Tsinghua University
-
Statistical Contraction for Chance-Constrained Trajectory Optimization of Non-Gaussian Stochastic Systems
University of Illinois Urbana-Champaign
-
Facial Expression Generation Aligned with Human Preference for Natural Dyadic Interaction
-
NuNext: Reframing Nucleus Detection as Next-Point Detection
Harbin Institute of Technology, Nanjing University, Shanghai Artificial Intelligence Laboratory, University of Science and Technology Beijing, Westlake University, Zhejiang University
-
Grounding Machine Creativity in Game Design Knowledge Representations: Empirical Probing of LLM-Based Executable Synthesis of Goal Playable Patterns under Structural Constraints
Chalmers University of Technology, University of Gothenburg
-
Efficient Personalized Reranking with Semi-Autoregressive Generation and Online Knowledge Distillation
Beijing University of Posts and Telecommunications, Shanghai Jiao Tong University, University of Science and Technology of China
-
Deep Generative Spatiotemporal Engression for Probabilistic Forecasting of Epidemics
Sorbonne Center for Artificial Intelligence, Sorbonne University, Sorbonne University Abu Dhabi
-
Vision Language Models Cannot Reason About Physical Transformation
Auburn University, Brown University, Carnegie Mellon University, Emory University, Johns Hopkins University, University of California, University of Michigan, University of Toronto
-
Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information
The University of Electro-Communications
-
Efficient Chest X-ray Representation Learning via Semantic-Partitioned Contrastive Learning
Shenzhen University of Advanced Technology
-
aCAPTCHA: Verifying That an Entity Is a Capable Agent via Asymmetric Hardness
Nankai University, Tsinghua University
-
Turn: A Language for Agentic Computation
-
TIQA: Human-Aligned Text Quality Assessment in Generated Images
Innopolis University, Ivannikov Institute for System Programming of the Russian Academy of Sciences, Moscow State University
-
Inter-Image Pixel Shuffling for Multi-focus Image Fusion
Huaqiao University
-
Combining Adam and its Inverse Counterpart to Enhance Generalization of Deep Learning Optimizers
-
Deep Expert Injection for Anchoring Retinal VLMs with Domain-Specific Knowledge
-
The Model Knows Which Tokens Matter: Automatic Token Selection via Noise Gating
-
Emotion Transcription in Conversation: A Benchmark for Capturing Subtle and Complex Emotional States through Natural Language
-
PDD: Manifold-Prior Diverse Distillation for Medical Anomaly Detection
