Papers
-
VarSplat: Uncertainty-aware 3D Gaussian Splatting for Robust RGB-D SLAM
George Mason University
-
KernelSkill: A Multi-Agent Framework for GPU Kernel Optimization
Beihang University, Nanyang Technological University, Zhejiang Lab
-
GNNs for Time Series Anomaly Detection: An Open-Source Framework and a Critical Evaluation
Universidad de la República Montevideo
-
Logics-Parsing-Omni Technical Report
-
EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages
Lossfunk
-
Improving 3D Foot Motion Reconstruction in Markerless Monocular Human Motion Capture
Leibniz Universität Hannover
-
On Catastrophic Forgetting in Low-Rank Decomposition-Based Parameter-Efficient Fine-Tuning
University of British Columbia
-
Automatic Cardiac Risk Management Classification using large-context Electronic Patients Health Records
Università Campus Bio-Medico di Roma, University Medical Center Utrecht
-
Fusing Semantic, Lexical, and Domain Perspectives for Recipe Similarity Estimation
Jožef Stefan Institute, Ss. Cyril and Methodius University in Skopje, University of Ljubljana
-
AutoViVQA: A Large-Scale Automatically Constructed Dataset for Vietnamese Visual Question Answering
Vietnam National University Hanoi
-
ESAinsTOD: A Unified End-to-End Schema-Aware Instruction-Tuning Framework for Task-Oriented Dialog Modeling
Central South University, Research Center for Social Computing and Information
-
ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning
ETH Zurich
-
Physics-informed neural operator for predictive parametric phase-field modelling
Tongji University
-
DRIFT: Dual-Representation Inter-Fusion Transformer for Automated Driving Perception with 4D Radar Point Clouds
-
TemporalDoRA: Temporal PEFT for Robust Surgical Video Question Answering
Istituto di Ricovero e Cura a Carattere Scientifico, Politecnico di Milano, The University of Manchester, University College London
-
Mousse: Rectifying the Geometry of Muon with Curvature-Aware Preconditioning
Fudan University, Shanghai AI Lab, Shanghai Jiao Tong University
-
TriFusion-SR: Joint Tri-Modal Medical Image Fusion and SR
Mehran University of Engineering and Technology, Sejong University, University of Würzburg
-
ProGS: Towards Progressive Coding for 3D Gaussian Splatting
The Institute of Electrical and Electronics Engineers
-
Evaluation of LLMs in retrieving food and nutritional context for RAG systems
Jožef Stefan Institute
-
OOD-MMSafe: Advancing MLLM Safety from Harmful Intent to Hidden Consequences
-
From Phase Prediction to Phase Design: A ReAct Agent Framework for High-Entropy Alloy Discovery
Luxembourg Institute of Science and Technology, University of Luxembourg
-
MUGEN: Evaluating and Improving Multi-audio Understanding of Large Audio-Language Models
National Taiwan University
-
Does the Question Really Matter? Training-Free Data Selection for Vision-Language SFT
Nanjing University, North University of China, Shanghai Artificial Intelligence Laboratory
-
AutoAgent: Evolving Cognition and Elastic Memory Orchestration for Adaptive Agents
Institute for Advanced Algorithms Research, Shanghai Jiao Tong University
-
GSStream: 3D Gaussian Splatting based Volumetric Scene Streaming System
The Institute of Electrical and Electronics Engineers
-
FrameDiT: Diffusion Transformer with Frame-Level Matrix Attention for Efficient Video Generation
-
RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation
New York University, TCS Research, Yale University
-
ES-dLLM: Efficient Inference for Diffusion Large Language Models by Early-Skipping
Polar Bear Tech, Tsinghua University
-
A Multi-Prototype-Guided Federated Knowledge Distillation Approach in AI-RAN Enabled Multi-Access Edge Computing System
Kyung Hee University, Noakhali Science and Technology University, Sungkyunkwan University
-
EXPLORE-Bench: Egocentric Scene Prediction with Long-Horizon Reasoning
-
FetalAgents: A Multi-Agent System for Fetal Ultrasound Image and Video Analysis
Sichuan University, Tsinghua University, University of California
-
$M^2$-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs
Hunan University, Karlsruhe Institute of Technology, Sofia University
-
Let's Reward Step-by-Step: Step-Aware Contrastive Alignment for Vision-Language Navigation in Continuous Environments
Zhejiang University
-
ENIGMA-360: An Ego-Exo Dataset for Human Behavior Understanding in Industrial Scenarios
University of Catania
-
Upper Generalization Bounds for Neural Oscillators
California Institute of Technology, Leibniz Universität Hannover, The Hong Kong Polytechnic University, University of Liverpool
-
LAP: A Language-Aware Planning Model For Procedure Planning In Instructional Videos
Orebro University
-
Beyond Fine-Tuning: Robust Food Entity Linking under Ontology Drift with FoodOntoRAG
Jožef Stefan Institute, Jožef Stefan International Postgraduate School
-
LogoDiffuser: Training-Free Multilingual Logo Generation and Stylization via Letter-Aware Attention Control
Hanyang University
-
PanoAffordanceNet: Towards Holistic Affordance Grounding in 360° Indoor Environments
Hunan University
-
Ego: Embedding-Guided Personalization of Vision-Language Models
-
VCR: Variance-Driven Channel Recalibration for Robust Low-Light Enhancement
Energy Digital Intelligence Technology Development, University of Science and Technology of China
-
Removing the Trigger, Not the Backdoor: Alternative Triggers and Latent Backdoors
Delft University of Technology, Radboud University, University of Bergen, University of Zagreb
-
Global universality via discrete-time signatures
-
World2Mind: Cognition Toolkit for Allocentric Spatial Reasoning in Foundation Models
-
First Estimation of Model Parameters for Neutrino-Induced Nucleon Knockout Using Simulation-Based Inference
Fermi National Accelerator Laboratory, University of Chicago
-
EPIC-EuroParl-UdS: Information-Theoretic Perspectives on Translation and Interpreting
University of Hildesheim
-
Quantifying the Necessity of Chain of Thought through Opaque Serial Depth
-
What is Missing? Explaining Neurons Activated by Absent Concepts
Hessian.ai, Johannes Gutenberg University Mainz, Leibniz Institute for Resilience Research, Max Planck Institute for Informatics, Technical University of Darmstadt, University Medical Center Mainz
-
A Hybrid Quantum-Classical Framework for Financial Volatility Forecasting Based on Quantum Circuit Born Machines
-
Exploiting Label-Aware Channel Scoring for Adaptive Channel Pruning in Split Learning
The Institute of Electrical and Electronics Engineers
