Papers
-
Towards Understanding Adam Convergence on Highly Degenerate Polynomials
-
BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers
-
Nonparametric Variational Differential Privacy via Embedding Parameter Clipping
-
Memorization capacity of deep ReLU neural networks characterized by width and depth
-
Build, Borrow, or Just Fine-Tune? A Political Scientist's Guide to Choosing NLP Models
-
Symbolic Discovery of Stochastic Differential Equations with Genetic Programming
-
Detecting Miscitation on the Scholarly Web through LLM-Augmented Text-Rich Graph Learning
-
A Variational Latent Equilibrium for Learning in Neuronal Circuits
-
MM-algorithms for traditional and convex NMF with Tweedie and Negative Binomial cost functions and empirical evaluation
-
Digging Deeper: Learning Multi-Level Concept Hierarchies
-
Learning the Hierarchical Organization in Brain Network for Brain Disorder Diagnosis
-
ParTY: Part-Guidance for Expressive Text-to-Motion Synthesis
-
A Saccade-inspired Approach to Image Classification using Vision Transformer Attention Maps
-
Surgical Repair of Collapsed Attention Heads in ALiBi Transformers
-
Context Engineering: From Prompts to Corporate Multi-Agent Architecture
-
Physics-Driven 3D Gaussian Rendering for Zero-Shot MRI Super-Resolution
-
Decoder-Free Distillation for Quantized Image Restoration
-
Grounding Synthetic Data Generation With Vision and Language Models
-
X-GS: An Extensible Open Framework for Perceiving and Thinking via 3D Gaussian Splatting
-
Tracking Cancer Through Text: Longitudinal Extraction From Radiology Reports Using Open-Source Large Language Models
-
PRECEPT: Planning Resilience via Experience, Context Engineering & Probing Trajectories A Unified Framework for Test-Time Adaptation with Compositional Rule Learning and Pareto-Guided Prompt Evolution
-
Multi-DNN Inference of Sparse Models on Edge SoCs
-
MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings
-
Evolution of Photonic Quantum Machine Learning under Noise
-
Well Log-Guided Synthesis of Subsurface Images from Sparse Petrography Data Using cGANs
-
MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants
-
OTPL-VIO: Robust Visual-Inertial Odometry with Optimal Transport Line Association and Adaptive Uncertainty
-
Understanding the Interplay between LLMs' Utilisation of Parametric and Contextual Knowledge: A keynote at ECIR 2025
-
When to Lock Attention: Training-Free KV Control in Video Diffusion
-
FreqCycle: A Multi-Scale Time-Frequency Analysis Method for Time Series Forecasting
-
No evaluation without fair representation : Impact of label and selection bias on the evaluation, performance and mitigation of classification models
-
DiffWind: Physics-Informed Differentiable Modeling of Wind-Driven Object Dynamics
-
VarSplat: Uncertainty-aware 3D Gaussian Splatting for Robust RGB-D SLAM
-
KernelSkill: A Multi-Agent Framework for GPU Kernel Optimization
-
GNNs for Time Series Anomaly Detection: An Open-Source Framework and a Critical Evaluation
-
Logics-Parsing-Omni Technical Report
-
EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages
-
Improving 3D Foot Motion Reconstruction in Markerless Monocular Human Motion Capture
-
On Catastrophic Forgetting in Low-Rank Decomposition-Based Parameter-Efficient Fine-Tuning
-
Automatic Cardiac Risk Management Classification using large-context Electronic Patients Health Records
-
Fusing Semantic, Lexical, and Domain Perspectives for Recipe Similarity Estimation
-
AutoViVQA: A Large-Scale Automatically Constructed Dataset for Vietnamese Visual Question Answering
-
ESAinsTOD: A Unified End-to-End Schema-Aware Instruction-Tuning Framework for Task-Oriented Dialog Modeling
-
ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning
-
Physics-informed neural operator for predictive parametric phase-field modelling
-
DRIFT: Dual-Representation Inter-Fusion Transformer for Automated Driving Perception with 4D Radar Point Clouds
-
TemporalDoRA: Temporal PEFT for Robust Surgical Video Question Answering
-
Mousse: Rectifying the Geometry of Muon with Curvature-Aware Preconditioning
-
TriFusion-SR: Joint Tri-Modal Medical Image Fusion and SR
-
ProGS: Towards Progressive Coding for 3D Gaussian Splatting
