Papers
-
TS-MLLM: A Multi-Modal Large Language Model-based Framework for Industrial Time-Series Big Data Analysis
-
Integration of deep generative Anomaly Detection algorithm in high-speed industrial line
-
KCoEvo: A Knowledge Graph Augmented Framework for Evolutionary Code Generation
-
Generalized Reduction to the Isotropy for Flexible Equivariant Neural Fields
-
Analysis-Driven Procedural Generation of an Engine Sound Dataset with Embedded Control Annotations
-
3DGS-HPC: Distractor-free 3D Gaussian Splatting with Hybrid Patch-wise Classification
-
Models as Lego Builders: Assembling Malice from Benign Blocks via Semantic Blueprints
-
Fast Attention-Based Simplification of LiDAR Point Clouds for Object Detection and Classification
-
Shorter Thoughts, Same Answers: Difficulty-Scaled Segment-Wise RL for CoT Compression
-
StyleBench: Evaluating Speech Language Models on Conversational Speaking Style Control
-
MetaSort: An Accelerated Approach for Non-uniform Compression and Few-shot Classification of Neural Spike Waveforms
-
EmbedTalk: Triplane-Free Talking Head Synthesis using Embedding-Driven Gaussian Deformation
-
TT-Sparse: Learning Sparse Rule Models with Differentiable Truth Tables
-
MAS-H2: A Hierarchical Multi-Agent System for Holistic Cloud-Native Autoscaling
-
KohakuRAG: A simple RAG framework with hierarchical document indexing
-
Looking Into the Water by Unsupervised Learning of the Surface Shape
-
Compression as Adaptation: Implicit Visual Representation with Diffusion Foundation Models
-
SMAT: Staged Multi-Agent Training for Co-Adaptive Exoskeleton Control
-
Overthinking Causes Hallucination: Tracing Confounder Propagation in Vision Language Models
-
Duala: Dual-Level Alignment of Subjects and Stimuli for Cross-Subject fMRI Decoding
-
Accelerating Diffusion Models for Generative AI Applications with Silicon Photonics
-
Exoskeleton Control through Learning to Reduce Biological Joint Moments in Simulations
-
Real-Time Glottis Detection Framework via Spatial-decoupled Feature Learning for Nasal Transnasal Intubation
-
Helix: Evolutionary Reinforcement Learning for Open-Ended Scientific Problem Solving
-
Evaluating Synthetic Data for Baggage Trolley Detection in Airport Logistics
-
AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots
-
GLASS: Graph and Vision-Language Assisted Semantic Shape Correspondence
-
Compressed Proximal Federated Learning for Non-Convex Composite Optimization on Heterogeneous Data
-
Partial Differential Equations in the Age of Machine Learning: A Critical Synthesis of Classical, Machine Learning, and Hybrid Methods
-
Scaling Test-Time Robustness of Vision-Language Models via Self-Critical Inference Framework
-
Ref-DGS: Reflective Dual Gaussian Splatting
-
AI-Driven Phase Identification from X-ray Hyperspectral Imaging of cycled Na-ion Cathode Materials
-
FusionRegister: Every Infrared and Visible Image Fusion Deserves Registration
-
Memory for Autonomous LLM Agents:Mechanisms, Evaluation, and Emerging Frontiers
-
Beyond Surrogates: A Quantitative Analysis for Inter-Metric Relationships
-
A Primer on Evolutionary Frameworks for Near-Field Multi-Source Localization
-
Mitigating the Memory Bottleneck with Machine Learning-Driven and Data-Aware Microarchitectural Techniques
-
UniUncer: Unified Dynamic Static Uncertainty for End to End Driving
-
FrameVGGT: Frame Evidence Rolling Memory for streaming VGGT
-
RoboPCA: Pose-centered Affordance Learning from Human Demonstrations for Robot Manipulation
-
Compressed-Domain-Aware Online Video Super-Resolution
-
Learning Context-Adaptive Motion Priors for Masked Motion Diffusion Models with Efficient Kinematic Attention Aggregation
-
Global Convergence of Average Reward Constrained MDPs with Neural Critic and General Policy Parameterization
-
EDMFormer: Genre-Specific Self-Supervised Learning for Music Structure Segmentation
-
TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward
-
Step-Size Decay and Structural Stagnation in Greedy Sparse Learning
-
PARSE: Part-Aware Relational Spatial Modeling
-
Deep Incentive Design with Differentiable Equilibrium Blocks
-
VoiceSHIELD-Small: Real-Time Malicious Speech Detection and Transcription
-
YAQIN: Culturally Sensitive, Agentic AI for Mental Healthcare Support Among Muslim Women in the UK
