Papers
-
Structure-Aware Multimodal LLM Framework for Trustworthy Near-Field Beam Prediction
-
Deep Adaptive Model-Based Design of Experiments
-
NeuronSpark: A Spiking Neural Network Language Model with Selective State Space Dynamics
-
EFF-Grasp: Energy-Field Flow Matching for Physics-Aware Dexterous Grasp Generation
-
HIPO: Instruction Hierarchy via Constrained Reinforcement Learning
-
GATS: Gaussian Aware Temporal Scaling Transformer for Invariant 4D Spatio-Temporal Point Cloud Representation
-
DyJR: Preserving Diversity in Reinforcement Learning with Verifiable Rewards via Dynamic Jensen-Shannon Replay
-
Execution-Grounded Credit Assignment for GRPO in Code Generation
-
AI-Generated Figures in Academic Publishing: Policies, Tools, and Practical Guidelines
-
Segmentation-before-Staining Improves Structural Fidelity in Virtual IHC-to-Multiplex IF Translation
-
SQL-ASTRA: Alleviating Sparse Feedback in Agentic SQL via Column-Set Matching and Trajectory Aggregation
-
STARK: Spatio-Temporal Attention for Representation of Keypoints for Continuous Sign Language Recognition
-
Homogeneous and Heterogeneous Consistency progressive Re-ranking for Visible-Infrared Person Re-identification
-
Topology-Guided Biomechanical Profiling: A White-Box Framework for Opportunistic Screening of Spinal Instability on Routine CT
-
SignNav: Leveraging Signage for Semantic Visual Navigation in Large-Scale Indoor Environments
-
Open-Source Reproduction and Explainability Analysis of Corrective Retrieval Augmented Generation
-
MemX: A Local-First Long-Term Memory System for AI Assistants
-
The Finetuner's Fallacy: When to Pretrain with Your Finetuning Data
-
360° Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method
-
KidsNanny: A Two-Stage Multimodal Content Moderation Pipeline Integrating Visual Classification, Object Detection, OCR, and Contextual Reasoning for Child Safety
-
Polyglot-Lion: Efficient Multilingual ASR for Singapore via Balanced Fine-Tuning of Qwen3-ASR
-
A General Deep Learning Framework for Wireless Resource Allocation under Discrete Constraints
-
Sample-Efficient Adaptation of Drug-Response Models to Patient Tumors under Strong Biological Domain Shift
-
ECHO: Edge-Cloud Humanoid Orchestration for Language-to-Motion Control
-
Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning
-
Structured Semantic Cloaking for Jailbreak Attacks on Large Language Models
-
S-VAM: Shortcut Video-Action Model by Self-Distilling Geometric and Semantic Foresight
-
Are Large Language Models Truly Smarter Than Humans?
-
Online Semi-infinite Linear Programming: Efficient Algorithms via Function Approximation
-
Robust Generative Audio Quality Assessment: Disentangling Quality from Spurious Correlations
-
A Scoping Review of AI-Driven Digital Interventions in Mental Health Care: Mapping Applications Across Screening, Support, Monitoring, Prevention, and Clinical Education
-
Offline Exploration-Aware Fine-Tuning for Long-Chain Mathematical Reasoning
-
Proactive Rejection and Grounded Execution: A Dual-Stage Intent Analysis Paradigm for Safe and Efficient AIoT Smart Homes
-
MOSAIC: Composable Safety Alignment with Modular Control Tokens
-
Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation
-
CoMAI: A Collaborative Multi-Agent Framework for Robust and Equitable Interview Evaluation
-
Generative AI for Quantum Circuits and Quantum Code: A Technical Review and Taxonomy
-
SpecSteer: Synergizing Local Context and Global Reasoning for Efficient Personalized Generation
-
Dual Consensus: Escaping from Spurious Majority in Unsupervised RLVR via Two-Stage Vote Mechanism
-
Ground Reaction Inertial Poser: Physics-based Human Motion Capture from Sparse IMUs and Insole Pressure Sensors
-
ReFORM: Review-aggregated Profile Generation via LLM with Multi-Factor Attention for Restaurant Recommendation
-
PureCLIP-Depth: Prompt-Free and Decoder-Free Monocular Depth Estimation within CLIP Embedding Space
-
Neural Pushforward Samplers for the Fokker-Planck Equation on Embedded Riemannian Manifolds
-
Exclusivity-Guided Mask Learning for Semi-Supervised Crowd Instance Segmentation and Counting
-
RASLF: Representation-Aware State Space Model for Light Field Super-Resolution
-
More Rounds, More Noise: Why Multi-Turn Review Fails to Improve Cross-Context Verification
-
How to Utilize Complementary Vision-Text Information for 2D Structure Understanding
-
Synergizing Deep Learning and Biological Heuristics for Extreme Long-Tail White Blood Cell Classification
-
Visual Prompt Discovery via Semantic Exploration
-
Grounding the Score: Explicit Visual Premise Verification for Reliable Vision-Language Process Reward Models
MongoDB - Build AI That Scales
