Papers
- MCP-38: A Comprehensive Threat Taxonomy for Model Context Protocol Systems (v1.0)
- Directing the Narrative: A Finetuning Method for Controlling Coherence and Style in Story Generation
- GUIDE: GenAI Units In Digital Design Education
- WINFlowNets: Warm-up Integrated Networks Training of Generative Flow Networks for Robotics and Machine Fault Adaptation
- From Words to Worlds: Benchmarking Cross-Cultural Cultural Understanding in Machine Translation
- 3D MRI-Based Alzheimer's Disease Classification Using Multi-Modal 3D CNN with Leakage-Aware Subject-Level Evaluation
- Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations
- Beyond bouba/kiki: Multidimensional semantic signals are deeply woven into the fabric of natural language
- Symphony: A Cognitively-Inspired Multi-Agent System for Long-Video Understanding
- A Synthesizable RTL Implementation of Predictive Coding Networks
- ReLMXEL: Adaptive RL-Based Memory Controller with Explainable Energy and Latency Optimization
- InfoDensity: Rewarding Information-Dense Traces for Efficient Reasoning
- Ruyi2.5 Technical Report
- Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress
- A Proposal-Free Query-Guided Network for Grounded Multimodal Named Entity Recognition
- Physics-informed offline reinforcement learning eliminates catastrophic fuel waste in maritime routing
- DarkDriving: A Real-World Day and Night Aligned Dataset for Autonomous Driving in the Dark Environment
- ShuttleEnv: An Interactive Data-Driven RL Environment for Badminton Strategy Modeling
- MedSAD-CLIP: Supervised CLIP with Token-Patch Cross-Attention for Medical Anomaly Detection and Segmentation
- FineViT: Progressively Unlocking Fine-Grained Perception with Dense Recaptions
- A Progressive Visual-Logic-Aligned Framework for Ride-Hailing Adjudication
- Grid Spatial Understanding: A Dataset for Textual Spatial Reasoning over Grids, Embodied Settings, and Coordinate Structures
- EvoGuard: An Extensible Agentic RL-based Framework for Practical and Evolving AI-Generated Image Detection
- Learning Permutation Distributions via Reflected Diffusion on Ranks
- Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach Driven by Numerical and Structural Dual-Sensitivity
- OnlineHMR: Video-based Online World-Grounded Human Mesh Recovery
- PACE-RAG: Patient-Aware Contextual and Evidence-based Policy RAG for Clinical Drug Recommendation
- WebPII: Benchmarking Visual PII Detection for Computer-Use Agents
- A 3D Reconstruction Benchmark for Asset Inspection
- MCoT-MVS: Multi-level Vision Selection by Multi-modal Chain-of-Thought Reasoning for Composed Image Retrieval
- Public Profile Matters: A Scalable Integrated Approach to Recommend Citations in the Wild
- Continually self-improving AI
- Lightweight Adaptation for LLM-based Technical Service Agent: Latent Logic Augmentation and Robust Noise Reduction
- Variational Kernel Design for Internal Noise: Gaussian Chaos Noise, Representation Compatibility, and Reliable Deep Learning
- Towards Safer Large Reasoning Models by Promoting Safety Decision-Making before Chain-of-Thought Generation
- Material Magic Wand: Material-Aware Grouping of 3D Parts in Untextured Meshes
- Understanding and Defending VLM Jailbreaks via Jailbreak-Related Representation Shift
- MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation
- Interpretable Context Methodology: Folder Structure as Agentic Architecture
- Speak, Segment, Track, Navigate: An Interactive System for Video-Guided Skull-Base Surgery
- 3D tomography of exchange phase in a Si/SiGe quantum dot device
- Residual Stream Duality in Modern Transformer Architectures
- Power Analysis for Prediction-Powered Inference
- Shuffling the Stochastic Mirror Descent via Dual Lipschitz Continuity and Kernel Conditioning
- Collaborative Temporal Feature Generation via Critic-Free Reinforcement Learning for Cross-User Sensor-Based Activity Recognition
- Enhancing Linguistic Generalization of VLA: Fine-Tuning OpenVLA via Synthetic Instruction Augmentation
- POaaS: Minimal-Edit Prompt Optimization as a Service to Lift Accuracy and Cut Hallucinations on On-Device sLLMs
- The Era of End-to-End Autonomy: Transitioning from Rule-Based Driving to Large Driving Models
- A Context Alignment Pre-processor for Enhancing the Coherence of Human-LLM Dialog
- ARISE: Agent Reasoning with Intrinsic Skill Evolution in Hierarchical Reinforcement Learning
