Papers
- MemDLM: Memory-Enhanced DLM Training
- Drop-In Perceptual Optimization for 3D Gaussian Splatting
- From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents
- Confidence-Based Decoding is Provably Efficient for Diffusion Language Models
- EgoGroups: A Benchmark For Detecting Social Groups of People in the Wild
- Characterizing High-Capacity Janus Aminobenzene-Graphene Anode for Sodium-Ion Batteries with Machine Learning
- Greater accessibility can amplify discrimination in generative AI
- Efficient Universal Perception Encoder
- TiCo: Time-Controllable Training for Spoken Dialogue Models
- GenOpticalFlow: A Generative Approach to Unsupervised Optical Flow Learning
- DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution
- Decoupling Exploration and Policy Optimization: Uncertainty Guided Tree Search for Hard Exploration
- Repurposing Geometric Foundation Models for Multi-view Diffusion
- Scaling DoRA: High-Rank Adaptation via Factored Norms and Fused Kernels
- The Dual Mechanisms of Spatial Reasoning in Vision-Language Models
- 3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing
- DualCoT-VLA: Visual-Linguistic Chain of Thought via Parallel Reasoning for Vision-Language-Action Models
- ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model
- UniMotion: A Unified Framework for Motion-Text-Vision Understanding and Generation
- End-to-End Training for Unified Tokenization and Latent Denoising
- VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding
- WorldCache: Content-Aware Caching for Accelerated Video World Models
- Latent Style-based Quantum Wasserstein GAN for Drug Design
- Probabilistic modeling over permutations using quantum computers
- Computational Arbitrage in AI Model Markets
- Spatially-Aware Evaluation Framework for Aerial LiDAR Point Cloud Semantic Segmentation: Distance-Based Metrics on Challenging Regions
- OsteoFlow: Lyapunov-Guided Flow Distillation for Predicting Bone Remodeling after Mandibular Reconstruction
- Neural Structure Embedding for Symbolic Regression via Continuous Structure Search and Coefficient Optimization
- Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning
- CaP-X: A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation
- mmFHE: mmWave Sensing with End-to-End Fully Homomorphic Encryption
- Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs
- Static Scene Reconstruction from Dynamic Egocentric Videos
- Towards Automated Community Notes Generation with Large Vision Language Models for Combating Contextual Deception
- SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale
- MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding
- LLM-guided headline rewriting for clickability enhancement without clickbait
- A Theoretical Framework for Energy-Aware Gradient Pruning in Federated Learning
- Color When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video Sensing
- SPDE Methods for Nonparametric Bayesian Posterior Contraction and Laplace Approximation
- Stability-Preserving Online Adaptation of Neural Closed-loop Maps
- Wake Up to the Past: Using Memory to Model Fluid Wake Effects on Robots
- Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures
- Cognitive Training for Language Models: Towards General Capabilities via Cross-Entropy Games
- Tiny Inference-Time Scaling with Latent Verifiers
- Rashid: A Cipher-Based Framework for Exploring In-Context Language Learning
- OrgForge-IT: A Verifiable Synthetic Benchmark for LLM-Based Insider Threat Detection
- Sketch2CT: Multimodal Diffusion for Structure-Aware 3D Medical Volume Generation
- Do Large Language Models Reduce Research Novelty? Evidence from Information Systems Journals
- Hebbian Attractor Networks for Robot Locomotion
