Papers
-
VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models
-
ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation
-
Adaptive Auxiliary Prompt Blending for Target-Faithful Diffusion Generation
-
GraphiContact: Pose-aware Human-Scene Robust Contact Perception for Interactive Systems
-
cuGenOpt: A GPU-Accelerated General-Purpose Metaheuristic Framework for Combinatorial Optimization
-
Rigorous Error Certification for Neural PDE Solvers: From Empirical Residuals to Solution Guarantees
-
Meanings and Measurements: Multi-Agent Probabilistic Grounding for Vision-Language Navigation
-
Evaluating Counterfactual Strategic Reasoning in Large Language Models
-
ARIADNE: A Perception-Reasoning Synergy Framework for Trustworthy Coronary Angiography Analysis
-
DyMoE: Dynamic Expert Orchestration with Mixed-Precision Quantization for Efficient MoE Inference on Edge
-
SOL-ExecBench: Speed-of-Light Benchmarking for Real-World GPU Kernels Against Hardware Limits
-
Few-shot Acoustic Synthesis with Multimodal Flow Matching
-
Box Maze: A Process-Control Architecture for Reliable LLM Reasoning
-
MIDST Challenge at SaTML 2025: Membership Inference over Diffusion-models-based Synthetic Tabular data
-
Improving RCT-Based Treatment Effect Estimation Under Covariate Mismatch via Calibrated Alignment
-
OS-Themis: A Scalable Critic Framework for Generalist GUI Rewards
-
Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting
-
How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation
-
A Novel Solution for Zero-Day Attack Detection in IDS using Self-Attention and Jensen-Shannon Divergence in WGAN-GP
-
The Exponentially Weighted Signature
-
FASTER: Rethinking Real-Time Flow VLAs
-
kRAIG: A Natural Language-Driven Agent for Automated DataOps Pipeline Generation
-
Tinted Frames: Question Framing Blinds Vision-Language Models
-
Robustness, Cost, and Attack-Surface Concentration in Phishing Detection
-
RPiAE: A Representation-Pivoted Autoencoder Enhancing Both Image Generation and Editing
-
Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders
-
$R$-equivalence on Cubic Surfaces I: Existing Cases with Non-Trivial Universal Equivalence
-
DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising
-
LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs
-
Rethinking Vector Field Learning for Generative Segmentation
-
DriveTok: 3D Driving Scene Tokenization for Unified Multi-View Reconstruction and Understanding
-
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
-
Online Learning and Equilibrium Computation with Ranking Feedback
-
Spectrally-Guided Diffusion Noise Schedules
-
F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World
-
EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing
-
FinTradeBench: A Financial Reasoning Benchmark for LLMs
-
Under One Sun: Multi-Object Generative Perception of Materials and Illumination
-
SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing
-
Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer
-
NavTrust: Benchmarking Trustworthiness for Embodied Navigation
-
MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction
-
Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens
-
Matryoshka Gaussian Splatting
-
Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding
-
Warm-Start Flow Matching for Guaranteed Fast Text/Image Generation
-
AURORA: Adaptive Unified Representation for Robust Ultrasound Analysis
-
Factored Levenberg-Marquardt for Diffeomorphic Image Registration: An efficient optimizer for FireANTs
-
Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents
-
Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection
MongoDB - Build AI That Scales
