Papers
-
SNAP: Speaker Nulling for Artifact Projection in Speech Deepfake Detection
-
Neuronal Self-Adaptation Enhances Capacity and Robustness of Representation in Spiking Neural Networks
-
Artificial Intelligence in Experimental Approaches: Growth Hacking, Lean Startup, Design Thinking, and Agile
-
MFSR: MeanFlow Distillation for One Step Real-World Image Super Resolution
-
SWE-Next: Scalable Real-World Software Engineering Tasks for Agents
-
Can I guess where you are from? Modeling dialectal morphosyntactic similarities in Brazilian Portuguese
-
High-dimensional online learning via asynchronous decomposition: Non-divergent results, dynamic regularization, and beyond
-
Satellite-to-Street: Synthesizing Post-Disaster Views from Satellite Imagery via Generative Vision Models
-
Clinical Cognition Alignment for Gastrointestinal Diagnosis with Multimodal LLMs
-
mmWave-Diffusion: A Novel Framework for Respiration Sensing Using Observation-Anchored Conditional Diffusion Model
-
Decoupling Numerical and Structural Parameters: An Empirical Study on Adaptive Genetic Algorithms via Deep Reinforcement Learning for the Large-Scale TSP
-
NDT: Non-Differential Transformer and Its Application to Sentiment Analysis
-
High-Quality and Efficient Turbulence Mitigation with Events
-
RoboECC: Multi-Factor-Aware Edge-Cloud Collaborative Deployment for VLA Models
-
The Role and Relationship of Initialization and Densification in 3D Gaussian Splatting
-
Cross-modal Fuzzy Alignment Network for Text-Aerial Person Retrieval and A Large-scale Benchmark
-
Multi-RF Fusion with Multi-GNN Blending for Molecular Property Prediction
-
Premier: Personalized Preference Modulation with Learnable User Embedding in Text-to-Image Generation
-
Problems with Chinchilla Approach 2: Systematic Biases in IsoFLOP Parabola Fits
-
Weakly supervised multimodal segmentation of acoustic borehole images with depth-aware cross-attention
-
Reasoning Topology Matters: Network-of-Thought for Complex Reasoning Tasks
-
VSD-MOT: End-to-End Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Distillation
-
MzansiText and MzansiLM: An Open Corpus and Decoder-Only Language Model for South African Languages
-
SATTC: Structure-Aware Label-Free Test-Time Calibration for Cross-Subject EEG-to-Image Retrieval
-
Mamba Learns in Context: Structure-Aware Domain Generalization for Multi-Task Point Cloud Understanding
-
CTCal: Rethinking Text-to-Image Diffusion Models via Cross-Timestep Self-Calibration
-
Adversarial Attacks on Locally Private Graph Neural Networks
-
Modeling Epistemic Uncertainty in Social Perception via Rashomon Set Agents
-
Smart Operation Theatre: An AI-based System for Surgical Gauze Counting
-
Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping
-
Graphs RAG at Scale: Beyond Retrieval-Augmented Generation With Labeled Property Graphs and Resource Description Framework for Complex and Unknown Search Spaces
-
Evaluating Uplift Modeling under Structural Biases: Insights into Metric Stability and Model Robustness
-
OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation
-
PiLoT: Neural Pixel-to-3D Registration for UAV-based Ego and Target Geo-localization
-
Code-MIE: A Code-style Model for Multimodal Information Extraction with Scene Graph and Entity Attribute Knowledge Enhancement
-
MEMO: Human-like Crisp Edge Detection Using Masked Edge Prediction
-
ME-IQA: Memory-Enhanced Image Quality Assessment via Re-Ranking
-
Neural Autoregressive Flows for Markov Boundary Learning
-
T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search
-
The Anatomy of an Edit: Mechanism-Guided Activation Steering for Knowledge Editing
-
RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution
-
Large Neighborhood Search meets Iterative Neural Constraint Heuristics
-
Does Peer Observation Help? Vision-Sharing Collaboration for Vision-Language Navigation
-
Less is More in Semantic Space: Intrinsic Decoupling via Clifford-M for Fundus Image Classification
-
BenchBench: Benchmarking Automated Benchmark Generation
-
Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models
-
Lean Learning Beyond Clouds: Efficient Discrepancy-Conditioned Optical-SAR Fusion for Semantic Segmentation
-
GMPilot: An Expert AI Agent For FDA cGMP Compliance
-
PlanaReLoc: Camera Relocalization in 3D Planar Primitives via Region-Based Structure Matching
-
Achieving $\widetilde{O}(1/ε)$ Sample Complexity for Bilinear Systems Identification under Bounded Noises
