Papers
-
FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language ModelsShanghaiTech University, University of Technology Sydney
-
Multi-level meta-reinforcement learning with skill-based curriculumJohns Hopkins University
-
Granulon: Awakening Pixel-Level Visual Encoders with Adaptive Multi-Granularity Semantics for MLLMNanyang Technological University, National University of Singapore
-
Large Language Model-Assisted Superconducting Qubit ExperimentsAalto University, Rensselaer Polytechnic Institute, University of Chicago
-
The Temporal Markov Transition Field
-
Test-Driven AI Agent Definition (TDAD): Compiling Tool-Using Agents from Behavioral SpecificationsFiverrLabs
-
Where, What, Why: Toward Explainable 3D-GS WatermarkingNanyang Technological University, Southeast University, Waseda University
-
VisionCreator-R1: A Reflection-Enhanced Native Visual-Generation Agentic ModelHong Kong University of Science and Technology, Tencent Hunyuan
-
Scale-Plan: Scalable Language-Enabled Task Planning for Heterogeneous Multi-Robot TeamsHonda Research Institute, University of California, Riverside
-
Training Language Models via Neural Cellular AutomataImprobable AI Lab, Massachusetts Institute of Technology
-
Beyond Relevance: On the Relationship Between Retrieval and RAG Information CoverageJohns Hopkins University, National Institute of Standards and Technology, University of New Hampshire
-
Fish Audio S2 Technical Report
-
SoftJAX & SoftTorch: Empowering Automatic Differentiation Libraries with Informative GradientsMasaryk University, Max Planck Institute for Biogeochemistry, Max Planck Institute for Intelligent Systems, University of Tübingen
-
Are Expressive Encoders Necessary for Discrete Graph Generation?Michigan State University, University of Texas at Arlington
-
Computer Vision-Based Vehicle Allotment System using Perspective MappingNational Institute of Technology Rourkela
-
MASEval: Extending Multi-Agent Evaluation from Models to SystemsParameter Lab / Korea Advanced Institute of Science and Technology, Mohamed bin Zayed University of Artificial Intelligence, NAVER AI Lab, TU Darmstadt, University of Oxford, University of Tübingen
-
A Lightweight Multi-Cancer Tumor Localization Framework for Deployable Digital PathologyIndiana University, University of Pittsburgh, UPMC Hillman Cancer Center
-
HECTOR: Hybrid Editable Compositional Object References for Video Generation
-
SBOMs into Agentic AIBOMs: Schema Extensions, Agentic Orchestration, and Reproducibility EvaluationAlan Turing Institute, University of Oxford
-
LDP: An Identity-Aware Protocol for Multi-Agent LLM SystemsIndian School of Business
-
Unpacking Interpretability: Human-Centered Criteria for Optimal Combinatorial SolutionsTechnical University of Darmstadt, University of Vienna
-
Expressivity-Efficiency Tradeoffs for Hybrid Sequence ModelsUniversity of Wisconsin-Madison
-
APPLV: Adaptive Planner Parameter Learning from Vision-Language-Action ModelGeorge Mason University, Rutgers University, University of South Florida
-
Why Channel-Centric Models are not Enough to Predict End-to-End Performance in Private 5G: A Measurement Campaign and Case StudyKTH Royal Institute of Technology
-
One Language, Two Scripts: Probing Script-Invariance in LLM Concept RepresentationsColumbia University
-
Quantifying the Accuracy and Cost Impact of Design Decisions in Budget-Constrained Agentic LLM SearchLouisiana State University
-
MultiGraSCCo: A Multilingual Anonymization Benchmark with Annotations of Personal IdentifiersBerlin Institute for the Foundations of Learning and Data, Charité – Universitätsmedizin Berlin, German Research Center for Artificial Intelligence, Humboldt-Universität zu Berlin, Technische Universitat Berlin, University of Potsdam
-
From Word2Vec to Transformers: Text-Derived Composition Embeddings for Filtering Combinatorial ElectrocatalystsRuhr-Universität Bochum
-
Comparative Analysis of Patch Attack on VLM-Based Autonomous Driving ArchitecturesClemson University
-
Towards Visual Query Segmentation in the WildUniversity of North Texas
-
ConFu: Contemplate the Future for Better Speculative SamplingQualcomm AI Research, University of California
-
A New Modeling to Feature Selection Based on the Fuzzy Rough Set Theory in Normal and Optimistic States on Hybrid Information SystemsIslamic Azad University
-
NetDiffuser: Deceiving DNN-Based Network Attack Detection Systems with Diffusion-Generated Adversarial TrafficNew Mexico State University, The U.S. Army Combat Capabilities Development Command, University of Hartford
-
Multi-Kernel Gated Decoder Adapters for Robust Multi-Task Thyroid Ultrasound under Cross-Center ShiftBC Cancer Research Institute, University of British Columbia
-
Cross-Domain Uncertainty Quantification for Selective Prediction: A Comprehensive Bound Ablation with Transfer-Informed Betting
-
SciTaRC: Benchmarking QA on Scientific Tabular Data that Requires Language Reasoning and Complex ComputationJohns Hopkins University
-
FedLECC: Cluster- and Loss-Guided Client Selection for Federated Learning under Non-IID DataSapienza University of Rome
-
Quantifying Memorization and Privacy Risks in Genomic Language ModelsCase Western Reserve University, Rutgers University, University of Texas
-
Uncovering a Winning Lottery Ticket with Continuously Relaxed Bernoulli GatesBar-Ilan University
-
Vision-Language Models Encode Clinical Guidelines for Concept-Based Medical ReasoningMcMaster University, Queen’s University, University of British Columbia, Vector Institute
-
Tool Receipts, Not Zero-Knowledge Proofs: Practical Hallucination Detection for AI Agents
-
Quantifying Uncertainty in AI Visibility: A Statistical Framework for Generative Search Measurement
-
MEGC2026: Micro-Expression Grand Challenge on Visual Question AnsweringHeriot-Watt University Malaysia, Manchester Metropolitan University, University of Chinese Academy of Sciences
-
TIDE: Text-Informed Dynamic Extrapolation with Step-Aware Temperature Control for Diffusion TransformersRice University, University of Houston
-
Using Vision Language Foundation Models to Generate Plant Simulation Configurations via In-Context LearningUniversity of California
-
Optimizing Reinforcement Learning Training over Digital Twin Enabled Multi-fidelity NetworksThe Institute of Electrical and Electronics Engineers
-
Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality AssuranceOld Dominion University
-
PathoScribe: Transforming Pathology Data into a Living Library with a Unified LLM-Driven Framework for Semantic Retrieval and Clinical Integration
-
VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMsThe University of Sheffield, University of Southern California
-
Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges AheadGeorgia Institute of Technology, University of California
MongoDB - Build AI That Scales
