Papers
-
ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning
-
TumorChain: Interleaved Multimodal Chain-of-Thought Reasoning for Traceable Clinical Tumor Analysis
-
PatchCue: Enhancing Vision-Language Model Reasoning with Patch-Based Visual Cues
-
PolyBlocks: A Compiler Infrastructure for AI Chips and Programming Frameworks
-
Shifting Adaptation from Weight Space to Memory Space: A Memory-Augmented Agent for Medical Image Segmentation
-
Stochastic Event Prediction via Temporal Motif Transitions
-
Systematic Evaluation of Novel View Synthesis for Video Place Recognition
-
ROSE: Reordered SparseGPT for More Accurate One-Shot Large Language Models Pruning
-
Confidence Before Answering: A Paradigm Shift for Efficient LLM Uncertainty Estimation
-
CylinderSplat: 3D Gaussian Splatting with Cylindrical Triplanes for Panoramic Novel View Synthesis
-
VerChol -- Grammar-First Tokenization for Agglutinative Languages
-
Computational Pathology in the Era of Emerging Foundation and Agentic AI -- International Expert Perspectives on Clinical Integration and Translational Readiness
-
HERO: Hierarchical Embedding-Refinement for Open-Vocabulary Temporal Sentence Grounding in Videos
-
Reconstruct! Don't Encode: Self-Supervised Representation Reconstruction Loss for High-Intelligibility and Low-Latency Streaming Neural Audio Codec
-
PixARMesh: Autoregressive Mesh-Native Single-View Scene Reconstruction
-
Lost in Stories: Consistency Bugs in Long Story Generation by LLMs
-
Building an Ensemble LLM Semantic Tagger for UN Security Council Resolutions
-
InnoAds-Composer: Efficient Condition Composition for E-Commerce Poster Generation
-
Mitigating Bias in Concept Bottleneck Models for Fair and Interpretable Image Classification
-
Reference-guided Policy Optimization for Molecular Optimization via LLM Reasoning
-
Calibrated Credit Intelligence: Shift-Robust and Fair Risk Scoring with Bayesian Uncertainty and Gradient Boosting
-
LUMINA: LLM-Guided GPU Architecture Exploration via Bottleneck Analysis
-
CollabOD: Collaborative Multi-Backbone with Cross-scale Vision for UAV Small Object Detection
-
Beyond Geometry: Artistic Disparity Synthesis for Immersive 2D-to-3D
-
Pano3DComposer: Feed-Forward Compositional 3D Scene Generation from Single Panoramic Image
-
InfoGatherer: Principled Information Seeking via Evidence Retrieval and Strategic Questioning
-
The World Won't Stay Still: Programmable Evolution for Agent Benchmarks
-
CORE-Seg: Reasoning-Driven Segmentation for Complex Lesions via Reinforcement Learning
-
DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality
-
Stock Market Prediction Using Node Transformer Architecture Integrated with BERT Sentiment Analysis
-
Design Experiments to Compare Multi-armed Bandit Algorithms
-
BlackMirror: Black-Box Backdoor Detection for Text-to-Image Models via Instruction-Response Deviation
-
Learning Next Action Predictors from Human-Computer Interaction
-
Weak-SIGReg: Covariance Regularization for Stable Deep Learning
-
RAC: Rectified Flow Auto Coder
-
Towards Driver Behavior Understanding: Weakly-Supervised Risk Perception in Driving Scenes
-
Addressing the Ecological Fallacy in Larger LMs with Human ContextStony Brook University, Vanderbilt University
-
Beyond Static Frames: Temporal Aggregate-and-Restore Vision Transformer for Human Pose EstimationZhejiang Gongshang University
-
A Persistent-State Dataflow Accelerator for Memory-Bound Linear Attention Decode on FPGAUniversity of Southern California
-
FTSplat: Feed-forward Triangle Splatting NetworkNankai University
-
Implicit Style Conditioning: A Structured Style-Rewrite Framework for Low-Resource Character ModelingGuangdong University of Finance
-
OD-RASE: Ontology-Driven Risk Assessment and Safety Enhancement for Autonomous DrivingChubu University
-
Facial Expression Recognition Using Residual Masking NetworkHo Chi Minh City University of Technology
-
SLER-IR: Spherical Layer-wise Expert Routing for All-in-One Image RestorationSichuan University, University of California, San Diego
-
XAI for Coding Agent Failures: Transforming Raw Execution Traces into Actionable InsightsIslington College
-
Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew EstimationHo Chi Minh City University of Technology, Vietnam National University Ho Chi Minh City
-
Vessel-Aware Deep Learning for OCTA-Based Detection of AMDStony Brook University
-
LucidNFT: LR-Anchored Multi-Reward Preference Optimization for Generative Real-World Super-ResolutionThe Hong Kong University of Science and Technology
-
Energy-Driven Adaptive Visual Token Pruning for Efficient Vision-Language ModelsThe Hong Kong University of Science and Technology
-
Unify the Views: View-Consistent Prototype Learning for Few-Shot SegmentationTongji University
