Papers by Bin Chen
-
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
-
InstaVSR: Taming Diffusion for Efficient and Temporally Consistent Video Super-Resolution
-
VQ-Jarvis: Retrieval-Augmented Video Restoration Agent with Sharp Vision and Fast Thought
-
PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment
-
OARS: Process-Aware Online Alignment for Generative Real-World Image Super-Resolution
-
GLM-OCR Technical Report
-
Looking Back and Forth: Cross-Image Attention Calibration and Attentive Preference Learning for Multi-Image Hallucination Mitigation
Beijing Institute of Technology, Harbin Institute of Technology, Tsinghua University
-
GLM-5: from Vibe Coding to Agentic Engineering
