Bin Chen

PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment

Mar 19, 2026
Via arXiv

OARS: Process-Aware Online Alignment for Generative Real-World Image Super-Resolution

Mar 13, 2026
Via arXiv

GLM-OCR Technical Report

Mar 11, 2026
Via arXiv

From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents

Mar 02, 2026
Via arXiv

DeAR: Fine-Grained VLM Adaptation by Decomposing Attention Head Roles

Mar 01, 2026
Via arXiv

Improved Adversarial Diffusion Compression for Real-World Video Super-Resolution

Feb 28, 2026
Via arXiv

SIGMA: A Semantic-Grounded Instruction-Driven Generative Multi-Task Recommender at AliExpress

Feb 26, 2026
Via arXiv

GLM-5: from Vibe Coding to Agentic Engineering

Feb 17, 2026
Via arXiv

Detecting Brick Kiln Infrastructure at Scale: Graph, Foundation, and Remote Sensing Models for Satellite Imagery Data

Feb 12, 2026
Via arXiv

Seeing Through the Chain: Mitigate Hallucination in Multimodal Reasoning Models via CoT Compression and Contrastive Preference Optimization

Feb 03, 2026
Via arXiv