Picture for Bin Wen

Bin Wen

SpatialFlow-GRPO: Where Spatial Credit Drives Image Editing

Add code
Jun 25, 2026
Viaarxiv icon

Kwai Keye-VL-2.0 Technical Report

Add code
Jun 09, 2026
Viaarxiv icon

Beyond Generative Decoding: Discriminative Hidden-State Readout from a Native Omni-Modal LLM for Multimodal Sentiment Analysis

Add code
Jun 04, 2026
Viaarxiv icon

VCap: Hypergeometric Rewards for Weak-to-Strong Visual Captioning

Add code
May 27, 2026
Viaarxiv icon

Wan-Image: Pushing the Boundaries of Generative Visual Intelligence

Add code
Apr 21, 2026
Viaarxiv icon

Aligning Progress and Feasibility: A Neuro-Symbolic Dual Memory Framework for Long-Horizon LLM Agents

Add code
Apr 03, 2026
Viaarxiv icon

ContextRL: Enhancing MLLM's Knowledge Discovery Efficiency with Context-Augmented RL

Add code
Feb 26, 2026
Viaarxiv icon

Multi-Probe Zero Collision Hash (MPZCH): Mitigating Embedding Collisions and Enhancing Model Freshness in Large-Scale Recommenders

Add code
Feb 19, 2026
Viaarxiv icon

Breaking Data Efficiency Dilemma: A Federated and Augmented Learning Framework For Alzheimer's Disease Detection via Speech

Add code
Feb 16, 2026
Viaarxiv icon

UniRef-Image-Edit: Towards Scalable and Consistent Multi-Reference Image Editing

Add code
Feb 15, 2026
Viaarxiv icon