Picture for Weihua Luo

Weihua Luo

AI Business, Alibaba Group

Rethinking Multilingual Vision-Language Translation: Dataset, Evaluation, and Adaptation

Add code
Jun 13, 2025
Viaarxiv icon

ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Add code
Jun 11, 2025
Viaarxiv icon

ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Add code
Jun 05, 2025
Viaarxiv icon

Multimodal Tabular Reasoning with Privileged Structured Information

Add code
Jun 04, 2025
Viaarxiv icon

TransBench: Benchmarking Machine Translation for Industrial-Scale Applications

Add code
May 20, 2025
Viaarxiv icon

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Add code
May 08, 2025
Viaarxiv icon

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Add code
May 05, 2025
Viaarxiv icon

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Add code
Apr 22, 2025
Viaarxiv icon

The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report

Add code
Apr 14, 2025
Viaarxiv icon

A Unified Agentic Framework for Evaluating Conditional Image Generation

Add code
Apr 09, 2025
Viaarxiv icon