Picture for Rong-Cheng Tu

Rong-Cheng Tu

AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization

Add code
Aug 06, 2025
Viaarxiv icon

Intra-Trajectory Consistency for Reward Modeling

Add code
Jun 10, 2025
Viaarxiv icon

A Survey of Automatic Evaluation Methods on Text, Visual and Speech Generations

Add code
Jun 06, 2025
Viaarxiv icon

MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval

Add code
May 26, 2025
Viaarxiv icon

Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval

Add code
May 26, 2025
Viaarxiv icon

VORTA: Efficient Video Diffusion via Routing Sparse Attention

Add code
May 24, 2025
Viaarxiv icon

T2I-Eval-R1: Reinforcement Learning-Driven Reasoning for Interpretable Text-to-Image Evaluation

Add code
May 23, 2025
Viaarxiv icon

Robust Distribution Alignment for Industrial Anomaly Detection under Distribution Shift

Add code
Mar 19, 2025
Viaarxiv icon

AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration

Add code
Dec 16, 2024
Viaarxiv icon

Distribution-Consistency-Guided Multi-modal Hashing

Add code
Dec 15, 2024
Figure 1 for Distribution-Consistency-Guided Multi-modal Hashing
Figure 2 for Distribution-Consistency-Guided Multi-modal Hashing
Figure 3 for Distribution-Consistency-Guided Multi-modal Hashing
Figure 4 for Distribution-Consistency-Guided Multi-modal Hashing
Viaarxiv icon