Picture for Yushi Hu

Yushi Hu

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Unified Text-Image Generation with Weakness-Targeted Post-Training

Add code
Jan 07, 2026
Viaarxiv icon

GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation

Add code
Dec 18, 2025
Figure 1 for GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation
Figure 2 for GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation
Figure 3 for GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation
Figure 4 for GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation
Viaarxiv icon

Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image

Add code
Dec 18, 2025
Viaarxiv icon

MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation

Add code
May 23, 2025
Figure 1 for MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation
Figure 2 for MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation
Figure 3 for MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation
Figure 4 for MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation
Viaarxiv icon

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Add code
May 14, 2025
Viaarxiv icon

Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation

Add code
Apr 25, 2025
Viaarxiv icon

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Add code
Apr 24, 2025
Viaarxiv icon

Decoding-Time Language Model Alignment with Multiple Objectives

Add code
Jun 27, 2024
Figure 1 for Decoding-Time Language Model Alignment with Multiple Objectives
Figure 2 for Decoding-Time Language Model Alignment with Multiple Objectives
Figure 3 for Decoding-Time Language Model Alignment with Multiple Objectives
Figure 4 for Decoding-Time Language Model Alignment with Multiple Objectives
Viaarxiv icon

Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation

Add code
Jun 24, 2024
Viaarxiv icon