Yushi Hu

Decoding-Time Language Model Alignment with Multiple Objectives

Jun 27, 2024
Via arXiv

Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation

Jun 24, 2024
Via arXiv

Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Jun 13, 2024
Via arXiv

BLINK: Multimodal Large Language Models Can See but Not Perceive

Apr 18, 2024
Via arXiv

Training Language Models to Generate Text with Citations via Fine-grained Rewards

Feb 06, 2024
Via arXiv

Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models

Dec 05, 2023
Via arXiv

DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback

Nov 29, 2023
Via arXiv

Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation

Oct 30, 2023
Via arXiv

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Jun 02, 2023
Via arXiv

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Mar 28, 2023
Via arXiv