Picture for Jiuhai Chen

Jiuhai Chen

LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer

Add code
Jun 08, 2025
Viaarxiv icon

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Add code
May 14, 2025
Viaarxiv icon

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Add code
Apr 10, 2025
Viaarxiv icon

Transfer between Modalities with MetaQueries

Add code
Apr 08, 2025
Viaarxiv icon

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Add code
Dec 05, 2024
Figure 1 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Figure 2 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Figure 3 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Figure 4 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Viaarxiv icon

Multi-Objective Linguistic Control of Large Language Models

Add code
Jun 23, 2024
Figure 1 for Multi-Objective Linguistic Control of Large Language Models
Figure 2 for Multi-Objective Linguistic Control of Large Language Models
Figure 3 for Multi-Objective Linguistic Control of Large Language Models
Figure 4 for Multi-Objective Linguistic Control of Large Language Models
Viaarxiv icon

GenQA: Generating Millions of Instructions from a Handful of Prompts

Add code
Jun 14, 2024
Figure 1 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 2 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 3 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 4 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Viaarxiv icon

OPTune: Efficient Online Preference Tuning

Add code
Jun 11, 2024
Figure 1 for OPTune: Efficient Online Preference Tuning
Figure 2 for OPTune: Efficient Online Preference Tuning
Figure 3 for OPTune: Efficient Online Preference Tuning
Figure 4 for OPTune: Efficient Online Preference Tuning
Viaarxiv icon

Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement

Add code
May 29, 2024
Figure 1 for Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement
Figure 2 for Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement
Figure 3 for Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement
Figure 4 for Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement
Viaarxiv icon

Automated Data Curation for Robust Language Model Fine-Tuning

Add code
Mar 19, 2024
Figure 1 for Automated Data Curation for Robust Language Model Fine-Tuning
Figure 2 for Automated Data Curation for Robust Language Model Fine-Tuning
Figure 3 for Automated Data Curation for Robust Language Model Fine-Tuning
Figure 4 for Automated Data Curation for Robust Language Model Fine-Tuning
Viaarxiv icon