Picture for Hongxu Yin

Hongxu Yin

Celine

$VILA^2$: VILA Augmented VILA

Add code
Jul 24, 2024
Viaarxiv icon

Flextron: Many-in-One Flexible Large Language Model

Add code
Jun 11, 2024
Figure 1 for Flextron: Many-in-One Flexible Large Language Model
Figure 2 for Flextron: Many-in-One Flexible Large Language Model
Figure 3 for Flextron: Many-in-One Flexible Large Language Model
Figure 4 for Flextron: Many-in-One Flexible Large Language Model
Viaarxiv icon

Step Out and Seek Around: On Warm-Start Training with Incremental Data

Add code
Jun 06, 2024
Figure 1 for Step Out and Seek Around: On Warm-Start Training with Incremental Data
Figure 2 for Step Out and Seek Around: On Warm-Start Training with Incremental Data
Figure 3 for Step Out and Seek Around: On Warm-Start Training with Incremental Data
Figure 4 for Step Out and Seek Around: On Warm-Start Training with Incremental Data
Viaarxiv icon

SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model

Add code
Jun 03, 2024
Figure 1 for SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model
Figure 2 for SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model
Figure 3 for SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model
Figure 4 for SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model
Viaarxiv icon

X-VILA: Cross-Modality Alignment for Large Language Model

Add code
May 29, 2024
Figure 1 for X-VILA: Cross-Modality Alignment for Large Language Model
Figure 2 for X-VILA: Cross-Modality Alignment for Large Language Model
Figure 3 for X-VILA: Cross-Modality Alignment for Large Language Model
Figure 4 for X-VILA: Cross-Modality Alignment for Large Language Model
Viaarxiv icon

LITA: Language Instructed Temporal-Localization Assistant

Add code
Mar 27, 2024
Figure 1 for LITA: Language Instructed Temporal-Localization Assistant
Figure 2 for LITA: Language Instructed Temporal-Localization Assistant
Figure 3 for LITA: Language Instructed Temporal-Localization Assistant
Figure 4 for LITA: Language Instructed Temporal-Localization Assistant
Viaarxiv icon

RegionGPT: Towards Region Understanding Vision Language Model

Add code
Mar 04, 2024
Figure 1 for RegionGPT: Towards Region Understanding Vision Language Model
Figure 2 for RegionGPT: Towards Region Understanding Vision Language Model
Figure 3 for RegionGPT: Towards Region Understanding Vision Language Model
Figure 4 for RegionGPT: Towards Region Understanding Vision Language Model
Viaarxiv icon

DoRA: Weight-Decomposed Low-Rank Adaptation

Add code
Feb 14, 2024
Viaarxiv icon

VILA: On Pre-training for Visual Language Models

Add code
Dec 14, 2023
Figure 1 for VILA: On Pre-training for Visual Language Models
Figure 2 for VILA: On Pre-training for Visual Language Models
Figure 3 for VILA: On Pre-training for Visual Language Models
Figure 4 for VILA: On Pre-training for Visual Language Models
Viaarxiv icon

FedBPT: Efficient Federated Black-box Prompt Tuning for Large Language Models

Add code
Oct 02, 2023
Figure 1 for FedBPT: Efficient Federated Black-box Prompt Tuning for Large Language Models
Figure 2 for FedBPT: Efficient Federated Black-box Prompt Tuning for Large Language Models
Figure 3 for FedBPT: Efficient Federated Black-box Prompt Tuning for Large Language Models
Figure 4 for FedBPT: Efficient Federated Black-box Prompt Tuning for Large Language Models
Viaarxiv icon