Picture for Baotian Hu

Baotian Hu

VideoVista: A Versatile Benchmark for Video Understanding and Reasoning

Add code
Jun 17, 2024
Viaarxiv icon

Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts

Add code
May 18, 2024
Viaarxiv icon

VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context

Add code
May 08, 2024
Figure 1 for VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context
Figure 2 for VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context
Figure 3 for VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context
Figure 4 for VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context
Viaarxiv icon

In-Context Learning State Vector with Inner and Momentum Optimization

Add code
Apr 17, 2024
Viaarxiv icon

Improving Attributed Text Generation of Large Language Models via Preference Learning

Add code
Mar 27, 2024
Figure 1 for Improving Attributed Text Generation of Large Language Models via Preference Learning
Figure 2 for Improving Attributed Text Generation of Large Language Models via Preference Learning
Figure 3 for Improving Attributed Text Generation of Large Language Models via Preference Learning
Figure 4 for Improving Attributed Text Generation of Large Language Models via Preference Learning
Viaarxiv icon

SelectIT: Selective Instruction Tuning for Large Language Models via Uncertainty-Aware Self-Reflection

Add code
Feb 26, 2024
Viaarxiv icon

Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer

Add code
Feb 22, 2024
Figure 1 for Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer
Figure 2 for Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer
Figure 3 for Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer
Figure 4 for Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer
Viaarxiv icon

Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment

Add code
Feb 21, 2024
Viaarxiv icon

A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation

Add code
Feb 21, 2024
Figure 1 for A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation
Figure 2 for A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation
Figure 3 for A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation
Figure 4 for A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation
Viaarxiv icon

Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training

Add code
Dec 29, 2023
Figure 1 for Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training
Figure 2 for Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training
Figure 3 for Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training
Figure 4 for Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training
Viaarxiv icon