Guohai Xu

An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation

Nov 13, 2023
Via arXiv

UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model

Oct 08, 2023
Via arXiv

Evaluation and Analysis of Hallucination in Large Vision-Language Models

Aug 29, 2023
Via arXiv

CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility

Jul 19, 2023
Via arXiv

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Jul 04, 2023
Via arXiv

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks

Jun 07, 2023
Via arXiv

Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering

May 21, 2023
Via arXiv

AMTSS: An Adaptive Multi-Teacher Single-Student Knowledge Distillation Framework For Multilingual Language Inference

May 13, 2023
Via arXiv

ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human

Apr 28, 2023
Via arXiv

mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality

Apr 27, 2023
Via arXiv