Zhihao Fan

ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks

Oct 17, 2023
Via arXiv

Query Structure Modeling for Inductive Logical Reasoning Over Knowledge Graphs

May 23, 2023
Via arXiv

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

May 19, 2023
Via arXiv

Unifying Structure Reasoning and Language Model Pre-training for Complex Reasoning

Jan 21, 2023
Via arXiv

GENIE: Large Scale Pre-training for Text Generation with Diffusion Model

Dec 22, 2022
Via arXiv

Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report

Nov 07, 2022
Via arXiv

Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering

Aug 22, 2022
Via arXiv

A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training

Jun 11, 2022
Via arXiv

MVP: Multi-Stage Vision-Language Pre-Training via Multi-Level Semantic Alignment

Jan 29, 2022
Via arXiv

Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval

Nov 05, 2021
Via arXiv