Picture for Heng Wang

Heng Wang

Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

Add code
Mar 05, 2024
Figure 1 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Figure 2 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Figure 3 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Figure 4 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Viaarxiv icon

Hy-DAT: A Tool to Address Hydropower Modeling Gaps Using Interdependency, Efficiency Curves, and Unit Dispatch Models

Add code
Mar 05, 2024
Figure 1 for Hy-DAT: A Tool to Address Hydropower Modeling Gaps Using Interdependency, Efficiency Curves, and Unit Dispatch Models
Figure 2 for Hy-DAT: A Tool to Address Hydropower Modeling Gaps Using Interdependency, Efficiency Curves, and Unit Dispatch Models
Figure 3 for Hy-DAT: A Tool to Address Hydropower Modeling Gaps Using Interdependency, Efficiency Curves, and Unit Dispatch Models
Figure 4 for Hy-DAT: A Tool to Address Hydropower Modeling Gaps Using Interdependency, Efficiency Curves, and Unit Dispatch Models
Viaarxiv icon

DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection

Add code
Feb 16, 2024
Figure 1 for DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
Figure 2 for DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
Figure 3 for DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
Figure 4 for DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
Viaarxiv icon

Video Recognition in Portrait Mode

Add code
Dec 21, 2023
Figure 1 for Video Recognition in Portrait Mode
Figure 2 for Video Recognition in Portrait Mode
Figure 3 for Video Recognition in Portrait Mode
Figure 4 for Video Recognition in Portrait Mode
Viaarxiv icon

Shot2Story20K: A New Benchmark for Comprehensive Understanding of Multi-shot Videos

Add code
Dec 19, 2023
Viaarxiv icon

Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens

Add code
Dec 12, 2023
Figure 1 for Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens
Figure 2 for Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens
Figure 3 for Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens
Figure 4 for Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens
Viaarxiv icon

InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models

Add code
Dec 04, 2023
Figure 1 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 2 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 3 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 4 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Viaarxiv icon

GPT-4V as a Generalist Evaluator for Vision-Language Tasks

Add code
Nov 02, 2023
Viaarxiv icon

Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models

Add code
Oct 04, 2023
Figure 1 for Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Figure 2 for Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Figure 3 for Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Figure 4 for Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Viaarxiv icon

Resolving Knowledge Conflicts in Large Language Models

Add code
Oct 02, 2023
Figure 1 for Resolving Knowledge Conflicts in Large Language Models
Figure 2 for Resolving Knowledge Conflicts in Large Language Models
Figure 3 for Resolving Knowledge Conflicts in Large Language Models
Figure 4 for Resolving Knowledge Conflicts in Large Language Models
Viaarxiv icon