Picture for Heng Wang

Heng Wang

Video Recognition in Portrait Mode

Add code
Dec 21, 2023
Viaarxiv icon

Shot2Story20K: A New Benchmark for Comprehensive Understanding of Multi-shot Videos

Add code
Dec 19, 2023
Viaarxiv icon

Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens

Add code
Dec 12, 2023
Viaarxiv icon

InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models

Add code
Dec 04, 2023
Figure 1 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 2 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 3 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 4 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Viaarxiv icon

GPT-4V as a Generalist Evaluator for Vision-Language Tasks

Add code
Nov 02, 2023
Viaarxiv icon

Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models

Add code
Oct 04, 2023
Figure 1 for Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Figure 2 for Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Figure 3 for Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Figure 4 for Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Viaarxiv icon

Resolving Knowledge Conflicts in Large Language Models

Add code
Oct 02, 2023
Viaarxiv icon

The Devil is in the Details: A Deep Dive into the Rabbit Hole of Data Filtering

Add code
Sep 27, 2023
Viaarxiv icon

Advancements in 3D Lane Detection Using LiDAR Point Clouds: From Data Collection to Model Development

Add code
Sep 24, 2023
Figure 1 for Advancements in 3D Lane Detection Using LiDAR Point Clouds: From Data Collection to Model Development
Figure 2 for Advancements in 3D Lane Detection Using LiDAR Point Clouds: From Data Collection to Model Development
Figure 3 for Advancements in 3D Lane Detection Using LiDAR Point Clouds: From Data Collection to Model Development
Figure 4 for Advancements in 3D Lane Detection Using LiDAR Point Clouds: From Data Collection to Model Development
Viaarxiv icon

Dataset Condensation via Generative Model

Add code
Sep 14, 2023
Viaarxiv icon