Zehui Chen

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Jun 06, 2024
Via arXiv

Are We on the Right Way for Evaluating Large Vision-Language Models?

Apr 09, 2024
Via arXiv

PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition

Mar 26, 2024
Via arXiv

InternLM2 Technical Report

Mar 26, 2024
Via arXiv

Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection

Mar 25, 2024
Via arXiv

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Mar 19, 2024
Via arXiv

A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track

Feb 27, 2024
Via arXiv

Stream Query Denoising for Vectorized HD Map Construction

Jan 18, 2024
Via arXiv

T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step

Jan 15, 2024
Via arXiv

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding

Dec 21, 2023
Via arXiv