Picture for Yu Qiao

Yu Qiao

ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, SIAT Branch, Shenzhen Institute of Artificial Intelligence and Robotics for Society

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Add code
Aug 05, 2024
Figure 1 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Figure 2 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Figure 3 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Figure 4 for MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Viaarxiv icon

DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving

Add code
Aug 01, 2024
Figure 1 for DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
Figure 2 for DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
Figure 3 for DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
Figure 4 for DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
Viaarxiv icon

Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model

Add code
Jul 24, 2024
Figure 1 for Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Figure 2 for Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Figure 3 for Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Figure 4 for Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Viaarxiv icon

Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models

Add code
Jul 23, 2024
Viaarxiv icon

MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity

Add code
Jul 22, 2024
Figure 1 for MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity
Figure 2 for MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity
Figure 3 for MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity
Figure 4 for MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity
Viaarxiv icon

ViLLa: Video Reasoning Segmentation with Large Language Model

Add code
Jul 18, 2024
Figure 1 for ViLLa: Video Reasoning Segmentation with Large Language Model
Figure 2 for ViLLa: Video Reasoning Segmentation with Large Language Model
Figure 3 for ViLLa: Video Reasoning Segmentation with Large Language Model
Figure 4 for ViLLa: Video Reasoning Segmentation with Large Language Model
Viaarxiv icon

GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity

Add code
Jul 17, 2024
Figure 1 for GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity
Figure 2 for GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity
Figure 3 for GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity
Figure 4 for GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity
Viaarxiv icon

The Better Angels of Machine Personality: How Personality Relates to LLM Safety

Add code
Jul 17, 2024
Figure 1 for The Better Angels of Machine Personality: How Personality Relates to LLM Safety
Figure 2 for The Better Angels of Machine Personality: How Personality Relates to LLM Safety
Figure 3 for The Better Angels of Machine Personality: How Personality Relates to LLM Safety
Figure 4 for The Better Angels of Machine Personality: How Personality Relates to LLM Safety
Viaarxiv icon

GRUtopia: Dream General Robots in a City at Scale

Add code
Jul 15, 2024
Viaarxiv icon

Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification

Add code
Jul 11, 2024
Figure 1 for Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
Figure 2 for Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
Figure 3 for Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
Figure 4 for Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
Viaarxiv icon