Picture for Yu Qiao

Yu Qiao

ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, SIAT Branch, Shenzhen Institute of Artificial Intelligence and Robotics for Society

DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model

Add code
Mar 31, 2024
Figure 1 for DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
Figure 2 for DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
Figure 3 for DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
Figure 4 for DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
Viaarxiv icon

Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence

Add code
Mar 28, 2024
Viaarxiv icon

RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents

Add code
Mar 28, 2024
Figure 1 for RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Figure 2 for RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Figure 3 for RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Figure 4 for RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Viaarxiv icon

Assessment of Multimodal Large Language Models in Alignment with Human Values

Add code
Mar 26, 2024
Figure 1 for Assessment of Multimodal Large Language Models in Alignment with Human Values
Figure 2 for Assessment of Multimodal Large Language Models in Alignment with Human Values
Figure 3 for Assessment of Multimodal Large Language Models in Alignment with Human Values
Figure 4 for Assessment of Multimodal Large Language Models in Alignment with Human Values
Viaarxiv icon

InternLM2 Technical Report

Add code
Mar 26, 2024
Figure 1 for InternLM2 Technical Report
Figure 2 for InternLM2 Technical Report
Figure 3 for InternLM2 Technical Report
Figure 4 for InternLM2 Technical Report
Viaarxiv icon

EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World

Add code
Mar 24, 2024
Figure 1 for EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
Figure 2 for EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
Figure 3 for EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
Figure 4 for EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
Viaarxiv icon

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Add code
Mar 22, 2024
Viaarxiv icon

DreamDA: Generative Data Augmentation with Diffusion Models

Add code
Mar 19, 2024
Figure 1 for DreamDA: Generative Data Augmentation with Diffusion Models
Figure 2 for DreamDA: Generative Data Augmentation with Diffusion Models
Figure 3 for DreamDA: Generative Data Augmentation with Diffusion Models
Figure 4 for DreamDA: Generative Data Augmentation with Diffusion Models
Viaarxiv icon

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Add code
Mar 19, 2024
Figure 1 for MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
Figure 2 for MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
Figure 3 for MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
Figure 4 for MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
Viaarxiv icon

AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Adversarial Visual-Instructions

Add code
Mar 14, 2024
Figure 1 for AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Adversarial Visual-Instructions
Figure 2 for AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Adversarial Visual-Instructions
Figure 3 for AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Adversarial Visual-Instructions
Figure 4 for AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Adversarial Visual-Instructions
Viaarxiv icon