Picture for Dit-Yan Yeung

Dit-Yan Yeung

RoboDreamer: Learning Compositional World Models for Robot Imagination

Add code
Apr 18, 2024
Viaarxiv icon

Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases

Add code
Apr 16, 2024
Figure 1 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Figure 2 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Figure 3 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Figure 4 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Viaarxiv icon

Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation

Add code
Mar 22, 2024
Figure 1 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 2 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 3 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 4 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Viaarxiv icon

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

Mar 20, 2024
Figure 1 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Figure 2 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Figure 3 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Figure 4 for DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Viaarxiv icon

TransformMix: Learning Transformation and Mixing Strategies from Data

Mar 19, 2024
Figure 1 for TransformMix: Learning Transformation and Mixing Strategies from Data
Figure 2 for TransformMix: Learning Transformation and Mixing Strategies from Data
Figure 3 for TransformMix: Learning Transformation and Mixing Strategies from Data
Figure 4 for TransformMix: Learning Transformation and Mixing Strategies from Data
Viaarxiv icon

Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning

Add code
Dec 19, 2023
Viaarxiv icon

TrackDiffusion: Multi-object Tracking Data Generation via Diffusion Models

Add code
Dec 01, 2023
Viaarxiv icon

Gaussian Shell Maps for Efficient 3D Human Generation

Add code
Nov 29, 2023
Viaarxiv icon

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

Add code
Oct 20, 2023
Figure 1 for Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis
Figure 2 for Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis
Figure 3 for Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis
Figure 4 for Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis
Viaarxiv icon

Towards General Error Diagnosis via Behavioral Testing in Machine Translation

Add code
Oct 20, 2023
Figure 1 for Towards General Error Diagnosis via Behavioral Testing in Machine Translation
Figure 2 for Towards General Error Diagnosis via Behavioral Testing in Machine Translation
Figure 3 for Towards General Error Diagnosis via Behavioral Testing in Machine Translation
Figure 4 for Towards General Error Diagnosis via Behavioral Testing in Machine Translation
Viaarxiv icon