Picture for Lu Yuan

Lu Yuan

Stephen

Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting

Add code
Aug 22, 2023
Viaarxiv icon

HQ-50K: A Large-scale, High-quality Dataset for Image Restoration

Add code
Jun 08, 2023
Figure 1 for HQ-50K: A Large-scale, High-quality Dataset for Image Restoration
Figure 2 for HQ-50K: A Large-scale, High-quality Dataset for Image Restoration
Figure 3 for HQ-50K: A Large-scale, High-quality Dataset for Image Restoration
Figure 4 for HQ-50K: A Large-scale, High-quality Dataset for Image Restoration
Viaarxiv icon

Designing a Better Asymmetric VQGAN for StableDiffusion

Add code
Jun 07, 2023
Figure 1 for Designing a Better Asymmetric VQGAN for StableDiffusion
Figure 2 for Designing a Better Asymmetric VQGAN for StableDiffusion
Figure 3 for Designing a Better Asymmetric VQGAN for StableDiffusion
Figure 4 for Designing a Better Asymmetric VQGAN for StableDiffusion
Viaarxiv icon

Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

Add code
May 31, 2023
Viaarxiv icon

Image is First-order Norm+Linear Autoregressive

Add code
May 25, 2023
Viaarxiv icon

Album Storytelling with Iterative Story-aware Captioning and Large Language Models

Add code
May 24, 2023
Figure 1 for Album Storytelling with Iterative Story-aware Captioning and Large Language Models
Figure 2 for Album Storytelling with Iterative Story-aware Captioning and Large Language Models
Figure 3 for Album Storytelling with Iterative Story-aware Captioning and Large Language Models
Figure 4 for Album Storytelling with Iterative Story-aware Captioning and Large Language Models
Viaarxiv icon

i-Code Studio: A Configurable and Composable Framework for Integrative AI

Add code
May 23, 2023
Figure 1 for i-Code Studio: A Configurable and Composable Framework for Integrative AI
Figure 2 for i-Code Studio: A Configurable and Composable Framework for Integrative AI
Figure 3 for i-Code Studio: A Configurable and Composable Framework for Integrative AI
Figure 4 for i-Code Studio: A Configurable and Composable Framework for Integrative AI
Viaarxiv icon

i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data

Add code
May 21, 2023
Figure 1 for i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
Figure 2 for i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
Figure 3 for i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
Figure 4 for i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
Viaarxiv icon

ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System

Add code
Apr 29, 2023
Figure 1 for ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System
Figure 2 for ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System
Figure 3 for ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System
Figure 4 for ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System
Viaarxiv icon

OmniTracker: Unifying Object Tracking by Tracking-with-Detection

Add code
Mar 21, 2023
Viaarxiv icon