Picture for Chunhua Shen

Chunhua Shen

The University of Adelaide

DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data

Add code
May 16, 2024
Viaarxiv icon

VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization

Add code
Apr 30, 2024
Figure 1 for VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Figure 2 for VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Figure 3 for VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Figure 4 for VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Viaarxiv icon

Deepfake Generation and Detection: A Benchmark and Survey

Add code
Apr 09, 2024
Figure 1 for Deepfake Generation and Detection: A Benchmark and Survey
Figure 2 for Deepfake Generation and Detection: A Benchmark and Survey
Figure 3 for Deepfake Generation and Detection: A Benchmark and Survey
Figure 4 for Deepfake Generation and Detection: A Benchmark and Survey
Viaarxiv icon

Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model

Add code
Mar 19, 2024
Figure 1 for Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model
Figure 2 for Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model
Figure 3 for Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model
Figure 4 for Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model
Viaarxiv icon

3D Human Reconstruction in the Wild with Synthetic Data Using Generative Models

Add code
Mar 17, 2024
Viaarxiv icon

Diffusion Models Trained with Large Data Are Transferable Visual Models

Add code
Mar 15, 2024
Figure 1 for Diffusion Models Trained with Large Data Are Transferable Visual Models
Figure 2 for Diffusion Models Trained with Large Data Are Transferable Visual Models
Figure 3 for Diffusion Models Trained with Large Data Are Transferable Visual Models
Figure 4 for Diffusion Models Trained with Large Data Are Transferable Visual Models
Viaarxiv icon

VisionLLaMA: A Unified LLaMA Interface for Vision Tasks

Add code
Mar 01, 2024
Viaarxiv icon

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

Add code
Feb 06, 2024
Viaarxiv icon

MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices

Add code
Dec 30, 2023
Figure 1 for MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
Figure 2 for MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
Figure 3 for MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
Figure 4 for MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
Viaarxiv icon

GenDeF: Learning Generative Deformation Field for Video Generation

Add code
Dec 07, 2023
Figure 1 for GenDeF: Learning Generative Deformation Field for Video Generation
Figure 2 for GenDeF: Learning Generative Deformation Field for Video Generation
Figure 3 for GenDeF: Learning Generative Deformation Field for Video Generation
Figure 4 for GenDeF: Learning Generative Deformation Field for Video Generation
Viaarxiv icon