Picture for Lei He

Lei He

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Add code
Mar 05, 2024
Viaarxiv icon

SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models

Add code
Feb 06, 2024
Figure 1 for SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models
Figure 2 for SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models
Figure 3 for SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models
Figure 4 for SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models
Viaarxiv icon

A Risk-aware Planning Framework of UGVs in Off-Road Environment

Add code
Feb 04, 2024
Figure 1 for A Risk-aware Planning Framework of UGVs in Off-Road Environment
Figure 2 for A Risk-aware Planning Framework of UGVs in Off-Road Environment
Figure 3 for A Risk-aware Planning Framework of UGVs in Off-Road Environment
Figure 4 for A Risk-aware Planning Framework of UGVs in Off-Road Environment
Viaarxiv icon

StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis

Add code
Dec 19, 2023
Viaarxiv icon

ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data

Add code
Oct 10, 2023
Figure 1 for ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data
Figure 2 for ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data
Figure 3 for ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data
Figure 4 for ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data
Viaarxiv icon

Orbital AI-based Autonomous Refuelling Solution

Add code
Sep 20, 2023
Figure 1 for Orbital AI-based Autonomous Refuelling Solution
Figure 2 for Orbital AI-based Autonomous Refuelling Solution
Figure 3 for Orbital AI-based Autonomous Refuelling Solution
Figure 4 for Orbital AI-based Autonomous Refuelling Solution
Viaarxiv icon

MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023

Add code
Sep 12, 2023
Viaarxiv icon

Large-Scale Automatic Audiobook Creation

Add code
Sep 07, 2023
Figure 1 for Large-Scale Automatic Audiobook Creation
Viaarxiv icon

PromptTTS 2: Describing and Generating Voices with Text Prompt

Add code
Sep 05, 2023
Figure 1 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 2 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 3 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 4 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Viaarxiv icon

FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene

Add code
Jul 27, 2023
Figure 1 for FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene
Figure 2 for FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene
Figure 3 for FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene
Figure 4 for FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene
Viaarxiv icon