Picture for Zilu Guo

Zilu Guo

DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation

Add code
Mar 14, 2025
Figure 1 for DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation
Figure 2 for DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation
Figure 3 for DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation
Figure 4 for DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation
Viaarxiv icon

PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models

Add code
Mar 13, 2025
Figure 1 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Figure 2 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Figure 3 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Figure 4 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Viaarxiv icon

EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion

Add code
Nov 23, 2024
Figure 1 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 2 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 3 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 4 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Viaarxiv icon

DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation

Add code
Jun 06, 2024
Figure 1 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Figure 2 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Figure 3 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Figure 4 for DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation
Viaarxiv icon

A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition

Add code
May 27, 2024
Figure 1 for A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition
Figure 2 for A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition
Figure 3 for A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition
Figure 4 for A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition
Viaarxiv icon

Quality-aware Masked Diffusion Transformer for Enhanced Music Generation

Add code
May 24, 2024
Figure 1 for Quality-aware Masked Diffusion Transformer for Enhanced Music Generation
Figure 2 for Quality-aware Masked Diffusion Transformer for Enhanced Music Generation
Figure 3 for Quality-aware Masked Diffusion Transformer for Enhanced Music Generation
Figure 4 for Quality-aware Masked Diffusion Transformer for Enhanced Music Generation
Viaarxiv icon

Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning

Add code
Sep 17, 2023
Figure 1 for Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning
Figure 2 for Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning
Figure 3 for Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning
Figure 4 for Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning
Viaarxiv icon

Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement

Add code
Jun 14, 2023
Figure 1 for Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement
Figure 2 for Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement
Figure 3 for Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement
Figure 4 for Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement
Viaarxiv icon