Bohan Li

On the Effectiveness of Acoustic BPE in Decoder-Only TTS

Jul 04, 2024

Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion

Jul 02, 2024

Extreme Video Compression with Pre-trained Diffusion Models

Feb 14, 2024

Closed-Loop Unsupervised Representation Disentanglement with β-VAE Distillation and Diffusion Probabilistic Feedback

Feb 04, 2024

Self-Supervised Dynamic Hypergraph Recommendation based on Hyper-Relational Knowledge Graph

Aug 15, 2023

One at A Time: Multi-step Volumetric Probability Distribution Diffusion for Depth Estimation

Jul 07, 2023

EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model

Jun 20, 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

Jun 05, 2023

MDD-Enabled Two-Tier Terahertz Fronthaul in Indoor Industrial Cell-Free Massive MIMO

May 10, 2023

NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation

Apr 22, 2023