Pu Wang

MMHMR: Generative Masked Modeling for Hand Mesh Recovery

Dec 18, 2024
Via arXiv

Disentangled-Transformer: An Explainable End-to-End Automatic Speech Recognition Model with Speech Content-Context Separation

Nov 26, 2024
Via arXiv

ControlMM: Controllable Masked Motion Generation

Oct 14, 2024
Via arXiv

MLLM-FL: Multimodal Large Language Model Assisted Federated Learning on Heterogeneous and Long-tailed Data

Sep 09, 2024
Via arXiv

Learning Traffic Crashes as Language: Datasets, Benchmarks, and What-if Causal Analyses

Jun 16, 2024
Via arXiv

Complex Image-Generative Diffusion Transformer for Audio Denoising

Jun 13, 2024
Via arXiv

Diffusion Gaussian Mixture Audio Denoise

Jun 13, 2024
Via arXiv

LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living

Jun 13, 2024
Via arXiv

BAMM: Bidirectional Autoregressive Motion Model

Apr 01, 2024
Via arXiv

MMM: Generative Masked Motion Model

Dec 06, 2023
Via arXiv