Picture for Pan Gao

Pan Gao

HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation

Add code
Aug 20, 2025
Viaarxiv icon

PiCo: Enhancing Text-Image Alignment with Improved Noise Selection and Precise Mask Control in Diffusion Models

Add code
May 06, 2025
Figure 1 for PiCo: Enhancing Text-Image Alignment with Improved Noise Selection and Precise Mask Control in Diffusion Models
Figure 2 for PiCo: Enhancing Text-Image Alignment with Improved Noise Selection and Precise Mask Control in Diffusion Models
Figure 3 for PiCo: Enhancing Text-Image Alignment with Improved Noise Selection and Precise Mask Control in Diffusion Models
Figure 4 for PiCo: Enhancing Text-Image Alignment with Improved Noise Selection and Precise Mask Control in Diffusion Models
Viaarxiv icon

Uncertainty Guided Refinement for Fine-Grained Salient Object Detection

Add code
Apr 13, 2025
Viaarxiv icon

Uni4D: A Unified Self-Supervised Learning Framework for Point Cloud Videos

Add code
Apr 07, 2025
Viaarxiv icon

Relevance-guided Audio Visual Fusion for Video Saliency Prediction

Add code
Nov 18, 2024
Viaarxiv icon

Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds

Add code
Oct 23, 2024
Figure 1 for Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds
Figure 2 for Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds
Figure 3 for Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds
Figure 4 for Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds
Viaarxiv icon

DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer

Add code
Oct 19, 2024
Figure 1 for DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer
Figure 2 for DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer
Figure 3 for DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer
Figure 4 for DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer
Viaarxiv icon

Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function

Add code
Sep 30, 2024
Figure 1 for Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function
Figure 2 for Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function
Figure 3 for Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function
Figure 4 for Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function
Viaarxiv icon

Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation

Add code
Sep 11, 2024
Figure 1 for Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation
Figure 2 for Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation
Figure 3 for Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation
Figure 4 for Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation
Viaarxiv icon

Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds

Add code
Aug 20, 2024
Figure 1 for Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds
Figure 2 for Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds
Figure 3 for Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds
Figure 4 for Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds
Viaarxiv icon