Zhanyu Ma

M4Fog: A Global Multi-Regional, Multi-Modal, and Multi-Stage Dataset for Marine Fog Detection and Forecasting to Bridge Ocean and Atmosphere

Jun 19, 2024
Via arXiv

NeRSP: Neural 3D Reconstruction for Reflective Objects with Sparse Polarized Images

Jun 11, 2024
Via arXiv

Zero-Shot Audio Captioning Using Soft and Hard Prompts

Jun 10, 2024
Via arXiv

Benchmarking Segmentation Models with Mask-Preserved Attribute Editing

Mar 10, 2024
Via arXiv

Vision-language Assisted Attribute Learning

Dec 15, 2023
Via arXiv

HumanRecon: Neural Reconstruction of Dynamic Human Using Geometric Cues and Physical Priors

Nov 26, 2023
Via arXiv

DemoFusion: Democratising High-Resolution Image Generation With No $$$

Nov 24, 2023
Via arXiv

Multi-Semantic Fusion Model for Generalized Zero-Shot Skeleton-Based Action Recognition

Sep 18, 2023
Via arXiv

LaDA: Latent Dialogue Action For Zero-shot Cross-lingual Neural Network Language Modeling

Aug 05, 2023
Via arXiv

Super-Resolution Information Enhancement For Crowd Counting

Mar 13, 2023
Via arXiv