Picture for Tao Wang

Tao Wang

Mel-Refine: A Plug-and-Play Approach to Refine Mel-Spectrogram in Audio Generation

Add code
Dec 11, 2024
Viaarxiv icon

ScribbleVS: Scribble-Supervised Medical Image Segmentation via Dynamic Competitive Pseudo Label Selection

Add code
Nov 15, 2024
Figure 1 for ScribbleVS: Scribble-Supervised Medical Image Segmentation via Dynamic Competitive Pseudo Label Selection
Figure 2 for ScribbleVS: Scribble-Supervised Medical Image Segmentation via Dynamic Competitive Pseudo Label Selection
Figure 3 for ScribbleVS: Scribble-Supervised Medical Image Segmentation via Dynamic Competitive Pseudo Label Selection
Figure 4 for ScribbleVS: Scribble-Supervised Medical Image Segmentation via Dynamic Competitive Pseudo Label Selection
Viaarxiv icon

Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing

Add code
Nov 09, 2024
Figure 1 for Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing
Figure 2 for Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing
Figure 3 for Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing
Figure 4 for Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing
Viaarxiv icon

Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation

Add code
Nov 08, 2024
Viaarxiv icon

Synergy-Guided Regional Supervision of Pseudo Labels for Semi-Supervised Medical Image Segmentation

Add code
Nov 07, 2024
Figure 1 for Synergy-Guided Regional Supervision of Pseudo Labels for Semi-Supervised Medical Image Segmentation
Figure 2 for Synergy-Guided Regional Supervision of Pseudo Labels for Semi-Supervised Medical Image Segmentation
Figure 3 for Synergy-Guided Regional Supervision of Pseudo Labels for Semi-Supervised Medical Image Segmentation
Figure 4 for Synergy-Guided Regional Supervision of Pseudo Labels for Semi-Supervised Medical Image Segmentation
Viaarxiv icon

Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning

Add code
Nov 06, 2024
Viaarxiv icon

Exploring structure diversity in atomic resolution microscopy with graph neural networks

Add code
Oct 23, 2024
Figure 1 for Exploring structure diversity in atomic resolution microscopy with graph neural networks
Figure 2 for Exploring structure diversity in atomic resolution microscopy with graph neural networks
Figure 3 for Exploring structure diversity in atomic resolution microscopy with graph neural networks
Figure 4 for Exploring structure diversity in atomic resolution microscopy with graph neural networks
Viaarxiv icon

Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation

Add code
Oct 17, 2024
Figure 1 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Figure 2 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Figure 3 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Figure 4 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Viaarxiv icon

DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech

Add code
Sep 18, 2024
Figure 1 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Figure 2 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Figure 3 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Figure 4 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Viaarxiv icon

WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification

Add code
Sep 18, 2024
Figure 1 for WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification
Figure 2 for WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification
Figure 3 for WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification
Figure 4 for WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification
Viaarxiv icon