Tao Wang

Zero-Query Adversarial Attack on Black-box Automatic Speech Recognition Systems

Jun 27, 2024
Via arXiv

A Multi-Stage Goal-Driven Network for Pedestrian Trajectory Prediction

Jun 26, 2024
Via arXiv

MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation

Jun 15, 2024
Via arXiv

CorrMAE: Pre-training Correspondence Transformers with Masked Autoencoder

Jun 09, 2024
Via arXiv

PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation

Jun 07, 2024
Via arXiv

TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Jun 07, 2024
Via arXiv

DFA-GNN: Forward Learning of Graph Neural Networks by Direct Feedback Alignment

Jun 04, 2024
Via arXiv

Mollification Effects of Policy Gradient Methods

May 28, 2024
Via arXiv

Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression

May 21, 2024
Via arXiv

Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels

May 15, 2024
Via arXiv