Haoxu Wang

Memory-Efficient and Secure DNN Inference on TrustZone-enabled Consumer IoT Devices

Mar 19, 2024
Xueshuo Xie, Haoxu Wang, Zhaolong Jian, Tao Li, Wei Wang, Zhiwei Xu, Guiling Wang

Via arXiv

Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer

Mar 04, 2024
Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li

Via arXiv

LCB-net: Long-Context Biasing for Audio-Visual Speech Recognition

Jan 12, 2024
Fan Yu, Haoxu Wang, Xian Shi, Shiliang Zhang

Via arXiv

Hourglass-AVSR: Down-Up Sampling-based Computational Efficiency Model for Audio-Visual Speech Recognition

Dec 14, 2023
Fan Yu, Haoxu Wang, Ziyang Ma, Shiliang Zhang

Via arXiv

SlideSpeech: A Large-Scale Slide-Enriched Audio-Visual Corpus

Sep 12, 2023
Haoxu Wang, Fan Yu, Xian Shi, Yuezhang Wang, Shiliang Zhang, Ming Li

Via arXiv

The DKU Post-Challenge Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge: Deep Analysis

Mar 04, 2023
Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li

Via arXiv

Generating Adversarial Samples For Training Wake-up Word Detection Systems Against Confusing Words

Jan 01, 2022
Haoxu Wang, Yan Jia, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li

Via arXiv