Yuxuan Wang

Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task

Aug 24, 2022
Stan Weixian Lei, Difei Gao, Jay Zhangjie Wu, Yuxuan Wang, Wei Liu, Mengmi Zhang, Mike Zheng Shou

Via arXiv

A Piecewise Monotonic Gait Phase Estimation Model for Controlling a Powered Transfemoral Prosthesis in Various Locomotion Modes

Jul 25, 2022
Xinxing Chen, Chuheng Chen, Yuxuan Wang, Bowen Yang, Teng Ma, Yuquan Leng, Chenglong Fu

Via arXiv

Controllable and Lossless Non-Autoregressive End-to-End Text-to-Speech

Jul 13, 2022
Zhengxi Liu, Qiao Tian, Chenxu Hu, Xudong Liu, Menglin Wu, Yuping Wang, Hang Zhao, Yuxuan Wang

Via arXiv

SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation

Jun 16, 2022
Tao Sun, Mattia Segu, Janis Postels, Yuxuan Wang, Luc Van Gool, Bernt Schiele, Federico Tombari, Fisher Yu

Via arXiv

VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration

Apr 17, 2022
Haohe Liu, Xubo Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang

Via arXiv

GEB+: A benchmark for generic event boundary captioning, grounding and text-based retrieval

Apr 10, 2022
Yuxuan Wang, Difei Gao, Licheng Yu, Stan Weixian Lei, Matt Feiszli, Mike Zheng Shou

Via arXiv

NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention Mechanism

Mar 31, 2022
Jingbei Li, Yi Meng, Zhiyong Wu, Helen Meng, Qiao Tian, Yuping Wang, Yuxuan Wang

Via arXiv

The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge

Feb 10, 2022
Maokui He, Xiang Lv, Weilin Zhou, JingJing Yin, Xiaoqi Zhang, Yuxuan Wang, Shutong Niu, Yuhang Cao, Heng Lu, Jun Du, Chin-Hui Lee

Via arXiv

AssistSR: Affordance-centric Question-driven Video Segment Retrieval

Dec 06, 2021
Stan Weixian Lei, Yuxuan Wang, Dongxing Mao, Difei Gao, Mike Zheng Shou

Via arXiv