David Cox

On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis

Oct 04, 2021
Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David Cox, James Glass

Via arXiv

Global Rhythm Style Transfer Without Text Transcriptions

Jun 16, 2021
Kaizhi Qian, Yang Zhang, Shiyu Chang, Jinjun Xiong, Chuang Gan, David Cox, Mark Hasegawa-Johnson

Via arXiv

Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators

Jun 11, 2021
Yonggan Fu, Yongan Zhang, Yang Zhang, David Cox, Yingyan Lin

Via arXiv

PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition

Jun 10, 2021
Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David Cox, James Glass

Via arXiv

Lifelong Object Detection

Sep 02, 2020
Wang Zhou, Shiyu Chang, Norma Sosa, Hendrik Hamann, David Cox

Via arXiv

ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation

Jul 09, 2020
Chuang Gan, Jeremy Schwartz, Seth Alter, Martin Schrimpf, James Traer, Julian De Freitas, Jonas Kubilius, Abhishek Bhandwaldar, Nick Haber, Megumi Sano, Kuno Kim, Elias Wang, Damian Mrowca, Michael Lingelbach, Aidan Curtis, Kevin Feigelis, Daniel M. Bear, Dan Gutfreund, David Cox, James J. DiCarlo, Josh McDermott, Joshua B. Tenenbaum, Daniel L. K. Yamins

Via arXiv

Unsupervised Speech Decomposition via Triple Information Bottleneck

May 04, 2020
Kaizhi Qian, Yang Zhang, Shiyu Chang, David Cox, Mark Hasegawa-Johnson

Via arXiv

More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation

Dec 02, 2019
Quanfu Fan, Chun-Fu Chen, Hilde Kuehne, Marco Pistoia, David Cox

Via arXiv