Alert button
Picture for Frank Zhang

Frank Zhang

Alert button

3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

Add code
Bookmark button
Alert button
Mar 14, 2024
Frank Zhang, Yibo Zhang, Quan Zheng, Rui Ma, Wei Hua, Hujun Bao, Weiwei Xu, Changqing Zou

Figure 1 for 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation
Figure 2 for 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation
Figure 3 for 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation
Figure 4 for 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation
Viaarxiv icon

Pushing the performances of ASR models on English and Spanish accents

Add code
Bookmark button
Alert button
Dec 22, 2022
Pooja Chitkara, Morgane Riviere, Jade Copet, Frank Zhang, Yatharth Saraf

Figure 1 for Pushing the performances of ASR models on English and Spanish accents
Figure 2 for Pushing the performances of ASR models on English and Spanish accents
Figure 3 for Pushing the performances of ASR models on English and Spanish accents
Figure 4 for Pushing the performances of ASR models on English and Spanish accents
Viaarxiv icon

Scaling ASR Improves Zero and Few Shot Learning

Add code
Bookmark button
Alert button
Nov 29, 2021
Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed

Figure 1 for Scaling ASR Improves Zero and Few Shot Learning
Figure 2 for Scaling ASR Improves Zero and Few Shot Learning
Figure 3 for Scaling ASR Improves Zero and Few Shot Learning
Figure 4 for Scaling ASR Improves Zero and Few Shot Learning
Viaarxiv icon

Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings

Add code
Bookmark button
Alert button
Oct 08, 2021
Jialu Li, Vimal Manohar, Pooja Chitkara, Andros Tjandra, Michael Picheny, Frank Zhang, Xiaohui Zhang, Yatharth Saraf

Figure 1 for Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings
Figure 2 for Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings
Figure 3 for Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings
Figure 4 for Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings
Viaarxiv icon

Improved Language Identification Through Cross-Lingual Self-Supervised Learning

Add code
Bookmark button
Alert button
Aug 04, 2021
Andros Tjandra, Diptanu Gon Choudhury, Frank Zhang, Kritika Singh, Alexis Conneau, Alexei Baevski, Assaf Sela, Yatharth Saraf, Michael Auli

Figure 1 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 2 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 3 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 4 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Viaarxiv icon

On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models

Add code
Bookmark button
Alert button
Jul 09, 2021
Xiaohui Zhang, Vimal Manohar, David Zhang, Frank Zhang, Yangyang Shi, Nayan Singhal, Julian Chan, Fuchun Peng, Yatharth Saraf, Mike Seltzer

Figure 1 for On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models
Figure 2 for On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models
Figure 3 for On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models
Figure 4 for On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models
Viaarxiv icon