Alert button

"Text": models, code, and papers
Alert button

Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition

Nov 07, 2022
Yashesh Gaur, Nick Kibre, Jian Xue, Kangyuan Shu, Yuhui Wang, Issac Alphanso, Jinyu Li, Yifan Gong

Figure 1 for Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition
Figure 2 for Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition
Figure 3 for Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition
Figure 4 for Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition
Viaarxiv icon

An Explicit Expansion of the Kullback-Leibler Divergence along its Fisher-Rao Gradient Flow

Feb 23, 2023
Carles Domingo-Enrich, Aram-Alexandre Pooladian

Figure 1 for An Explicit Expansion of the Kullback-Leibler Divergence along its Fisher-Rao Gradient Flow
Figure 2 for An Explicit Expansion of the Kullback-Leibler Divergence along its Fisher-Rao Gradient Flow
Figure 3 for An Explicit Expansion of the Kullback-Leibler Divergence along its Fisher-Rao Gradient Flow
Viaarxiv icon

Retrieval-Augmented Classification with Decoupled Representation

Mar 23, 2023
Xinnian Liang, Shuangzhi Wu, Hui Huang, Jiaqi Bai, Chao Bian, Zhoujun Li

Figure 1 for Retrieval-Augmented Classification with Decoupled Representation
Figure 2 for Retrieval-Augmented Classification with Decoupled Representation
Figure 3 for Retrieval-Augmented Classification with Decoupled Representation
Figure 4 for Retrieval-Augmented Classification with Decoupled Representation
Viaarxiv icon

A Scene-Text Synthesis Engine Achieved Through Learning from Decomposed Real-World Data

Sep 06, 2022
Zhengmi Tang, Tomo Miyazaki, Shinichiro Omachi

Figure 1 for A Scene-Text Synthesis Engine Achieved Through Learning from Decomposed Real-World Data
Figure 2 for A Scene-Text Synthesis Engine Achieved Through Learning from Decomposed Real-World Data
Figure 3 for A Scene-Text Synthesis Engine Achieved Through Learning from Decomposed Real-World Data
Figure 4 for A Scene-Text Synthesis Engine Achieved Through Learning from Decomposed Real-World Data
Viaarxiv icon

ClipFace: Text-guided Editing of Textured 3D Morphable Models

Dec 02, 2022
Shivangi Aneja, Justus Thies, Angela Dai, Matthias Nießner

Figure 1 for ClipFace: Text-guided Editing of Textured 3D Morphable Models
Figure 2 for ClipFace: Text-guided Editing of Textured 3D Morphable Models
Figure 3 for ClipFace: Text-guided Editing of Textured 3D Morphable Models
Figure 4 for ClipFace: Text-guided Editing of Textured 3D Morphable Models
Viaarxiv icon

Exploring the Relevance of Data Privacy-Enhancing Technologies for AI Governance Use Cases

Mar 20, 2023
Emma Bluemke, Tantum Collins, Ben Garfinkel, Andrew Trask

Figure 1 for Exploring the Relevance of Data Privacy-Enhancing Technologies for AI Governance Use Cases
Viaarxiv icon

Pluralistic Aging Diffusion Autoencoder

Mar 20, 2023
Peipei Li, Rui Wang, Huaibo Huang, Ran He, Zhaofeng He

Figure 1 for Pluralistic Aging Diffusion Autoencoder
Figure 2 for Pluralistic Aging Diffusion Autoencoder
Figure 3 for Pluralistic Aging Diffusion Autoencoder
Figure 4 for Pluralistic Aging Diffusion Autoencoder
Viaarxiv icon

SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech

Nov 30, 2022
Byoung Jin Choi, Myeonghun Jeong, Joun Yeop Lee, Nam Soo Kim

Figure 1 for SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech
Figure 2 for SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech
Viaarxiv icon

Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages

Mar 30, 2023
Zheng-Xin Yong, Ruochen Zhang, Jessica Zosa Forde, Skyler Wang, Samuel Cahyawijaya, Holy Lovenia, Genta Indra Winata, Lintang Sutawika, Jan Christian Blaise Cruz, Long Phan, Yin Lin Tan, Alham Fikri Aji

Figure 1 for Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages
Figure 2 for Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages
Figure 3 for Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages
Figure 4 for Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages
Viaarxiv icon

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models

Mar 30, 2023
Vidit Goel, Elia Peruzzo, Yifan Jiang, Dejia Xu, Nicu Sebe, Trevor Darrell, Zhangyang Wang, Humphrey Shi

Figure 1 for PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models
Figure 2 for PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models
Figure 3 for PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models
Figure 4 for PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models
Viaarxiv icon