Alert button

"Text": models, code, and papers
Alert button

Vec-Tok Speech: speech vectorization and tokenization for neural speech generation

Oct 11, 2023
Xinfa Zhu, Yuanjun Lv, Yi Lei, Tao Li, Wendi He, Hongbin Zhou, Heng Lu, Lei Xie

Viaarxiv icon

Controllable Data Generation Via Iterative Data-Property Mutual Mappings

Oct 11, 2023
Bo Pan, Muran Qin, Shiyu Wang, Yifei Zhang, Liang Zhao

Figure 1 for Controllable Data Generation Via Iterative Data-Property Mutual Mappings
Figure 2 for Controllable Data Generation Via Iterative Data-Property Mutual Mappings
Figure 3 for Controllable Data Generation Via Iterative Data-Property Mutual Mappings
Figure 4 for Controllable Data Generation Via Iterative Data-Property Mutual Mappings
Viaarxiv icon

Bridging the Gap between Human Motion and Action Semantics via Kinematic Phrases

Oct 11, 2023
Xinpeng Liu, Yong-Lu Li, Ailing Zeng, Zizheng Zhou, Yang You, Cewu Lu

Figure 1 for Bridging the Gap between Human Motion and Action Semantics via Kinematic Phrases
Figure 2 for Bridging the Gap between Human Motion and Action Semantics via Kinematic Phrases
Figure 3 for Bridging the Gap between Human Motion and Action Semantics via Kinematic Phrases
Figure 4 for Bridging the Gap between Human Motion and Action Semantics via Kinematic Phrases
Viaarxiv icon

The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains

Oct 05, 2023
Erica Cooper, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi

Figure 1 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 2 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 3 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 4 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Viaarxiv icon

Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis

Oct 05, 2023
Jae-Sung Bae, Joun Yeop Lee, Ji-Hyun Lee, Seongkyu Mun, Taehwa Kang, Hoon-Young Cho, Chanwoo Kim

Figure 1 for Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis
Figure 2 for Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis
Figure 3 for Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis
Figure 4 for Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis
Viaarxiv icon

FELM: Benchmarking Factuality Evaluation of Large Language Models

Oct 01, 2023
Shiqi Chen, Yiran Zhao, Jinghan Zhang, I-Chun Chern, Siyang Gao, Pengfei Liu, Junxian He

Viaarxiv icon

LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation

Sep 21, 2023
Jennifer A Bishop, Qianqian Xie, Sophia Ananiadou

Viaarxiv icon

Large Language Model (LLM) as a System of Multiple Expert Agents: An Approach to solve the Abstraction and Reasoning Corpus (ARC) Challenge

Oct 08, 2023
John Chong Min Tan, Mehul Motani

Viaarxiv icon

Generative Spoken Language Model based on continuous word-sized audio tokens

Oct 08, 2023
Robin Algayres, Yossi Adi, Tu Anh Nguyen, Jade Copet, Gabriel Synnaeve, Benoit Sagot, Emmanuel Dupoux

Viaarxiv icon

Solution for SMART-101 Challenge of ICCV Multi-modal Algorithmic Reasoning Task 2023

Oct 10, 2023
Xiangyu Wu, Yang Yang, Shengdong Xu, Yifeng Wu, Qingguo Chen, Jianfeng Lu

Figure 1 for Solution for SMART-101 Challenge of ICCV Multi-modal Algorithmic Reasoning Task 2023
Figure 2 for Solution for SMART-101 Challenge of ICCV Multi-modal Algorithmic Reasoning Task 2023
Figure 3 for Solution for SMART-101 Challenge of ICCV Multi-modal Algorithmic Reasoning Task 2023
Figure 4 for Solution for SMART-101 Challenge of ICCV Multi-modal Algorithmic Reasoning Task 2023
Viaarxiv icon