Alert button

"Text": models, code, and papers
Alert button

QASem Parsing: Text-to-text Modeling of QA-based Semantics

May 23, 2022
Ayal Klein, Eran Hirsch, Ron Eliav, Valentina Pyatkin, Avi Caciularu, Ido Dagan

Figure 1 for QASem Parsing: Text-to-text Modeling of QA-based Semantics
Figure 2 for QASem Parsing: Text-to-text Modeling of QA-based Semantics
Figure 3 for QASem Parsing: Text-to-text Modeling of QA-based Semantics
Figure 4 for QASem Parsing: Text-to-text Modeling of QA-based Semantics
Viaarxiv icon

MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis

Dec 08, 2022
Rishabh Dabral, Muhammad Hamza Mughal, Vladislav Golyanik, Christian Theobalt

Figure 1 for MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis
Figure 2 for MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis
Figure 3 for MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis
Figure 4 for MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis
Viaarxiv icon

DialogCC: Large-Scale Multi-Modal Dialogue Dataset

Dec 08, 2022
Young-Jun Lee, Byungsoo Ko, Han-Gyu Kim, Ho-Jin Choi

Figure 1 for DialogCC: Large-Scale Multi-Modal Dialogue Dataset
Figure 2 for DialogCC: Large-Scale Multi-Modal Dialogue Dataset
Figure 3 for DialogCC: Large-Scale Multi-Modal Dialogue Dataset
Figure 4 for DialogCC: Large-Scale Multi-Modal Dialogue Dataset
Viaarxiv icon

Memory-Based Label-Text Tuning for Few-Shot Class-Incremental Learning

Jul 03, 2022
Jinze Li, Yan Bai, Yihang Lou, Xiongkun Linghu, Jianzhong He, Shaoyun Xu, Tao Bai

Figure 1 for Memory-Based Label-Text Tuning for Few-Shot Class-Incremental Learning
Figure 2 for Memory-Based Label-Text Tuning for Few-Shot Class-Incremental Learning
Figure 3 for Memory-Based Label-Text Tuning for Few-Shot Class-Incremental Learning
Figure 4 for Memory-Based Label-Text Tuning for Few-Shot Class-Incremental Learning
Viaarxiv icon

On Metric Learning for Audio-Text Cross-Modal Retrieval

Apr 13, 2022
Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang

Figure 1 for On Metric Learning for Audio-Text Cross-Modal Retrieval
Figure 2 for On Metric Learning for Audio-Text Cross-Modal Retrieval
Viaarxiv icon

DePlot: One-shot visual language reasoning by plot-to-table translation

Dec 20, 2022
Fangyu Liu, Julian Martin Eisenschlos, Francesco Piccinno, Syrine Krichene, Chenxi Pang, Kenton Lee, Mandar Joshi, Wenhu Chen, Nigel Collier, Yasemin Altun

Figure 1 for DePlot: One-shot visual language reasoning by plot-to-table translation
Figure 2 for DePlot: One-shot visual language reasoning by plot-to-table translation
Figure 3 for DePlot: One-shot visual language reasoning by plot-to-table translation
Figure 4 for DePlot: One-shot visual language reasoning by plot-to-table translation
Viaarxiv icon

TTS-Guided Training for Accent Conversion Without Parallel Data

Dec 20, 2022
Yi Zhou, Zhizheng Wu, Mingyang Zhang, Xiaohai Tian, Haizhou Li

Figure 1 for TTS-Guided Training for Accent Conversion Without Parallel Data
Figure 2 for TTS-Guided Training for Accent Conversion Without Parallel Data
Figure 3 for TTS-Guided Training for Accent Conversion Without Parallel Data
Figure 4 for TTS-Guided Training for Accent Conversion Without Parallel Data
Viaarxiv icon

Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures

Nov 14, 2022
Gal Metzer, Elad Richardson, Or Patashnik, Raja Giryes, Daniel Cohen-Or

Figure 1 for Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures
Figure 2 for Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures
Figure 3 for Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures
Figure 4 for Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures
Viaarxiv icon

Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech

Jun 05, 2022
Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu, Zhenhui Ye

Figure 1 for Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech
Figure 2 for Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech
Figure 3 for Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech
Figure 4 for Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech
Viaarxiv icon

Adapting Multilingual Speech Representation Model for a New, Underresourced Language through Multilingual Fine-tuning and Continued Pretraining

Jan 18, 2023
Karol Nowakowski, Michal Ptaszynski, Kyoko Murasaki, Jagna Nieuważny

Figure 1 for Adapting Multilingual Speech Representation Model for a New, Underresourced Language through Multilingual Fine-tuning and Continued Pretraining
Figure 2 for Adapting Multilingual Speech Representation Model for a New, Underresourced Language through Multilingual Fine-tuning and Continued Pretraining
Figure 3 for Adapting Multilingual Speech Representation Model for a New, Underresourced Language through Multilingual Fine-tuning and Continued Pretraining
Figure 4 for Adapting Multilingual Speech Representation Model for a New, Underresourced Language through Multilingual Fine-tuning and Continued Pretraining
Viaarxiv icon