Alert button

"Text": models, code, and papers
Alert button

Prosodic features improve sentence segmentation and parsing

Feb 23, 2023
Elizabeth Nielsen, Sharon Goldwater, Mark Steedman

Figure 1 for Prosodic features improve sentence segmentation and parsing
Figure 2 for Prosodic features improve sentence segmentation and parsing
Figure 3 for Prosodic features improve sentence segmentation and parsing
Figure 4 for Prosodic features improve sentence segmentation and parsing
Viaarxiv icon

An Interdisciplinary Perspective on Evaluation and Experimental Design for Visual Text Analytics: Position Paper

Sep 23, 2022
Kostiantyn Kucher, Nicole Sultanum, Angel Daza, Vasiliki Simaki, Maria Skeppstedt, Barbara Plank, Jean-Daniel Fekete, Narges Mahyar

Figure 1 for An Interdisciplinary Perspective on Evaluation and Experimental Design for Visual Text Analytics: Position Paper
Figure 2 for An Interdisciplinary Perspective on Evaluation and Experimental Design for Visual Text Analytics: Position Paper
Figure 3 for An Interdisciplinary Perspective on Evaluation and Experimental Design for Visual Text Analytics: Position Paper
Viaarxiv icon

Multi-modal Machine Learning in Engineering Design: A Review and Future Directions

Feb 14, 2023
Binyang Song, Rui Zhou, Faez Ahmed

Figure 1 for Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Figure 2 for Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Figure 3 for Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Figure 4 for Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Viaarxiv icon

Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval

Feb 06, 2023
Kuniaki Saito, Kihyuk Sohn, Xiang Zhang, Chun-Liang Li, Chen-Yu Lee, Kate Saenko, Tomas Pfister

Figure 1 for Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval
Figure 2 for Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval
Figure 3 for Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval
Figure 4 for Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval
Viaarxiv icon

UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis

Dec 06, 2022
Yi Lei, Shan Yang, Xinsheng Wang, Qicong Xie, Jixun Yao, Lei Xie, Dan Su

Figure 1 for UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
Figure 2 for UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
Figure 3 for UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
Figure 4 for UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
Viaarxiv icon

The Role of Local Alignment and Uniformity in Image-Text Contrastive Learning on Medical Images

Nov 14, 2022
Philip Müller, Georgios Kaissis, Daniel Rueckert

Figure 1 for The Role of Local Alignment and Uniformity in Image-Text Contrastive Learning on Medical Images
Figure 2 for The Role of Local Alignment and Uniformity in Image-Text Contrastive Learning on Medical Images
Figure 3 for The Role of Local Alignment and Uniformity in Image-Text Contrastive Learning on Medical Images
Figure 4 for The Role of Local Alignment and Uniformity in Image-Text Contrastive Learning on Medical Images
Viaarxiv icon

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Oct 27, 2022
Jingyi li, Weiping tu, Li xiao

Figure 1 for FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Figure 2 for FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Figure 3 for FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Figure 4 for FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Viaarxiv icon

Single-branch Network for Multimodal Training

Mar 10, 2023
Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Muhammad Zaigham Zaheer, Karthik Nandakumar, Muhammad Haroon Yousaf, Arif Mahmood

Figure 1 for Single-branch Network for Multimodal Training
Figure 2 for Single-branch Network for Multimodal Training
Figure 3 for Single-branch Network for Multimodal Training
Figure 4 for Single-branch Network for Multimodal Training
Viaarxiv icon

Synthesizing Mixed-type Electronic Health Records using Diffusion Models

Feb 28, 2023
Taha Ceritli, Ghadeer O. Ghosheh, Vinod Kumar Chauhan, Tingting Zhu, Andrew P. Creagh, David A. Clifton

Figure 1 for Synthesizing Mixed-type Electronic Health Records using Diffusion Models
Figure 2 for Synthesizing Mixed-type Electronic Health Records using Diffusion Models
Figure 3 for Synthesizing Mixed-type Electronic Health Records using Diffusion Models
Figure 4 for Synthesizing Mixed-type Electronic Health Records using Diffusion Models
Viaarxiv icon

CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval

Aug 21, 2022
Haoran Wang, Dongliang He, Wenhao Wu, Boyang Xia, Min Yang, Fu Li, Yunlong Yu, Zhong Ji, Errui Ding, Jingdong Wang

Figure 1 for CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Figure 2 for CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Figure 3 for CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Figure 4 for CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Viaarxiv icon