"Text": models, code, and papers

SAGE: Structured Attribute Value Generation for Billion-Scale Product Catalogs

Sep 12, 2023
Athanasios N. Nikolakopoulos, Swati Kaul, Siva Karthik Gade, Bella Dubrov, Umit Batur, Suleiman Ali Khan

Wi-BFI: Extracting the IEEE 802.11 Beamforming Feedback Information from Commercial Wi-Fi Devices

Sep 12, 2023
Khandaker Foysal Haque, Francesca Meneghello, Francesco Restuccia

Stylebook: Content-Dependent Speaking Style Modeling for Any-to-Any Voice Conversion using Only Speech Data

Sep 12, 2023
Hyungseob Lim, Kyungguen Byun, Sunkuk Moon, Erik Visser

TextDiffuser: Diffusion Models as Text Painters

May 24, 2023
Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

Controllable Emphasis with zero data for text-to-speech

Jul 13, 2023
Arnaud Joly, Marco Nicolis, Ekaterina Peterova, Alessandro Lombardi, Ammar Abbas, Arent van Korlaar, Aman Hussain, Parul Sharma, Alexis Moinet, Mateusz Lajszczak, Penny Karanasou, Antonio Bonafonte, Thomas Drugman, Elena Sokolova

Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning?

Jul 15, 2023
Jialu Gao, Kaizhe Hu, Guowei Xu, Huazhe Xu

UTRNet: High-Resolution Urdu Text Recognition In Printed Documents

Jun 27, 2023
Abdur Rahman, Arjun Ghosh, Chetan Arora

Enhancing Biomedical Text Summarization and Question-Answering: On the Utility of Domain-Specific Pre-Training

Jul 10, 2023
Dima Galat, Marian-Andrei Rizoiu

Impact of Visual Context on Noisy Multimodal NMT: An Empirical Study for English to Indian Languages

Aug 30, 2023
Baban Gain, Dibyanayan Bandyopadhyay, Samrat Mukherjee, Chandranath Adak, Asif Ekbal

SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data

Aug 24, 2023
Ziyan Yang, Kushal Kafle, Zhe Lin, Scott Cohen, Zhihong Ding, Vicente Ordonez
