"Text": models, code, and papers
Abstractive Summarization Guided by Latent Hierarchical Document Structure

Nov 17, 2022
Yifu Qiu, Shay B. Cohen

Hey ASR System! Why Aren't You More Inclusive? Automatic Speech Recognition Systems' Bias and Proposed Bias Mitigation Techniques. A Literature Review

Nov 17, 2022
Mikel K. Ngueajio, Gloria Washington

Improving Audio-Language Learning with MixGen and Multi-Level Test-Time Augmentation

Oct 31, 2022
Eungbeom Kim, Jinhee Kim, Yoori Oh, Kyungsu Kim, Minju Park, Jaeheon Sim, Jinwoo Lee, Kyogu Lee

Two Is Better Than One: Dual Embeddings for Complementary Product Recommendations

Nov 29, 2022
Giorgi Kvernadze, Putu Ayu G. Sudyanti, Nishan Subedi, Mohammad Hajiaghayi

Integration of Text and Graph-based Features for Detecting Mental Health Disorders from Voice

May 14, 2022
Nasser Ghadiri, Rasoul Samani, Fahime Shahrokh

Leveraging per Image-Token Consistency for Vision-Language Pre-training

Nov 20, 2022
Yunhao Gou, Tom Ko, Hansi Yang, James Kwok, Yu Zhang, Mingxuan Wang

Feature Weaken: Vicinal Data Augmentation for Classification

Nov 20, 2022
Songhao Jiang, Yan Chu, Tianxing Ma, Tianning Zang

AvatarGen: A 3D Generative Model for Animatable Human Avatars

Nov 26, 2022
Jianfeng Zhang, Zihang Jiang, Dingdong Yang, Hongyi Xu, Yichun Shi, Guoxian Song, Zhongcong Xu, Xinchao Wang, Jiashi Feng

Khmer Text Classification Using Word Embedding and Neural Networks

Dec 13, 2021
Rina Buoy, Nguonly Taing, Sovisal Chenda

Arbitrary-Shaped Text Detection with Adaptive Text Region Representation

Apr 01, 2021
Xiufeng Jiang, Shugong Xu, Shunqing Zhang, Shan Cao
