Alert button

"Text": models, code, and papers
Alert button

Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers

May 21, 2023
Linyuan Gong, Chenyan Xiong, Xiaodong Liu, Payal Bajaj, Yiqing Xie, Alvin Cheung, Jianfeng Gao, Xia Song

Figure 1 for Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers
Figure 2 for Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers
Figure 3 for Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers
Figure 4 for Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers
Viaarxiv icon

Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation

May 29, 2023
Jiawei Huang, Yi Ren, Rongjie Huang, Dongchao Yang, Zhenhui Ye, Chen Zhang, Jinglin Liu, Xiang Yin, Zejun Ma, Zhou Zhao

Figure 1 for Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Figure 2 for Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Figure 3 for Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Figure 4 for Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Viaarxiv icon

A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision

Aug 15, 2023
Julio Silva-Rodriguez, Hadi Chakor, Riadh Kobbi, Jose Dolz, Ismail Ben Ayed

Figure 1 for A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision
Figure 2 for A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision
Figure 3 for A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision
Figure 4 for A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision
Viaarxiv icon

Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

May 31, 2023
Shihao Zhao, Dongdong Chen, Yen-Chun Chen, Jianmin Bao, Shaozhe Hao, Lu Yuan, Kwan-Yee K. Wong

Figure 1 for Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Figure 2 for Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Figure 3 for Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Figure 4 for Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Viaarxiv icon

Scaled-up Discovery of Latent Concepts in Deep NLP Models

Aug 20, 2023
Majd Hawasly, Fahim Dalvi, Nadir Durrani

Figure 1 for Scaled-up Discovery of Latent Concepts in Deep NLP Models
Figure 2 for Scaled-up Discovery of Latent Concepts in Deep NLP Models
Figure 3 for Scaled-up Discovery of Latent Concepts in Deep NLP Models
Figure 4 for Scaled-up Discovery of Latent Concepts in Deep NLP Models
Viaarxiv icon

M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce

Aug 22, 2023
Tao Chen, Ze Lin, Hui Li, Jiayi Ji, Yiyi Zhou, Guanbin Li, Rongrong Ji

Figure 1 for M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce
Figure 2 for M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce
Figure 3 for M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce
Figure 4 for M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce
Viaarxiv icon

WEARS: Wearable Emotion AI with Real-time Sensor data

Aug 22, 2023
Dhruv Limbani, Daketi Yatin, Nitish Chaturvedi, Vaishnavi Moorthy, Pushpalatha M, Harichandana BSS, Sumit Kumar

Viaarxiv icon

Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing

May 29, 2023
Aiwei Liu, Wei Liu, Xuming Hu, Shuang Li, Fukun Ma, Yawen Yang, Lijie Wen

Figure 1 for Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing
Figure 2 for Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing
Figure 3 for Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing
Figure 4 for Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing
Viaarxiv icon

NBIAS: A Natural Language Processing Framework for Bias Identification in Text

Aug 08, 2023
Shaina Raza, Muskan Garg, Deepak John Reji, Syed Raza Bashir, Chen Ding

Figure 1 for NBIAS: A Natural Language Processing Framework for Bias Identification in Text
Figure 2 for NBIAS: A Natural Language Processing Framework for Bias Identification in Text
Figure 3 for NBIAS: A Natural Language Processing Framework for Bias Identification in Text
Figure 4 for NBIAS: A Natural Language Processing Framework for Bias Identification in Text
Viaarxiv icon

ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes

Jun 04, 2023
Minghao Fu, Xin Man, Yihan Xu, Jie Shao

Figure 1 for ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes
Figure 2 for ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes
Figure 3 for ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes
Figure 4 for ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes
Viaarxiv icon