"Text": models, code, and papers

FastCLIPStyler: Towards fast text-based image style transfer using style representation

Oct 07, 2022
Ananda Padhmanabhan Suresh, Sanjana Jain, Pavit Noinongyao, Ankush Ganguly

Via arXiv

FAF: A novel multimodal emotion recognition approach integrating face, body and text

Nov 20, 2022
Zhongyu Fang, Aoyun He, Qihui Yu, Baopeng Gao, Weiping Ding, Tong Zhang, Lei Ma

Via arXiv

Improving Table Structure Recognition with Visual-Alignment Sequential Coordinate Modeling

Mar 20, 2023
Yongshuai Huang, Ning Lu, Dapeng Chen, Yibo Li, Zecheng Xie, Shenggao Zhu, Liangcai Gao, Wei Peng

Via arXiv

Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition

Mar 20, 2023
Xiaoyu Yang, Qiujia Li, Chao Zhang, Philip C. Woodland

Via arXiv

Object-Centric Slot Diffusion

Mar 20, 2023
Jindong Jiang, Fei Deng, Gautam Singh, Sungjin Ahn

Via arXiv

Erasing Concepts from Diffusion Models

Mar 13, 2023
Rohit Gandikota, Joanna Materzynska, Jaden Fiotto-Kaufman, David Bau

Via arXiv

Generating Hierarchical Explanations on Text Classification Without Connecting Rules

Oct 24, 2022
Yiming Ju, Yuanzhe Zhang, Kang Liu, Jun Zhao

Via arXiv

BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset

Mar 10, 2023
Md. Istiak Hossain Shihab, Md. Rakibul Hasan, Mahfuzur Rahman Emon, Syed Mobassir Hossen, Md. Nazmuddoha Ansary, Intesur Ahmed, Fazle Rabbi Rakib, Shahriar Elahi Dhruvo, Souhardya Saha Dip, Akib Hasan Pavel, Marsia Haque Meghla, Md. Rezwanul Haque, Sayma Sultana Chowdhury, Farig Sadeque, Tahsin Reasat, Ahmed Imtiaz Humayun, Asif Shahriyar Sushmit

Via arXiv

A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition

Dec 01, 2022
Biao Ma, Chengben Xu, Ye Zhang

Via arXiv

Characterizing Financial Market Coverage using Artificial Intelligence

Feb 07, 2023
Jean Marie Tshimula, D'Jeff K. Nkashama, Patrick Owusu, Marc Frappier, Pierre-Martin Tardif, Froduald Kabanza, Armelle Brun, Jean-Marc Patenaude, Shengrui Wang, Belkacem Chikhaoui

Via arXiv