Alert button

"Text": models, code, and papers
Alert button

Will the Prince Get True Love's Kiss? On the Model Sensitivity to Gender Perturbation over Fairytale Texts

Oct 16, 2023
Christina Chance, Da Yin, Dakuo Wang, Kai-Wei Chang

Viaarxiv icon

Scene Graph Conditioning in Latent Diffusion

Oct 16, 2023
Frank Fundel

Viaarxiv icon

RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model

Sep 02, 2023
Fengxiang Bie, Yibo Yang, Zhongzhu Zhou, Adam Ghanem, Minjia Zhang, Zhewei Yao, Xiaoxia Wu, Connor Holmes, Pareesa Golnari, David A. Clifton, Yuxiong He, Dacheng Tao, Shuaiwen Leon Song

Viaarxiv icon

Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models

Oct 13, 2023
Zhili Liu, Kai Chen, Yifan Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James Kwok

Figure 1 for Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models
Figure 2 for Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models
Figure 3 for Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models
Figure 4 for Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models
Viaarxiv icon

Towards contrast-agnostic soft segmentation of the spinal cord

Oct 23, 2023
Sandrine Bédard, Naga Karthik Enamundram, Charidimos Tsagkas, Emanuele Pravatà, Cristina Granziera, Andrew Smith, Kenneth Arnold Weber II, Julien Cohen-Adad

Viaarxiv icon

Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search

Sep 28, 2023
Yuanmin Tang, Jing Yu, Keke Gai, Yujing Wang, Yue Hu, Gang Xiong, Qi Wu

Figure 1 for Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search
Figure 2 for Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search
Figure 3 for Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search
Figure 4 for Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search
Viaarxiv icon

Fine-grained Conversational Decoding via Isotropic and Proximal Search

Oct 23, 2023
Yuxuan Yao, Han Wu, Qiling Xu, Linqi Song

Figure 1 for Fine-grained Conversational Decoding via Isotropic and Proximal Search
Figure 2 for Fine-grained Conversational Decoding via Isotropic and Proximal Search
Figure 3 for Fine-grained Conversational Decoding via Isotropic and Proximal Search
Figure 4 for Fine-grained Conversational Decoding via Isotropic and Proximal Search
Viaarxiv icon

LXMERT Model Compression for Visual Question Answering

Oct 23, 2023
Maryam Hashemi, Ghazaleh Mahmoudi, Sara Kodeiri, Hadi Sheikhi, Sauleh Eetemadi

Viaarxiv icon

Data Augmentation Techniques for Machine Translation of Code-Switched Texts: A Comparative Study

Oct 23, 2023
Injy Hamed, Nizar Habash, Ngoc Thang Vu

Viaarxiv icon

Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved

Sep 14, 2023
Yao Sun, Anna Kruspe, Liqiu Meng, Yifan Tian, Eike J Hoffmann, Stefan Auer, Xiao Xiang Zhu

Figure 1 for Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved
Figure 2 for Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved
Figure 3 for Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved
Figure 4 for Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved
Viaarxiv icon