Alert button

"Text": models, code, and papers
Alert button

T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up

Aug 18, 2022
Lin Wu, Yang Wang, Feng Zheng, Qi Tian, Meng Wang

Figure 1 for T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up
Figure 2 for T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up
Figure 3 for T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up
Figure 4 for T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up
Viaarxiv icon

Aesthetic Text Logo Synthesis via Content-aware Layout Inferring

Apr 06, 2022
Yizhi Wang, Guo Pu, Wenhan Luo, Yexin Wang, Pengfei Xiong, Hongwen Kang, Zhouhui Lian

Figure 1 for Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
Figure 2 for Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
Figure 3 for Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
Figure 4 for Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
Viaarxiv icon

Self-supervised vision-language pretraining for Medical visual question answering

Nov 24, 2022
Pengfei Li, Gang Liu, Lin Tan, Jinying Liao, Shenjun Zhong

Figure 1 for Self-supervised vision-language pretraining for Medical visual question answering
Figure 2 for Self-supervised vision-language pretraining for Medical visual question answering
Figure 3 for Self-supervised vision-language pretraining for Medical visual question answering
Figure 4 for Self-supervised vision-language pretraining for Medical visual question answering
Viaarxiv icon

How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection

Jan 18, 2023
Biyang Guo, Xin Zhang, Ziyuan Wang, Minqi Jiang, Jinran Nie, Yuxuan Ding, Jianwei Yue, Yupeng Wu

Figure 1 for How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection
Figure 2 for How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection
Figure 3 for How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection
Figure 4 for How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection
Viaarxiv icon

Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries

Jan 10, 2023
Matyáš Boháček, Marek Hrúz

Figure 1 for Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries
Figure 2 for Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries
Figure 3 for Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries
Figure 4 for Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries
Viaarxiv icon

On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization

May 24, 2022
Shruti Palaskar, Akshita Bhagia, Yonatan Bisk, Florian Metze, Alan W Black, Ana Marasovic

Figure 1 for On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization
Figure 2 for On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization
Figure 3 for On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization
Figure 4 for On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization
Viaarxiv icon

X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval

Mar 28, 2022
Satya Krishna Gorti, Noel Vouitsis, Junwei Ma, Keyvan Golestan, Maksims Volkovs, Animesh Garg, Guangwei Yu

Figure 1 for X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval
Figure 2 for X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval
Figure 3 for X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval
Figure 4 for X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval
Viaarxiv icon

Retrieval-Augmented Multimodal Language Modeling

Nov 22, 2022
Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Rich James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih

Figure 1 for Retrieval-Augmented Multimodal Language Modeling
Figure 2 for Retrieval-Augmented Multimodal Language Modeling
Figure 3 for Retrieval-Augmented Multimodal Language Modeling
Figure 4 for Retrieval-Augmented Multimodal Language Modeling
Viaarxiv icon

Understanding Translationese in Cross-Lingual Summarization

Dec 14, 2022
Jiaan Wang, Fandong Meng, Tingyi Zhang, Yunlong Liang, Jiarong Xu, Zhixu Li, Jie Zhou

Figure 1 for Understanding Translationese in Cross-Lingual Summarization
Figure 2 for Understanding Translationese in Cross-Lingual Summarization
Figure 3 for Understanding Translationese in Cross-Lingual Summarization
Figure 4 for Understanding Translationese in Cross-Lingual Summarization
Viaarxiv icon

Two-stream Hierarchical Similarity Reasoning for Image-text Matching

Mar 10, 2022
Ran Chen, Hanli Wang, Lei Wang, Sam Kwong

Figure 1 for Two-stream Hierarchical Similarity Reasoning for Image-text Matching
Figure 2 for Two-stream Hierarchical Similarity Reasoning for Image-text Matching
Figure 3 for Two-stream Hierarchical Similarity Reasoning for Image-text Matching
Figure 4 for Two-stream Hierarchical Similarity Reasoning for Image-text Matching
Viaarxiv icon