Alert button

"Text": models, code, and papers
Alert button

Learning Input-agnostic Manipulation Directions in StyleGAN with Text Guidance

Feb 26, 2023
Yoonjeon Kim, Hyunsu Kim, Junho Kim, Yunjey Choi, Eunho Yang

Viaarxiv icon

RoCOCO: Robust Benchmark MS-COCO to Stress-test Robustness of Image-Text Matching Models

Apr 21, 2023
Seulki Park, Daeho Um, Hajung Yoon, Sanghyuk Chun, Sangdoo Yun, Jin Young Choi

Figure 1 for RoCOCO: Robust Benchmark MS-COCO to Stress-test Robustness of Image-Text Matching Models
Figure 2 for RoCOCO: Robust Benchmark MS-COCO to Stress-test Robustness of Image-Text Matching Models
Figure 3 for RoCOCO: Robust Benchmark MS-COCO to Stress-test Robustness of Image-Text Matching Models
Figure 4 for RoCOCO: Robust Benchmark MS-COCO to Stress-test Robustness of Image-Text Matching Models
Viaarxiv icon

BeAts: Bengali Speech Acts Recognition using Multimodal Attention Fusion

Jun 05, 2023
Ahana Deb, Sayan Nag, Ayan Mahapatra, Soumitri Chattopadhyay, Aritra Marik, Pijush Kanti Gayen, Shankha Sanyal, Archi Banerjee, Samir Karmakar

Figure 1 for BeAts: Bengali Speech Acts Recognition using Multimodal Attention Fusion
Figure 2 for BeAts: Bengali Speech Acts Recognition using Multimodal Attention Fusion
Figure 3 for BeAts: Bengali Speech Acts Recognition using Multimodal Attention Fusion
Figure 4 for BeAts: Bengali Speech Acts Recognition using Multimodal Attention Fusion
Viaarxiv icon

Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval

Jun 03, 2023
Xu Zhang, Zhedong Zheng, Xiaohan Wang, Yi Yang

Figure 1 for Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval
Figure 2 for Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval
Figure 3 for Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval
Figure 4 for Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval
Viaarxiv icon

Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization

Feb 24, 2023
Zhixin Guo, Minyxuan Yan, Jiexing Qi, Jianping Zhou, Ziwei He, Zhouhan Lin, Guanjie Zheng, Xinbing Wang

Figure 1 for Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization
Figure 2 for Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization
Figure 3 for Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization
Figure 4 for Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization
Viaarxiv icon

Scaling up GANs for Text-to-Image Synthesis

Mar 09, 2023
Minguk Kang, Jun-Yan Zhu, Richard Zhang, Jaesik Park, Eli Shechtman, Sylvain Paris, Taesung Park

Figure 1 for Scaling up GANs for Text-to-Image Synthesis
Figure 2 for Scaling up GANs for Text-to-Image Synthesis
Figure 3 for Scaling up GANs for Text-to-Image Synthesis
Figure 4 for Scaling up GANs for Text-to-Image Synthesis
Viaarxiv icon

Improving CLIP Training with Language Rewrites

May 31, 2023
Lijie Fan, Dilip Krishnan, Phillip Isola, Dina Katabi, Yonglong Tian

Figure 1 for Improving CLIP Training with Language Rewrites
Figure 2 for Improving CLIP Training with Language Rewrites
Figure 3 for Improving CLIP Training with Language Rewrites
Figure 4 for Improving CLIP Training with Language Rewrites
Viaarxiv icon

Sequentially Controlled Text Generation

Jan 05, 2023
Alexander Spangher, Xinyu Hua, Yao Ming, Nanyun Peng

Figure 1 for Sequentially Controlled Text Generation
Figure 2 for Sequentially Controlled Text Generation
Figure 3 for Sequentially Controlled Text Generation
Figure 4 for Sequentially Controlled Text Generation
Viaarxiv icon

FLuRKA: Fast fused Low-Rank & Kernel Attention

Jun 27, 2023
Ahan Gupta, Yueming Yuan, Yanqi Zhou, Charith Mendis

Figure 1 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Figure 2 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Figure 3 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Figure 4 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Viaarxiv icon

Mapping and Cleaning Open Commonsense Knowledge Bases with Generative Translation

Jun 22, 2023
Julien Romero, Simon Razniewski

Figure 1 for Mapping and Cleaning Open Commonsense Knowledge Bases with Generative Translation
Figure 2 for Mapping and Cleaning Open Commonsense Knowledge Bases with Generative Translation
Figure 3 for Mapping and Cleaning Open Commonsense Knowledge Bases with Generative Translation
Figure 4 for Mapping and Cleaning Open Commonsense Knowledge Bases with Generative Translation
Viaarxiv icon