Alert button

"Text": models, code, and papers
Alert button

Dense Text-to-Image Generation with Attention Modulation

Aug 24, 2023
Yunji Kim, Jiyoung Lee, Jin-Hwa Kim, Jung-Woo Ha, Jun-Yan Zhu

Figure 1 for Dense Text-to-Image Generation with Attention Modulation
Figure 2 for Dense Text-to-Image Generation with Attention Modulation
Figure 3 for Dense Text-to-Image Generation with Attention Modulation
Figure 4 for Dense Text-to-Image Generation with Attention Modulation
Viaarxiv icon

AE-smnsMLC: Multi-Label Classification with Semantic Matching and Negative Label Sampling for Product Attribute Value Extraction

Oct 11, 2023
Zhongfen Deng, Wei-Te Chen, Lei Chen, Philip S. Yu

Figure 1 for AE-smnsMLC: Multi-Label Classification with Semantic Matching and Negative Label Sampling for Product Attribute Value Extraction
Figure 2 for AE-smnsMLC: Multi-Label Classification with Semantic Matching and Negative Label Sampling for Product Attribute Value Extraction
Figure 3 for AE-smnsMLC: Multi-Label Classification with Semantic Matching and Negative Label Sampling for Product Attribute Value Extraction
Figure 4 for AE-smnsMLC: Multi-Label Classification with Semantic Matching and Negative Label Sampling for Product Attribute Value Extraction
Viaarxiv icon

Generative Pre-training for Speech with Flow Matching

Oct 25, 2023
Alexander H. Liu, Matt Le, Apoorv Vyas, Bowen Shi, Andros Tjandra, Wei-Ning Hsu

Viaarxiv icon

On the Interplay between Fairness and Explainability

Oct 25, 2023
Stephanie Brandl, Emanuele Bugliarello, Ilias Chalkidis

Viaarxiv icon

Generation or Replication: Auscultating Audio Latent Diffusion Models

Oct 16, 2023
Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux

Viaarxiv icon

CgT-GAN: CLIP-guided Text GAN for Image Captioning

Aug 23, 2023
Jiarui Yu, Haoran Li, Yanbin Hao, Bin Zhu, Tong Xu, Xiangnan He

Figure 1 for CgT-GAN: CLIP-guided Text GAN for Image Captioning
Figure 2 for CgT-GAN: CLIP-guided Text GAN for Image Captioning
Figure 3 for CgT-GAN: CLIP-guided Text GAN for Image Captioning
Figure 4 for CgT-GAN: CLIP-guided Text GAN for Image Captioning
Viaarxiv icon

Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images

Oct 10, 2023
Che Liu, Anand Shah, Wenjia Bai, Rossella Arcucci

Figure 1 for Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images
Figure 2 for Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images
Figure 3 for Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images
Figure 4 for Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images
Viaarxiv icon

M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models

Oct 30, 2023
Wai-Chung Kwan, Xingshan Zeng, Yufei Wang, Yusen Sun, Liangyou Li, Lifeng Shang, Qun Liu, Kam-Fai Wong

Figure 1 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Figure 2 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Figure 3 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Figure 4 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Viaarxiv icon

Creating a silver standard for patent simplification

Oct 24, 2023
Silvia Casola, Alberto Lavelli, Horacio Saggion

Viaarxiv icon

Multi3DRefer: Grounding Text Description to Multiple 3D Objects

Sep 11, 2023
Yiming Zhang, ZeMing Gong, Angel X. Chang

Figure 1 for Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Figure 2 for Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Figure 3 for Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Figure 4 for Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Viaarxiv icon