Alert button

"Text": models, code, and papers
Alert button

On the Interplay between Fairness and Explainability

Oct 25, 2023
Stephanie Brandl, Emanuele Bugliarello, Ilias Chalkidis

Viaarxiv icon

Generation or Replication: Auscultating Audio Latent Diffusion Models

Oct 16, 2023
Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux

Viaarxiv icon

CgT-GAN: CLIP-guided Text GAN for Image Captioning

Aug 23, 2023
Jiarui Yu, Haoran Li, Yanbin Hao, Bin Zhu, Tong Xu, Xiangnan He

Figure 1 for CgT-GAN: CLIP-guided Text GAN for Image Captioning
Figure 2 for CgT-GAN: CLIP-guided Text GAN for Image Captioning
Figure 3 for CgT-GAN: CLIP-guided Text GAN for Image Captioning
Figure 4 for CgT-GAN: CLIP-guided Text GAN for Image Captioning
Viaarxiv icon

Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images

Oct 10, 2023
Che Liu, Anand Shah, Wenjia Bai, Rossella Arcucci

Figure 1 for Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images
Figure 2 for Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images
Figure 3 for Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images
Figure 4 for Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images
Viaarxiv icon

M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models

Oct 30, 2023
Wai-Chung Kwan, Xingshan Zeng, Yufei Wang, Yusen Sun, Liangyou Li, Lifeng Shang, Qun Liu, Kam-Fai Wong

Figure 1 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Figure 2 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Figure 3 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Figure 4 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Viaarxiv icon

Creating a silver standard for patent simplification

Oct 24, 2023
Silvia Casola, Alberto Lavelli, Horacio Saggion

Viaarxiv icon

Multi3DRefer: Grounding Text Description to Multiple 3D Objects

Sep 11, 2023
Yiming Zhang, ZeMing Gong, Angel X. Chang

Figure 1 for Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Figure 2 for Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Figure 3 for Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Figure 4 for Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Viaarxiv icon

ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing

Oct 17, 2023
Quoc-Nam Nguyen, Thang Chau Phan, Duc-Vu Nguyen, Kiet Van Nguyen

Viaarxiv icon

MULTISCRIPT: Multimodal Script Learning for Supporting Open Domain Everyday Tasks

Oct 08, 2023
Jingyuan Qi, Minqian Liu, Ying Shen, Zhiyang Xu, Lifu Huang

Viaarxiv icon

EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

Oct 18, 2023
Yaofang Liu, Xiaodong Cun, Xuebo Liu, Xintao Wang, Yong Zhang, Haoxin Chen, Yang Liu, Tieyong Zeng, Raymond Chan, Ying Shan

Figure 1 for EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Figure 2 for EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Figure 3 for EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Figure 4 for EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Viaarxiv icon