Picture for Alexander Visheratin

Alexander Visheratin

Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions

Add code
Nov 10, 2025
Figure 1 for Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions
Figure 2 for Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions
Figure 3 for Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions
Figure 4 for Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions
Viaarxiv icon

Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models

Add code
Sep 16, 2024
Figure 1 for Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models
Figure 2 for Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models
Figure 3 for Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models
Figure 4 for Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models
Viaarxiv icon

NLLB-CLIP -- train performant multilingual image retrieval model on a budget

Add code
Sep 04, 2023
Viaarxiv icon