Multilingual Text To Image Generation


MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition-Perception-Reasoning Guided Text-Image Machine Translation

Add code
Mar 25, 2026
Viaarxiv icon

LogoDiffuser: Training-Free Multilingual Logo Generation and Stylization via Letter-Aware Attention Control

Add code
Mar 10, 2026
Viaarxiv icon

WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing

Add code
Mar 12, 2026
Viaarxiv icon

Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion

Add code
Feb 25, 2026
Viaarxiv icon

SoS: Analysis of Surface over Semantics in Multilingual Text-To-Image Generation

Add code
Jan 23, 2026
Viaarxiv icon

Brazilian Portuguese Image Captioning with Transformers: A Study on Cross-Native-Translated Dataset

Add code
Jan 30, 2026
Viaarxiv icon

Bridging Lexical Ambiguity and Vision: A Mini Review on Visual Word Sense Disambiguation

Add code
Feb 01, 2026
Viaarxiv icon

Multilingual-To-Multimodal (M2M): Unlocking New Languages with Monolingual Text

Add code
Jan 15, 2026
Viaarxiv icon

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Add code
Jan 16, 2026
Viaarxiv icon

A Generalist Foundation Model for Total-body PET/CT Enables Diagnostic Reporting and System-wide Metabolic Profiling

Add code
Jan 19, 2026
Viaarxiv icon