Yuyi Zhang

PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography

Jan 07, 2026

Do Latent Tokens Think? A Causal and Adversarial Analysis of Chain-of-Continuous-Thought

Dec 25, 2025

Quantize More, Lose Less: Autoregressive Generation from Residually Quantized Speech Representations

Jul 16, 2025

MCCD: A Multi-Attribute Chinese Calligraphy Character Dataset Annotated with Script Styles, Dynasties, and Calligraphers

Jul 09, 2025

MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories

Jun 05, 2025

NTIRE 2025 Challenge on Image Super-Resolution (×4): Methods and Results

Apr 20, 2025

Predicting the Original Appearance of Damaged Historical Documents

Dec 16, 2024

HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

Mar 20, 2024

FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Dec 19, 2023

Exploring OCR Capabilities of GPT-4V : A Quantitative and In-depth Evaluation

Oct 29, 2023