Picture for Jielei Zhang

Jielei Zhang

Large Language Model as Token Compressor and Decompressor

Add code
Mar 26, 2026
Viaarxiv icon

Efficient Causal Structure Learning via Modular Subgraph Integration

Add code
Jan 28, 2026
Viaarxiv icon

Improving VQA Reliability: A Dual-Assessment Approach with Self-Reflection and Cross-Model Verification

Add code
Dec 16, 2025
Viaarxiv icon

MeshRipple: Structured Autoregressive Generation of Artist-Meshes

Add code
Dec 09, 2025
Viaarxiv icon

Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models

Add code
Sep 17, 2025
Figure 1 for Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models
Figure 2 for Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models
Figure 3 for Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models
Figure 4 for Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models
Viaarxiv icon

TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis

Add code
May 23, 2025
Figure 1 for TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis
Figure 2 for TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis
Figure 3 for TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis
Figure 4 for TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis
Viaarxiv icon

MX-Font++: Mixture of Heterogeneous Aggregation Experts for Few-shot Font Generation

Add code
Mar 04, 2025
Viaarxiv icon

DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training

Add code
Aug 01, 2024
Figure 1 for DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Figure 2 for DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Figure 3 for DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Figure 4 for DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Viaarxiv icon

Facial Attribute Transformers for Precise and Robust Makeup Transfer

Add code
Apr 07, 2021
Figure 1 for Facial Attribute Transformers for Precise and Robust Makeup Transfer
Figure 2 for Facial Attribute Transformers for Precise and Robust Makeup Transfer
Figure 3 for Facial Attribute Transformers for Precise and Robust Makeup Transfer
Figure 4 for Facial Attribute Transformers for Precise and Robust Makeup Transfer
Viaarxiv icon

On Vocabulary Reliance in Scene Text Recognition

Add code
May 08, 2020
Figure 1 for On Vocabulary Reliance in Scene Text Recognition
Figure 2 for On Vocabulary Reliance in Scene Text Recognition
Figure 3 for On Vocabulary Reliance in Scene Text Recognition
Figure 4 for On Vocabulary Reliance in Scene Text Recognition
Viaarxiv icon