Mingliang Zhai

IA-T2I: Internet-Augmented Text-to-Image Generation

May 21, 2025
Via arXiv

Memory-Centric Embodied Question Answer

May 20, 2025
Via arXiv

World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving

Dec 09, 2024
Via arXiv

Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding

May 19, 2023
Via arXiv