Guangtao Zhai

Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark

Apr 16, 2025
Via arXiv

Omni$^2$: Unifying Omnidirectional Image Generation and Editing in an Omni Model

Apr 15, 2025
Via arXiv

PuzzleBench: A Fully Dynamic Evaluation Framework for Large Multimodal Models on Puzzle Solving

Apr 15, 2025
Via arXiv

FVQ: A Large-Scale Dataset and A LMM-based Method for Face Video Quality Assessment

Apr 12, 2025
Via arXiv

Towards Explainable Partial-AIGC Image Quality Assessment

Apr 12, 2025
Via arXiv

LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs

Apr 11, 2025
Via arXiv

Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model

Apr 09, 2025
Via arXiv

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Apr 03, 2025
Via arXiv

Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes

Apr 02, 2025
Via arXiv

Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy

Mar 27, 2025
Via arXiv