Picture for Xiaohong Liu

Xiaohong Liu

Omni$^2$: Unifying Omnidirectional Image Generation and Editing in an Omni Model

Add code
Apr 15, 2025
Viaarxiv icon

PuzzleBench: A Fully Dynamic Evaluation Framework for Large Multimodal Models on Puzzle Solving

Add code
Apr 15, 2025
Viaarxiv icon

ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language model

Add code
Apr 15, 2025
Viaarxiv icon

Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model

Add code
Apr 09, 2025
Viaarxiv icon

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Add code
Mar 27, 2025
Viaarxiv icon

UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines

Add code
Mar 26, 2025
Viaarxiv icon

Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector

Add code
Mar 26, 2025
Viaarxiv icon

Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing

Add code
Mar 25, 2025
Viaarxiv icon

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Add code
Mar 20, 2025
Viaarxiv icon

Variational Bayesian Personalized Ranking

Add code
Mar 14, 2025
Viaarxiv icon