Picture for Xihui Liu

Xihui Liu

SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

Add code
Oct 14, 2025
Viaarxiv icon

DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation

Add code
Sep 19, 2025
Viaarxiv icon

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

Add code
Sep 18, 2025
Viaarxiv icon

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

Add code
Sep 11, 2025
Viaarxiv icon

ChemBOMAS: Accelerated BO in Chemistry with LLM-Enhanced Multi-Agent System

Add code
Sep 10, 2025
Viaarxiv icon

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Add code
Aug 24, 2025
Figure 1 for T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
Figure 2 for T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
Figure 3 for T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
Figure 4 for T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
Viaarxiv icon

GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation

Add code
Aug 19, 2025
Viaarxiv icon

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Add code
Jul 10, 2025
Viaarxiv icon

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Add code
Jul 08, 2025
Viaarxiv icon

DreamComposer++: Empowering Diffusion Models with Multi-View Conditions for 3D Content Generation

Add code
Jul 03, 2025
Viaarxiv icon