Picture for Yu Qiao

Yu Qiao

ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, SIAT Branch, Shenzhen Institute of Artificial Intelligence and Robotics for Society

Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision

Add code
Apr 08, 2025
Viaarxiv icon

ArchCAD-400K: An Open Large-Scale Architectural CAD Dataset and New Baseline for Panoptic Symbol Spotting

Add code
Apr 02, 2025
Viaarxiv icon

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

Add code
Mar 27, 2025
Viaarxiv icon

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Add code
Mar 27, 2025
Figure 1 for Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
Figure 2 for Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
Figure 3 for Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
Figure 4 for Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
Viaarxiv icon

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Add code
Mar 27, 2025
Viaarxiv icon

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Add code
Mar 25, 2025
Figure 1 for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
Figure 2 for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
Figure 3 for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
Figure 4 for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
Viaarxiv icon

Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy

Add code
Mar 25, 2025
Viaarxiv icon

MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset

Add code
Mar 17, 2025
Figure 1 for MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset
Figure 2 for MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset
Figure 3 for MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset
Figure 4 for MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset
Viaarxiv icon

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Add code
Mar 13, 2025
Figure 1 for VisualPRM: An Effective Process Reward Model for Multimodal Reasoning
Figure 2 for VisualPRM: An Effective Process Reward Model for Multimodal Reasoning
Figure 3 for VisualPRM: An Effective Process Reward Model for Multimodal Reasoning
Figure 4 for VisualPRM: An Effective Process Reward Model for Multimodal Reasoning
Viaarxiv icon

Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey

Add code
Mar 13, 2025
Figure 1 for Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey
Figure 2 for Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey
Figure 3 for Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey
Figure 4 for Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey
Viaarxiv icon