Picture for Haibo Qiu

Haibo Qiu

MobileDreamer: Generative Sketch World Model for GUI Agent

Add code
Jan 07, 2026
Viaarxiv icon

Learning When to Look: A Disentangled Curriculum for Strategic Perception in Multimodal Reasoning

Add code
Dec 19, 2025
Viaarxiv icon

Context Cascade Compression: Exploring the Upper Limits of Text Compression

Add code
Nov 19, 2025
Figure 1 for Context Cascade Compression: Exploring the Upper Limits of Text Compression
Figure 2 for Context Cascade Compression: Exploring the Upper Limits of Text Compression
Figure 3 for Context Cascade Compression: Exploring the Upper Limits of Text Compression
Figure 4 for Context Cascade Compression: Exploring the Upper Limits of Text Compression
Viaarxiv icon

Metis-HOME: Hybrid Optimized Mixture-of-Experts for Multimodal Reasoning

Add code
Oct 23, 2025
Viaarxiv icon

Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning

Add code
Jun 16, 2025
Viaarxiv icon

UniToken: Harmonizing Multimodal Understanding and Generation through Unified Visual Encoding

Add code
Apr 06, 2025
Viaarxiv icon

PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation

Add code
Oct 11, 2023
Figure 1 for PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation
Figure 2 for PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation
Figure 3 for PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation
Figure 4 for PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation
Viaarxiv icon

Collect-and-Distribute Transformer for 3D Point Cloud Analysis

Add code
Jun 02, 2023
Figure 1 for Collect-and-Distribute Transformer for 3D Point Cloud Analysis
Figure 2 for Collect-and-Distribute Transformer for 3D Point Cloud Analysis
Figure 3 for Collect-and-Distribute Transformer for 3D Point Cloud Analysis
Figure 4 for Collect-and-Distribute Transformer for 3D Point Cloud Analysis
Viaarxiv icon

GFNet: Geometric Flow Network for 3D Point Cloud Semantic Segmentation

Add code
Jul 06, 2022
Figure 1 for GFNet: Geometric Flow Network for 3D Point Cloud Semantic Segmentation
Figure 2 for GFNet: Geometric Flow Network for 3D Point Cloud Semantic Segmentation
Figure 3 for GFNet: Geometric Flow Network for 3D Point Cloud Semantic Segmentation
Figure 4 for GFNet: Geometric Flow Network for 3D Point Cloud Semantic Segmentation
Viaarxiv icon

End2End Occluded Face Recognition by Masking Corrupted Features

Add code
Aug 21, 2021
Figure 1 for End2End Occluded Face Recognition by Masking Corrupted Features
Figure 2 for End2End Occluded Face Recognition by Masking Corrupted Features
Figure 3 for End2End Occluded Face Recognition by Masking Corrupted Features
Figure 4 for End2End Occluded Face Recognition by Masking Corrupted Features
Viaarxiv icon