Qi Zheng

Mobile-Agent-v3: Fundamental Agents for GUI Automation

Aug 21, 2025
Via arXiv

VLA-Mark: A cross modal watermark for large vision-language alignment model

Jul 18, 2025
Via arXiv

4KAgent: Agentic Any Image to 4K Super-Resolution

Jul 09, 2025
Via arXiv

Node Splitting SVMs for Survival Trees Based on an L2-Regularized Dipole Splitting Criteria

Jun 13, 2025
Via arXiv

End-to-End HOI Reconstruction Transformer with Graph-based Encoding

Mar 08, 2025
Via arXiv

An Atomic Skill Library Construction Method for Data-Efficient Embodied Manipulation

Jan 25, 2025
Via arXiv

ST-ReP: Learning Predictive Representations Efficiently for Spatial-Temporal Forecasting

Dec 19, 2024
Via arXiv

Unicorn: Unified Neural Image Compression with One Number Reconstruction

Dec 11, 2024
Via arXiv

Video Quality Assessment: A Comprehensive Survey

Dec 04, 2024
Via arXiv

M3-CVC: Controllable Video Compression with Multimodal Generative Models

Nov 24, 2024
Via arXiv