Picture for Guangtao Zhai

Guangtao Zhai

Affiliation 1, Affiliation 2

Robust Mesh Saliency GT Acquisition in VR via View Cone Sampling and Geometric Smoothing

Add code
Jan 06, 2026
Viaarxiv icon

VTONQA: A Multi-Dimensional Quality Assessment Dataset for Virtual Try-on

Add code
Jan 06, 2026
Viaarxiv icon

Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs

Add code
Dec 19, 2025
Figure 1 for Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs
Figure 2 for Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs
Figure 3 for Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs
Figure 4 for Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs
Viaarxiv icon

Using GUI Agent for Electronic Design Automation

Add code
Dec 12, 2025
Figure 1 for Using GUI Agent for Electronic Design Automation
Figure 2 for Using GUI Agent for Electronic Design Automation
Figure 3 for Using GUI Agent for Electronic Design Automation
Figure 4 for Using GUI Agent for Electronic Design Automation
Viaarxiv icon

Embodied Image Compression

Add code
Dec 12, 2025
Figure 1 for Embodied Image Compression
Figure 2 for Embodied Image Compression
Figure 3 for Embodied Image Compression
Figure 4 for Embodied Image Compression
Viaarxiv icon

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Add code
Nov 18, 2025
Viaarxiv icon

ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation

Add code
Nov 18, 2025
Figure 1 for ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation
Figure 2 for ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation
Figure 3 for ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation
Figure 4 for ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation
Viaarxiv icon

GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models

Add code
Nov 17, 2025
Viaarxiv icon

Data Assessment for Embodied Intelligence

Add code
Nov 12, 2025
Viaarxiv icon

MACEval: A Multi-Agent Continual Evaluation Network for Large Models

Add code
Nov 12, 2025
Viaarxiv icon