Picture for Xi Zhang

Xi Zhang

Macquarie University, Australia

Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration

Add code
Feb 25, 2025
Figure 1 for Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration
Figure 2 for Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration
Figure 3 for Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration
Figure 4 for Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration
Viaarxiv icon

PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC

Add code
Feb 21, 2025
Viaarxiv icon

Qwen2.5-VL Technical Report

Add code
Feb 19, 2025
Figure 1 for Qwen2.5-VL Technical Report
Figure 2 for Qwen2.5-VL Technical Report
Figure 3 for Qwen2.5-VL Technical Report
Figure 4 for Qwen2.5-VL Technical Report
Viaarxiv icon

SCCD: A Session-based Dataset for Chinese Cyberbullying Detection

Add code
Jan 25, 2025
Figure 1 for SCCD: A Session-based Dataset for Chinese Cyberbullying Detection
Figure 2 for SCCD: A Session-based Dataset for Chinese Cyberbullying Detection
Figure 3 for SCCD: A Session-based Dataset for Chinese Cyberbullying Detection
Figure 4 for SCCD: A Session-based Dataset for Chinese Cyberbullying Detection
Viaarxiv icon

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Add code
Jan 20, 2025
Figure 1 for Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks
Figure 2 for Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks
Figure 3 for Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks
Figure 4 for Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks
Viaarxiv icon

Gla-AI4BioMed at RRG24: Visual Instruction-tuned Adaptation for Radiology Report Generation

Add code
Dec 06, 2024
Figure 1 for Gla-AI4BioMed at RRG24: Visual Instruction-tuned Adaptation for Radiology Report Generation
Figure 2 for Gla-AI4BioMed at RRG24: Visual Instruction-tuned Adaptation for Radiology Report Generation
Figure 3 for Gla-AI4BioMed at RRG24: Visual Instruction-tuned Adaptation for Radiology Report Generation
Figure 4 for Gla-AI4BioMed at RRG24: Visual Instruction-tuned Adaptation for Radiology Report Generation
Viaarxiv icon

TSUBF-Net: Trans-Spatial UNet-like Network with Bi-direction Fusion for Segmentation of Adenoid Hypertrophy in CT

Add code
Dec 01, 2024
Viaarxiv icon

Libra: Leveraging Temporal Images for Biomedical Radiology Analysis

Add code
Nov 28, 2024
Figure 1 for Libra: Leveraging Temporal Images for Biomedical Radiology Analysis
Figure 2 for Libra: Leveraging Temporal Images for Biomedical Radiology Analysis
Figure 3 for Libra: Leveraging Temporal Images for Biomedical Radiology Analysis
Figure 4 for Libra: Leveraging Temporal Images for Biomedical Radiology Analysis
Viaarxiv icon

Learning Optimal Lattice Vector Quantizers for End-to-end Neural Image Compression

Add code
Nov 25, 2024
Viaarxiv icon

Trajectory Flow Matching with Applications to Clinical Time Series Modeling

Add code
Oct 28, 2024
Viaarxiv icon