Picture for Xiaoyi Zhang

Xiaoyi Zhang

Generative Video Compression with One-Dimensional Latent Representation

Add code
Mar 16, 2026
Viaarxiv icon

Compression as Adaptation: Implicit Visual Representation with Diffusion Foundation Models

Add code
Mar 08, 2026
Viaarxiv icon

From Off-Policy to On-Policy: Enhancing GUI Agents via Bi-level Expert-to-Policy Assimilation

Add code
Jan 09, 2026
Viaarxiv icon

InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

Add code
Jan 08, 2026
Viaarxiv icon

UI-Evol: Automatic Knowledge Evolving for Computer Use Agents

Add code
May 28, 2025
Figure 1 for UI-Evol: Automatic Knowledge Evolving for Computer Use Agents
Figure 2 for UI-Evol: Automatic Knowledge Evolving for Computer Use Agents
Figure 3 for UI-Evol: Automatic Knowledge Evolving for Computer Use Agents
Figure 4 for UI-Evol: Automatic Knowledge Evolving for Computer Use Agents
Viaarxiv icon

Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding

Add code
May 23, 2025
Viaarxiv icon

UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis

Add code
Apr 16, 2025
Figure 1 for UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis
Figure 2 for UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis
Figure 3 for UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis
Figure 4 for UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis
Viaarxiv icon

Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis

Add code
May 13, 2024
Figure 1 for Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Figure 2 for Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Figure 3 for Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Figure 4 for Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Viaarxiv icon

Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference

Add code
Apr 03, 2024
Figure 1 for Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference
Figure 2 for Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference
Figure 3 for Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference
Figure 4 for Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference
Viaarxiv icon

GO-FEAP: Global Optimal UAV Planner Using Frontier-Omission-Aware Exploration and Altitude-Stratified Planning

Add code
Oct 24, 2023
Viaarxiv icon