Xueyuan Leng

ScreenAgent: A Vision Language Model-driven Computer Control Agent

Feb 09, 2024
Via arXiv