Maria Wang

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Feb 19, 2024
Gilles Baechler, Srinivas Sunkara, Maria Wang, Fedir Zubach, Hassan Mansoor, Vincent Etter, Victor Cărbune, Jason Lin, Jindong Chen, Abhanshu Sharma

Via arXiv

Towards Better Semantic Understanding of Mobile Interfaces

Oct 06, 2022
Srinivas Sunkara, Maria Wang, Lijuan Liu, Gilles Baechler, Yu-Chung Hsiao, Jindong Chen, Abhanshu Sharma, James Stout

Via arXiv

ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots

Sep 16, 2022
Yu-Chung Hsiao, Fedir Zubach, Maria Wang, Jindong Chen

Via arXiv

PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling

Jul 06, 2021
Xiaoxue Zang, Lijuan Liu, Maria Wang, Yang Song, Hao Zhang, Jindong Chen

Via arXiv