An Yan

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

Nov 13, 2023

GPT-4V as a Generalist Evaluator for Vision-Language Tasks

Nov 02, 2023

MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation

Oct 27, 2023

Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated Driving

Oct 26, 2023

Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models

Oct 04, 2023

Learning Concise and Descriptive Attributes for Visual Recognition

Aug 07, 2023

Comparing Apples to Apples: Generating Aspect-Aware Comparative Sentences from User Reviews

Jul 23, 2023

"Nothing Abnormal": Disambiguating Medical Reports via Contrastive Knowledge Infusion

May 15, 2023

CLIP also Understands Text: Prompting CLIP for Phrase Understanding

Oct 11, 2022

Visualize Before You Write: Imagination-Guided Open-Ended Text Generation

Oct 07, 2022