An Yan

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Aug 16, 2024
Via arXiv

CRAG -- Comprehensive RAG Benchmark

Jun 07, 2024
Via arXiv

List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Apr 25, 2024
Via arXiv

Bridging Language and Items for Retrieval and Recommendation

Mar 06, 2024
Via arXiv

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

Nov 13, 2023
Via arXiv

GPT-4V as a Generalist Evaluator for Vision-Language Tasks

Nov 02, 2023
Via arXiv

MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation

Oct 27, 2023
Via arXiv

Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated Driving

Oct 26, 2023
Via arXiv

Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models

Oct 04, 2023
Via arXiv

Learning Concise and Descriptive Attributes for Visual Recognition

Aug 07, 2023
Via arXiv