Picture for Yuhui Zhang

Yuhui Zhang

μ-Bench: A Vision-Language Benchmark for Microscopy Understanding

Add code
Jul 01, 2024
Viaarxiv icon

Why are Visually-Grounded Language Models Bad at Image Classification?

Add code
May 28, 2024
Viaarxiv icon

A General and Efficient Federated Split Learning with Pre-trained Image Transformers for Heterogeneous Data

Add code
Mar 24, 2024
Figure 1 for A General and Efficient Federated Split Learning with Pre-trained Image Transformers for Heterogeneous Data
Figure 2 for A General and Efficient Federated Split Learning with Pre-trained Image Transformers for Heterogeneous Data
Figure 3 for A General and Efficient Federated Split Learning with Pre-trained Image Transformers for Heterogeneous Data
Figure 4 for A General and Efficient Federated Split Learning with Pre-trained Image Transformers for Heterogeneous Data
Viaarxiv icon

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Add code
Mar 15, 2024
Figure 1 for VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Figure 2 for VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Figure 3 for VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Figure 4 for VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Viaarxiv icon

Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data

Add code
Jan 16, 2024
Viaarxiv icon

Describing Differences in Image Sets with Natural Language

Add code
Dec 05, 2023
Figure 1 for Describing Differences in Image Sets with Natural Language
Figure 2 for Describing Differences in Image Sets with Natural Language
Figure 3 for Describing Differences in Image Sets with Natural Language
Figure 4 for Describing Differences in Image Sets with Natural Language
Viaarxiv icon

Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation

Add code
Nov 27, 2023
Figure 1 for Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation
Figure 2 for Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation
Figure 3 for Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation
Figure 4 for Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation
Viaarxiv icon

MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

Add code
Oct 31, 2023
Viaarxiv icon

Can large language models provide useful feedback on research papers? A large-scale empirical analysis

Add code
Oct 03, 2023
Figure 1 for Can large language models provide useful feedback on research papers? A large-scale empirical analysis
Figure 2 for Can large language models provide useful feedback on research papers? A large-scale empirical analysis
Figure 3 for Can large language models provide useful feedback on research papers? A large-scale empirical analysis
Figure 4 for Can large language models provide useful feedback on research papers? A large-scale empirical analysis
Viaarxiv icon

Inverse Scaling: When Bigger Isn't Better

Add code
Jun 15, 2023
Figure 1 for Inverse Scaling: When Bigger Isn't Better
Figure 2 for Inverse Scaling: When Bigger Isn't Better
Figure 3 for Inverse Scaling: When Bigger Isn't Better
Figure 4 for Inverse Scaling: When Bigger Isn't Better
Viaarxiv icon