Picture for Bingchen Zhao

Bingchen Zhao

Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning

Add code
Jun 18, 2024
Figure 1 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Figure 2 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Figure 3 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Figure 4 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Viaarxiv icon

What If We Recaption Billions of Web Images with LLaMA-3?

Add code
Jun 12, 2024
Figure 1 for What If We Recaption Billions of Web Images with LLaMA-3?
Figure 2 for What If We Recaption Billions of Web Images with LLaMA-3?
Figure 3 for What If We Recaption Billions of Web Images with LLaMA-3?
Figure 4 for What If We Recaption Billions of Web Images with LLaMA-3?
Viaarxiv icon

Labeled Data Selection for Category Discovery

Add code
Jun 07, 2024
Viaarxiv icon

Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights

Add code
May 31, 2024
Viaarxiv icon

HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing

Add code
Apr 15, 2024
Viaarxiv icon

Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery

Add code
Apr 13, 2024
Figure 1 for Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery
Figure 2 for Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery
Figure 3 for Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery
Figure 4 for Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery
Viaarxiv icon

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Add code
Apr 10, 2024
Viaarxiv icon

Beyond the Known: Novel Class Discovery for Open-world Graph Learning

Add code
Mar 29, 2024
Figure 1 for Beyond the Known: Novel Class Discovery for Open-world Graph Learning
Figure 2 for Beyond the Known: Novel Class Discovery for Open-world Graph Learning
Figure 3 for Beyond the Known: Novel Class Discovery for Open-world Graph Learning
Figure 4 for Beyond the Known: Novel Class Discovery for Open-world Graph Learning
Viaarxiv icon

AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability

Add code
Feb 14, 2024
Figure 1 for AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability
Figure 2 for AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability
Figure 3 for AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability
Figure 4 for AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability
Viaarxiv icon

Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning

Add code
Dec 18, 2023
Viaarxiv icon