Picture for Wanqi Zhong

Wanqi Zhong

Token Predictors Are Not Planners: Building Physically Grounded Causal Reasoners

Add code
Jun 01, 2026
Viaarxiv icon

Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts

Add code
May 18, 2024
Figure 1 for Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
Figure 2 for Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
Figure 3 for Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
Figure 4 for Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
Viaarxiv icon

A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering

Add code
Nov 13, 2023
Viaarxiv icon