Yixiao Ge

EgoPlan-Bench: Benchmarking Egocentric Embodied Planning with Multimodal Large Language Models

Dec 11, 2023
Yi Chen, Yuying Ge, Yixiao Ge, Mingyu Ding, Bohao Li, Rui Wang, Ruifeng Xu, Ying Shan, Xihui Liu

Via arXiv

SEED-Bench-2: Benchmarking Multimodal Large Language Models

Nov 28, 2023
Bohao Li, Yuying Ge, Yixiao Ge, Guangzhi Wang, Rui Wang, Ruimao Zhang, Ying Shan

Via arXiv

ViT-Lens-2: Gateway to Omni-modal Intelligence

Nov 27, 2023
Weixian Lei, Yixiao Ge, Kun Yi, Jianfeng Zhang, Difei Gao, Dylan Sun, Yuying Ge, Ying Shan, Mike Zheng Shou

Via arXiv

UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Nov 27, 2023
Xiaohan Ding, Yiyuan Zhang, Yixiao Ge, Sijie Zhao, Lin Song, Xiangyu Yue, Ying Shan

Via arXiv

Vision-Language Instruction Tuning: A Review and Analysis

Nov 25, 2023
Chen Li, Yixiao Ge, Dian Li, Ying Shan

Via arXiv

Meta-Adapter: An Online Few-shot Learner for Vision-Language Model

Nov 07, 2023
Cheng Cheng, Lin Song, Ruoyi Xue, Hang Wang, Hongbin Sun, Yixiao Ge, Ying Shan

Via arXiv

Making LLaMA SEE and Draw with SEED Tokenizer

Oct 02, 2023
Yuying Ge, Sijie Zhao, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan

Via arXiv

One For All: Video Conversation is Feasible Without Video Instruction Tuning

Sep 27, 2023
Ruyang Liu, Chen Li, Yixiao Ge, Ying Shan, Thomas H. Li, Ge Li

Via arXiv

Equivariant Symmetries for Inertial Navigation Systems

Sep 07, 2023
Alessandro Fornasier, Yixiao Ge, Pieter van Goor, Robert Mahony, Stephan Weiss

Via arXiv