Alert button
Picture for Yixiao Ge

Yixiao Ge

Alert button

SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension

Add code
Bookmark button
Alert button
Apr 25, 2024
Bohao Li, Yuying Ge, Yi Chen, Yixiao Ge, Ruimao Zhang, Ying Shan

Viaarxiv icon

SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation

Add code
Bookmark button
Alert button
Apr 22, 2024
Yuying Ge, Sijie Zhao, Jinguo Zhu, Yixiao Ge, Kun Yi, Lin Song, Chen Li, Xiaohan Ding, Ying Shan

Viaarxiv icon

ST-LLM: Large Language Models Are Effective Temporal Learners

Add code
Bookmark button
Alert button
Mar 30, 2024
Ruyang Liu, Chen Li, Haoran Tang, Yixiao Ge, Ying Shan, Ge Li

Viaarxiv icon

YOLO-World: Real-Time Open-Vocabulary Object Detection

Add code
Bookmark button
Alert button
Feb 02, 2024
Tianheng Cheng, Lin Song, Yixiao Ge, Wenyu Liu, Xinggang Wang, Ying Shan

Viaarxiv icon

Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Add code
Bookmark button
Alert button
Jan 25, 2024
Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong, Yixiao Ge, Ying Shan, Xiangyu Yue

Viaarxiv icon

Supervised Fine-tuning in turn Improves Visual Foundation Models

Add code
Bookmark button
Alert button
Jan 18, 2024
Xiaohu Jiang, Yixiao Ge, Yuying Ge, Chun Yuan, Ying Shan

Viaarxiv icon

Towards A Better Metric for Text-to-Video Generation

Add code
Bookmark button
Alert button
Jan 15, 2024
Jay Zhangjie Wu, Guian Fang, Haoning Wu, Xintao Wang, Yixiao Ge, Xiaodong Cun, David Junhao Zhang, Jia-Wei Liu, Yuchao Gu, Rui Zhao, Weisi Lin, Wynne Hsu, Ying Shan, Mike Zheng Shou

Viaarxiv icon

LLaMA Pro: Progressive LLaMA with Block Expansion

Add code
Bookmark button
Alert button
Jan 04, 2024
Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ping Luo, Ying Shan

Viaarxiv icon

Cached Transformers: Improving Transformers with Differentiable Memory Cache

Add code
Bookmark button
Alert button
Dec 20, 2023
Zhaoyang Zhang, Wenqi Shao, Yixiao Ge, Xiaogang Wang, Jinwei Gu, Ping Luo

Viaarxiv icon

VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation

Add code
Bookmark button
Alert button
Dec 14, 2023
Jinguo Zhu, Xiaohan Ding, Yixiao Ge, Yuying Ge, Sijie Zhao, Hengshuang Zhao, Xiaohua Wang, Ying Shan

Viaarxiv icon