Shiyi Lan

EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks

Jan 31, 2024
Shijia Liao, Shiyi Lan, Arun George Zachariah

Fully Attentional Networks with Self-emerging Token Labeling

Jan 08, 2024
Bingyin Zhao, Zhiding Yu, Shiyi Lan, Yutao Cheng, Anima Anandkumar, Yingjie Lao, Jose M. Alvarez

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties

Dec 21, 2023
Junfei Xiao, Ziqi Zhou, Wenxuan Li, Shiyi Lan, Jieru Mei, Zhiding Yu, Alan Yuille, Yuyin Zhou, Cihang Xie

Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?

Dec 05, 2023
Zhiqi Li, Zhiding Yu, Shiyi Lan, Jiahan Li, Jan Kautz, Tong Lu, Jose M. Alvarez

BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection

Dec 04, 2023
Zhenxin Li, Shiyi Lan, Jose M. Alvarez, Zuxuan Wu

Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding

Nov 30, 2023
Wujian Peng, Sicheng Xie, Zuyao You, Shiyi Lan, Zuxuan Wu

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Nov 24, 2023
Lingchen Meng, Shiyi Lan, Hengduo Li, Jose M. Alvarez, Zuxuan Wu, Yu-Gang Jiang

FocalFormer3D: Focusing on Hard Instance for 3D Object Detection

Aug 08, 2023
Yilun Chen, Zhiding Yu, Yukang Chen, Shiyi Lan, Animashree Anandkumar, Jiaya Jia, Jose Alvarez

FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation

Jul 04, 2023
Zhiqi Li, Zhiding Yu, David Austin, Mingsheng Fang, Shiyi Lan, Jan Kautz, Jose M. Alvarez

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning

Feb 09, 2023
Zhuolin Yang, Wei Ping, Zihan Liu, Vijay Korthikanti, Weili Nie, De-An Huang, Linxi Fan, Zhiding Yu, Shiyi Lan, Bo Li, Ming-Yu Liu, Yuke Zhu, Mohammad Shoeybi, Bryan Catanzaro, Chaowei Xiao, Anima Anandkumar
