Shen Yan

Streaming Dense Video Captioning

Apr 01, 2024
Xingyi Zhou, Anurag Arnab, Shyamal Buch, Shen Yan, Austin Myers, Xuehan Xiong, Arsha Nagrani, Cordelia Schmid

VideoPrism: A Foundational Visual Encoder for Video Understanding

Feb 20, 2024
Long Zhao, Nitesh B. Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong

PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter

Feb 16, 2024
Junfei Xiao, Zheng Xu, Alan Yuille, Shen Yan, Boyu Wang

UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization

Jan 11, 2024
Rouwan Wu, Xiaoya Cheng, Juelin Zhu, Xuxiang Liu, Maojun Zhang, Shen Yan

Efficient Large Language Models: A Survey

Dec 23, 2023
Zhongwei Wan, Xin Wang, Che Liu, Samiul Alam, Yu Zheng, Jiachen Liu, Zhongnan Qu, Shen Yan, Yi Zhu, Quanlu Zhang, Mosharaf Chowdhury, Mi Zhang

Pixel Aligned Language Models

Dec 14, 2023
Jiarui Xu, Xingyi Zhou, Shen Yan, Xiuye Gu, Anurag Arnab, Chen Sun, Xiaolong Wang, Cordelia Schmid

UnLoc: A Unified Framework for Video Localization Tasks

Aug 21, 2023
Shen Yan, Xuehan Xiong, Arsha Nagrani, Anurag Arnab, Zhonghao Wang, Weina Ge, David Ross, Cordelia Schmid

AutoTaskFormer: Searching Vision Transformers for Multi-task Learning

Apr 20, 2023
Yang Liu, Shen Yan, Yuge Zhang, Kan Ren, Quanlu Zhang, Zebin Ren, Deng Cai, Mi Zhang

Long-term Visual Localization with Mobile Sensors

Apr 16, 2023
Shen Yan, Yu Liu, Long Wang, Zehong Shen, Zhen Peng, Haomin Liu, Maojun Zhang, Guofeng Zhang, Xiaowei Zhou

Towards Memory- and Time-Efficient Backpropagation for Training Spiking Neural Networks

Mar 07, 2023
Qingyan Meng, Mingqing Xiao, Shen Yan, Yisen Wang, Zhouchen Lin, Zhi-Quan Luo
