Alert button
Picture for Yan Lu

Yan Lu

Alert button

Uncertainty-Aware Deep Video Compression with Ensembles

Add code
Bookmark button
Alert button
Mar 28, 2024
Wufei Ma, Jiahao Li, Bin Li, Yan Lu

Viaarxiv icon

RelationVLM: Making Large Vision-Language Models Understand Visual Relations

Add code
Bookmark button
Alert button
Mar 19, 2024
Zhipeng Huang, Zhizheng Zhang, Zheng-Jun Zha, Yan Lu, Baining Guo

Figure 1 for RelationVLM: Making Large Vision-Language Models Understand Visual Relations
Figure 2 for RelationVLM: Making Large Vision-Language Models Understand Visual Relations
Figure 3 for RelationVLM: Making Large Vision-Language Models Understand Visual Relations
Figure 4 for RelationVLM: Making Large Vision-Language Models Understand Visual Relations
Viaarxiv icon

Neural Video Compression with Feature Modulation

Add code
Bookmark button
Alert button
Feb 29, 2024
Jiahao Li, Bin Li, Yan Lu

Viaarxiv icon

Slot-VLM: SlowFast Slots for Video-Language Modeling

Add code
Bookmark button
Alert button
Feb 20, 2024
Jiaqi Xu, Cuiling Lan, Wenxuan Xie, Xuejin Chen, Yan Lu

Viaarxiv icon

Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement

Add code
Bookmark button
Alert button
Feb 15, 2024
Tao Yang, Cuiling Lan, Yan Lu, Nanning zheng

Viaarxiv icon

Masked Audio Modeling with CLAP and Multi-Objective Learning

Add code
Bookmark button
Alert button
Jan 29, 2024
Yifei Xin, Xiulian Peng, Yan Lu

Viaarxiv icon

Retrieval-based Video Language Model for Efficient Long Video Question Answering

Add code
Bookmark button
Alert button
Dec 08, 2023
Jiaqi Xu, Cuiling Lan, Wenxuan Xie, Xuejin Chen, Yan Lu

Figure 1 for Retrieval-based Video Language Model for Efficient Long Video Question Answering
Figure 2 for Retrieval-based Video Language Model for Efficient Long Video Question Answering
Figure 3 for Retrieval-based Video Language Model for Efficient Long Video Question Answering
Figure 4 for Retrieval-based Video Language Model for Efficient Long Video Question Answering
Viaarxiv icon

GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection

Add code
Bookmark button
Alert button
Oct 24, 2023
Yan Lu, Xinzhu Ma, Lei Yang, Tianzhu Zhang, Yating Liu, Qi Chu, Tong He, Yonghui Li, Wanli Ouyang

Figure 1 for GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection
Figure 2 for GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection
Figure 3 for GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection
Figure 4 for GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection
Viaarxiv icon

Low-latency Speech Enhancement via Speech Token Generation

Add code
Bookmark button
Alert button
Oct 20, 2023
Huaying Xue, Xiulian Peng, Yan Lu

Figure 1 for Low-latency Speech Enhancement via Speech Token Generation
Figure 2 for Low-latency Speech Enhancement via Speech Token Generation
Figure 3 for Low-latency Speech Enhancement via Speech Token Generation
Figure 4 for Low-latency Speech Enhancement via Speech Token Generation
Viaarxiv icon