Lei Li

Deep Learning for Trajectory Data Management and Mining: A Survey and Beyond

Mar 21, 2024
Wei Chen, Yuxuan Liang, Yuanshao Zhu, Yanchuan Chang, Kang Luo, Haomin Wen, Lei Li, Yanwei Yu, Qingsong Wen, Chao Chen, Kai Zheng, Yunjun Gao, Xiaofang Zhou, Yu Zheng

Via arXiv

Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition

Mar 19, 2024
Jielin Qiu, William Han, Winfred Wang, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Christos Faloutsos, Lei Li, Lijuan Wang

Via arXiv

Word Order's Impacts: Insights from Reordering and Generation Analysis

Mar 18, 2024
Qinghua Zhao, Jiaang Li, Lei Li, Zenghui Zhou, Junfeng Liu

Via arXiv

Large Language Model-informed ECG Dual Attention Network for Heart Failure Risk Prediction

Mar 15, 2024
Chen Chen, Lei Li, Marcel Beetz, Abhirup Banerjee, Ramneek Gupta, Vicente Grau

Via arXiv

Tree Counting by Bridging 3D Point Clouds with Imagery

Mar 12, 2024
Lei Li, Tianfang Zhang, Zhongyu Jiang, Cheng-Yen Yang, Jenq-Neng Hwang, Stefan Oehmcke, Dimitri Pierre Johannes Gominski, Fabian Gieseke, Christian Igel

Via arXiv

SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM

Mar 07, 2024
Jielin Qiu, Andrea Madotto, Zhaojiang Lin, Paul A. Crook, Yifan Ethan Xu, Xin Luna Dong, Christos Faloutsos, Lei Li, Babak Damavandi, Seungwhan Moon

Via arXiv

MedFLIP: Medical Vision-and-Language Self-supervised Fast Pre-Training with Masked Autoencoder

Mar 07, 2024
Lei Li, Tianfang Zhang, Xinglin Zhang, Jiaqi Liu, Bingqi Ma, Yan Luo, Tao Chen

Via arXiv

ImgTrojan: Jailbreaking Vision-Language Models with ONE Image

Mar 06, 2024
Xijia Tao, Shuai Zhong, Lei Li, Qi Liu, Lingpeng Kong

Via arXiv

Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models

Mar 04, 2024
Lei Li, Yuqi Wang, Runxin Xu, Peiyi Wang, Xiachong Feng, Lingpeng Kong, Qi Liu

Via arXiv