Lei Li

Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition

Mar 19, 2024
Jielin Qiu, William Han, Winfred Wang, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Christos Faloutsos, Lei Li, Lijuan Wang

Via arXiv

Word Order's Impacts: Insights from Reordering and Generation Analysis

Mar 18, 2024
Qinghua Zhao, Jiaang Li, Lei Li, Zenghui Zhou, Junfeng Liu

Via arXiv

Large Language Model-informed ECG Dual Attention Network for Heart Failure Risk Prediction

Mar 15, 2024
Chen Chen, Lei Li, Marcel Beetz, Abhirup Banerjee, Ramneek Gupta, Vicente Grau

Via arXiv

Tree Counting by Bridging 3D Point Clouds with Imagery

Mar 12, 2024
Lei Li, Tianfang Zhang, Zhongyu Jiang, Cheng-Yen Yang, Jenq-Neng Hwang, Stefan Oehmcke, Dimitri Pierre Johannes Gominski, Fabian Gieseke, Christian Igel

Via arXiv

SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM

Mar 07, 2024
Jielin Qiu, Andrea Madotto, Zhaojiang Lin, Paul A. Crook, Yifan Ethan Xu, Xin Luna Dong, Christos Faloutsos, Lei Li, Babak Damavandi, Seungwhan Moon

Via arXiv

MedFLIP: Medical Vision-and-Language Self-supervised Fast Pre-Training with Masked Autoencoder

Mar 07, 2024
Lei Li, Tianfang Zhang, Xinglin Zhang, Jiaqi Liu, Bingqi Ma, Yan Luo, Tao Chen

Via arXiv

ImgTrojan: Jailbreaking Vision-Language Models with ONE Image

Mar 06, 2024
Xijia Tao, Shuai Zhong, Lei Li, Qi Liu, Lingpeng Kong

Via arXiv

Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models

Mar 04, 2024
Lei Li, Yuqi Wang, Runxin Xu, Peiyi Wang, Xiachong Feng, Lingpeng Kong, Qi Liu

Via arXiv

TempCompass: Do Video LLMs Really Understand Videos?

Mar 01, 2024
Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou

Via arXiv