Alert button
Picture for Lei Zhang

Lei Zhang

Alert button

IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting

Add code
Bookmark button
Alert button
Mar 20, 2024
Hang Wang, Zhi-Qi Cheng, Youtian Du, Lei Zhang

Figure 1 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Figure 2 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Figure 3 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Figure 4 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Viaarxiv icon

Compress3D: a Compressed Latent Space for 3D Generation from a Single Image

Add code
Bookmark button
Alert button
Mar 20, 2024
Bowen Zhang, Tianyu Yang, Yu Li, Lei Zhang, Xi Zhao

Figure 1 for Compress3D: a Compressed Latent Space for 3D Generation from a Single Image
Figure 2 for Compress3D: a Compressed Latent Space for 3D Generation from a Single Image
Figure 3 for Compress3D: a Compressed Latent Space for 3D Generation from a Single Image
Figure 4 for Compress3D: a Compressed Latent Space for 3D Generation from a Single Image
Viaarxiv icon

HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models

Add code
Bookmark button
Alert button
Mar 20, 2024
Wenqiao Zhang, Tianwei Lin, Jiang Liu, Fangxun Shu, Haoyuan Li, Lei Zhang, He Wanggui, Hao Zhou, Zheqi Lv, Hao Jiang, Juncheng Li, Siliang Tang, Yueting Zhuang

Figure 1 for HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Figure 2 for HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Figure 3 for HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Figure 4 for HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Viaarxiv icon

TAPTR: Tracking Any Point with Transformers as Detection

Add code
Bookmark button
Alert button
Mar 19, 2024
Hongyang Li, Hao Zhang, Shilong Liu, Zhaoyang Zeng, Tianhe Ren, Feng Li, Lei Zhang

Figure 1 for TAPTR: Tracking Any Point with Transformers as Detection
Figure 2 for TAPTR: Tracking Any Point with Transformers as Detection
Figure 3 for TAPTR: Tracking Any Point with Transformers as Detection
Figure 4 for TAPTR: Tracking Any Point with Transformers as Detection
Viaarxiv icon

CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility

Add code
Bookmark button
Alert button
Mar 18, 2024
Bojia Zi, Shihao Zhao, Xianbiao Qi, Jianan Wang, Yukai Shi, Qianyu Chen, Bin Liang, Kam-Fai Wong, Lei Zhang

Figure 1 for CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility
Figure 2 for CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility
Figure 3 for CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility
Figure 4 for CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility
Viaarxiv icon

Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM

Add code
Bookmark button
Alert button
Mar 18, 2024
Linyu Tang, Lei Zhang

Figure 1 for Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM
Figure 2 for Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM
Figure 3 for Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM
Figure 4 for Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM
Viaarxiv icon

Self-Supervised Video Desmoking for Laparoscopic Surgery

Add code
Bookmark button
Alert button
Mar 17, 2024
Renlong Wu, Zhilu Zhang, Shuohao Zhang, Longfei Gou, Haobin Chen, Lei Zhang, Hao Chen, Wangmeng Zuo

Figure 1 for Self-Supervised Video Desmoking for Laparoscopic Surgery
Figure 2 for Self-Supervised Video Desmoking for Laparoscopic Surgery
Figure 3 for Self-Supervised Video Desmoking for Laparoscopic Surgery
Figure 4 for Self-Supervised Video Desmoking for Laparoscopic Surgery
Viaarxiv icon

Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models

Add code
Bookmark button
Alert button
Mar 17, 2024
Ruibin Li, Ruihuang Li, Song Guo, Lei Zhang

Figure 1 for Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Figure 2 for Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Figure 3 for Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Figure 4 for Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Viaarxiv icon

Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription

Add code
Bookmark button
Alert button
Mar 16, 2024
Hongxiang Zhao, Xili Dai, Jianan Wang, Shengbang Tong, Jingyuan Zhang, Weida Wang, Lei Zhang, Yi Ma

Figure 1 for Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription
Figure 2 for Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription
Figure 3 for Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription
Figure 4 for Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription
Viaarxiv icon

A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment

Add code
Bookmark button
Alert button
Mar 16, 2024
Tianhe Wu, Kede Ma, Jie Liang, Yujiu Yang, Lei Zhang

Figure 1 for A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment
Figure 2 for A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment
Figure 3 for A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment
Figure 4 for A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment
Viaarxiv icon