Alert button
Picture for Jinrui Zhang

Jinrui Zhang

Alert button

Ego3DPose: Capturing 3D Cues from Binocular Egocentric Views

Add code
Bookmark button
Alert button
Sep 21, 2023
Taeho Kang, Kyungjin Lee, Jinrui Zhang, Youngki Lee

Figure 1 for Ego3DPose: Capturing 3D Cues from Binocular Egocentric Views
Figure 2 for Ego3DPose: Capturing 3D Cues from Binocular Egocentric Views
Figure 3 for Ego3DPose: Capturing 3D Cues from Binocular Egocentric Views
Figure 4 for Ego3DPose: Capturing 3D Cues from Binocular Egocentric Views
Viaarxiv icon

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

Add code
Bookmark button
Alert button
Jul 31, 2023
Junjie Fei, Teng Wang, Jinrui Zhang, Zhenyu He, Chengjie Wang, Feng Zheng

Figure 1 for Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
Figure 2 for Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
Figure 3 for Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
Figure 4 for Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
Viaarxiv icon

LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning

Add code
Bookmark button
Alert button
Jun 17, 2023
Yunlong Tang, Jinrui Zhang, Xiangchen Wang, Teng Wang, Feng Zheng

Figure 1 for LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning
Figure 2 for LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning
Figure 3 for LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning
Figure 4 for LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning
Viaarxiv icon

Caption Anything: Interactive Image Description with Diverse Multimodal Controls

Add code
Bookmark button
Alert button
May 08, 2023
Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao

Figure 1 for Caption Anything: Interactive Image Description with Diverse Multimodal Controls
Figure 2 for Caption Anything: Interactive Image Description with Diverse Multimodal Controls
Figure 3 for Caption Anything: Interactive Image Description with Diverse Multimodal Controls
Figure 4 for Caption Anything: Interactive Image Description with Diverse Multimodal Controls
Viaarxiv icon

Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos

Add code
Bookmark button
Alert button
Mar 11, 2023
Teng Wang, Jinrui Zhang, Feng Zheng, Wenhao Jiang, Ran Cheng, Ping Luo

Figure 1 for Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
Figure 2 for Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
Figure 3 for Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
Figure 4 for Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
Viaarxiv icon

Exploiting Context Information for Generic Event Boundary Captioning

Add code
Bookmark button
Alert button
Jul 03, 2022
Jinrui Zhang, Teng Wang, Feng Zheng, Ran Cheng, Ping Luo

Figure 1 for Exploiting Context Information for Generic Event Boundary Captioning
Figure 2 for Exploiting Context Information for Generic Event Boundary Captioning
Viaarxiv icon