Alert button
Picture for Longyin Wen

Longyin Wen

Alert button

Edit3K: Universal Representation Learning for Video Editing Components

Add code
Bookmark button
Alert button
Mar 24, 2024
Xin Gu, Libo Zhang, Fan Chen, Longyin Wen, Yufei Wang, Tiejian Luo, Sijie Zhu

Viaarxiv icon

Accurate and Fast Compressed Video Captioning

Add code
Bookmark button
Alert button
Sep 22, 2023
Yaojie Shen, Xin Gu, Kai Xu, Heng Fan, Longyin Wen, Libo Zhang

Figure 1 for Accurate and Fast Compressed Video Captioning
Figure 2 for Accurate and Fast Compressed Video Captioning
Figure 3 for Accurate and Fast Compressed Video Captioning
Figure 4 for Accurate and Fast Compressed Video Captioning
Viaarxiv icon

Exploring the Role of Audio in Video Captioning

Add code
Bookmark button
Alert button
Jun 21, 2023
Yuhan Shen, Linjie Yang, Longyin Wen, Haichao Yu, Ehsan Elhamifar, Heng Wang

Figure 1 for Exploring the Role of Audio in Video Captioning
Figure 2 for Exploring the Role of Audio in Video Captioning
Figure 3 for Exploring the Role of Audio in Video Captioning
Figure 4 for Exploring the Role of Audio in Video Captioning
Viaarxiv icon

Text with Knowledge Graph Augmented Transformer for Video Captioning

Add code
Bookmark button
Alert button
Mar 25, 2023
Xin Gu, Guang Chen, Yufei Wang, Libo Zhang, Tiejian Luo, Longyin Wen

Figure 1 for Text with Knowledge Graph Augmented Transformer for Video Captioning
Figure 2 for Text with Knowledge Graph Augmented Transformer for Video Captioning
Figure 3 for Text with Knowledge Graph Augmented Transformer for Video Captioning
Figure 4 for Text with Knowledge Graph Augmented Transformer for Video Captioning
Viaarxiv icon

DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training

Add code
Bookmark button
Alert button
Mar 06, 2023
Wei Li, Linchao Zhu, Longyin Wen, Yi Yang

Figure 1 for DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Figure 2 for DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Figure 3 for DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Figure 4 for DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Viaarxiv icon

Dual-Stream Transformer for Generic Event Boundary Captioning

Add code
Bookmark button
Alert button
Jul 07, 2022
Xin Gu, Hanhua Ye, Guang Chen, Yufei Wang, Libo Zhang, Longyin Wen

Figure 1 for Dual-Stream Transformer for Generic Event Boundary Captioning
Figure 2 for Dual-Stream Transformer for Generic Event Boundary Captioning
Figure 3 for Dual-Stream Transformer for Generic Event Boundary Captioning
Figure 4 for Dual-Stream Transformer for Generic Event Boundary Captioning
Viaarxiv icon

SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection

Add code
Bookmark button
Alert button
Jun 25, 2022
Dexiang Hong, Xiaoqi Ma, Xinyao Wang, Congcong Li, Yufei Wang, Longyin Wen

Figure 1 for SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection
Figure 2 for SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection
Figure 3 for SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection
Figure 4 for SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection
Viaarxiv icon

Structured Context Transformer for Generic Event Boundary Detection

Add code
Bookmark button
Alert button
Jun 07, 2022
Congcong Li, Xinyao Wang, Dexiang Hong, Yufei Wang, Libo Zhang, Tiejian Luo, Longyin Wen

Figure 1 for Structured Context Transformer for Generic Event Boundary Detection
Figure 2 for Structured Context Transformer for Generic Event Boundary Detection
Figure 3 for Structured Context Transformer for Generic Event Boundary Detection
Figure 4 for Structured Context Transformer for Generic Event Boundary Detection
Viaarxiv icon

End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection

Add code
Bookmark button
Alert button
Mar 29, 2022
Congcong Li, Xinyao Wang, Longyin Wen, Dexiang Hong, Tiejian Luo, Libo Zhang

Figure 1 for End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection
Figure 2 for End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection
Figure 3 for End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection
Figure 4 for End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection
Viaarxiv icon

Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark

Add code
Bookmark button
Alert button
Aug 16, 2021
Boying Wang, Libo Zhang, Longyin Wen, Xianglong Liu, Yanjun Wu

Figure 1 for Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark
Figure 2 for Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark
Figure 3 for Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark
Figure 4 for Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark
Viaarxiv icon