Libo Zhang

National University of Defense Technology, Changsha, China

Text Region Multiple Information Perception Network for Scene Text Detection

Jan 18, 2024

CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition

Jan 18, 2024

High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering

Jan 16, 2024

Context-Guided Spatio-Temporal Video Grounding

Jan 03, 2024

Flow-Guided Diffusion for Video Inpainting

Nov 26, 2023

Local Compressed Video Stream Learning for Generic Event Boundary Detection

Sep 27, 2023

Accurate and Fast Compressed Video Captioning

Sep 22, 2023

Collaborative Three-Stream Transformers for Video Captioning

Sep 18, 2023

Unsupervised Domain Adaptive Detection with Network Stability Analysis

Aug 16, 2023

AttMOT: Improving Multiple-Object Tracking by Introducing Auxiliary Pedestrian Attributes

Aug 15, 2023