Picture for Libo Zhang

Libo Zhang

National University of Defense Technology, Changsha, China

CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition

Add code
Jan 18, 2024
Viaarxiv icon

Text Region Multiple Information Perception Network for Scene Text Detection

Add code
Jan 18, 2024
Figure 1 for Text Region Multiple Information Perception Network for Scene Text Detection
Figure 2 for Text Region Multiple Information Perception Network for Scene Text Detection
Figure 3 for Text Region Multiple Information Perception Network for Scene Text Detection
Figure 4 for Text Region Multiple Information Perception Network for Scene Text Detection
Viaarxiv icon

High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering

Add code
Jan 16, 2024
Figure 1 for High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering
Figure 2 for High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering
Figure 3 for High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering
Figure 4 for High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering
Viaarxiv icon

Context-Guided Spatio-Temporal Video Grounding

Add code
Jan 03, 2024
Figure 1 for Context-Guided Spatio-Temporal Video Grounding
Figure 2 for Context-Guided Spatio-Temporal Video Grounding
Figure 3 for Context-Guided Spatio-Temporal Video Grounding
Figure 4 for Context-Guided Spatio-Temporal Video Grounding
Viaarxiv icon

Flow-Guided Diffusion for Video Inpainting

Add code
Nov 26, 2023
Figure 1 for Flow-Guided Diffusion for Video Inpainting
Figure 2 for Flow-Guided Diffusion for Video Inpainting
Figure 3 for Flow-Guided Diffusion for Video Inpainting
Figure 4 for Flow-Guided Diffusion for Video Inpainting
Viaarxiv icon

Local Compressed Video Stream Learning for Generic Event Boundary Detection

Add code
Sep 27, 2023
Figure 1 for Local Compressed Video Stream Learning for Generic Event Boundary Detection
Figure 2 for Local Compressed Video Stream Learning for Generic Event Boundary Detection
Figure 3 for Local Compressed Video Stream Learning for Generic Event Boundary Detection
Figure 4 for Local Compressed Video Stream Learning for Generic Event Boundary Detection
Viaarxiv icon

Accurate and Fast Compressed Video Captioning

Add code
Sep 22, 2023
Viaarxiv icon

Collaborative Three-Stream Transformers for Video Captioning

Add code
Sep 18, 2023
Figure 1 for Collaborative Three-Stream Transformers for Video Captioning
Figure 2 for Collaborative Three-Stream Transformers for Video Captioning
Figure 3 for Collaborative Three-Stream Transformers for Video Captioning
Figure 4 for Collaborative Three-Stream Transformers for Video Captioning
Viaarxiv icon

Unsupervised Domain Adaptive Detection with Network Stability Analysis

Add code
Aug 16, 2023
Viaarxiv icon

AttMOT: Improving Multiple-Object Tracking by Introducing Auxiliary Pedestrian Attributes

Add code
Aug 15, 2023
Figure 1 for AttMOT: Improving Multiple-Object Tracking by Introducing Auxiliary Pedestrian Attributes
Figure 2 for AttMOT: Improving Multiple-Object Tracking by Introducing Auxiliary Pedestrian Attributes
Figure 3 for AttMOT: Improving Multiple-Object Tracking by Introducing Auxiliary Pedestrian Attributes
Figure 4 for AttMOT: Improving Multiple-Object Tracking by Introducing Auxiliary Pedestrian Attributes
Viaarxiv icon