Picture for Xiaobin Zhu

Xiaobin Zhu

VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation

Add code
May 29, 2025
Viaarxiv icon

DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework

Add code
Mar 19, 2025
Viaarxiv icon

Video-Language Alignment Pre-training via Spatio-Temporal Graph Transformer

Add code
Jul 16, 2024
Figure 1 for Video-Language Alignment Pre-training via Spatio-Temporal Graph Transformer
Figure 2 for Video-Language Alignment Pre-training via Spatio-Temporal Graph Transformer
Figure 3 for Video-Language Alignment Pre-training via Spatio-Temporal Graph Transformer
Figure 4 for Video-Language Alignment Pre-training via Spatio-Temporal Graph Transformer
Viaarxiv icon

Transformer-based Reasoning for Learning Evolutionary Chain of Events on Temporal Knowledge Graph

Add code
May 01, 2024
Viaarxiv icon

Arbitrary Time Information Modeling via Polynomial Approximation for Temporal Knowledge Graph Embedding

Add code
May 01, 2024
Figure 1 for Arbitrary Time Information Modeling via Polynomial Approximation for Temporal Knowledge Graph Embedding
Figure 2 for Arbitrary Time Information Modeling via Polynomial Approximation for Temporal Knowledge Graph Embedding
Figure 3 for Arbitrary Time Information Modeling via Polynomial Approximation for Temporal Knowledge Graph Embedding
Figure 4 for Arbitrary Time Information Modeling via Polynomial Approximation for Temporal Knowledge Graph Embedding
Viaarxiv icon

Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling

Add code
Jan 08, 2024
Viaarxiv icon

Arbitrary Shape Text Detection via Segmentation with Probability Maps

Add code
Aug 26, 2022
Figure 1 for Arbitrary Shape Text Detection via Segmentation with Probability Maps
Figure 2 for Arbitrary Shape Text Detection via Segmentation with Probability Maps
Figure 3 for Arbitrary Shape Text Detection via Segmentation with Probability Maps
Figure 4 for Arbitrary Shape Text Detection via Segmentation with Probability Maps
Viaarxiv icon

Arbitrary Shape Text Detection via Boundary Transformer

Add code
May 11, 2022
Figure 1 for Arbitrary Shape Text Detection via Boundary Transformer
Figure 2 for Arbitrary Shape Text Detection via Boundary Transformer
Figure 3 for Arbitrary Shape Text Detection via Boundary Transformer
Figure 4 for Arbitrary Shape Text Detection via Boundary Transformer
Viaarxiv icon

Graph Fusion Network for Multi-Oriented Object Detection

Add code
May 07, 2022
Figure 1 for Graph Fusion Network for Multi-Oriented Object Detection
Figure 2 for Graph Fusion Network for Multi-Oriented Object Detection
Figure 3 for Graph Fusion Network for Multi-Oriented Object Detection
Figure 4 for Graph Fusion Network for Multi-Oriented Object Detection
Viaarxiv icon

Towards Open-Set Text Recognition via Label-to-Prototype Learning

Add code
Apr 09, 2022
Figure 1 for Towards Open-Set Text Recognition via Label-to-Prototype Learning
Figure 2 for Towards Open-Set Text Recognition via Label-to-Prototype Learning
Figure 3 for Towards Open-Set Text Recognition via Label-to-Prototype Learning
Figure 4 for Towards Open-Set Text Recognition via Label-to-Prototype Learning
Viaarxiv icon