Picture for Yusheng Xie

Yusheng Xie

LaTr: Layout-Aware Transformer for Scene-Text VQA

Add code
Dec 24, 2021
Figure 1 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Figure 2 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Figure 3 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Figure 4 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Viaarxiv icon

TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation

Add code
Oct 29, 2021
Figure 1 for TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation
Figure 2 for TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation
Figure 3 for TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation
Figure 4 for TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation
Viaarxiv icon

DocFormer: End-to-End Transformer for Document Understanding

Add code
Jun 22, 2021
Viaarxiv icon

MVHM: A Large-Scale Multi-View Hand Mesh Benchmark for Accurate 3D Hand Pose Estimation

Add code
Dec 06, 2020
Figure 1 for MVHM: A Large-Scale Multi-View Hand Mesh Benchmark for Accurate 3D Hand Pose Estimation
Figure 2 for MVHM: A Large-Scale Multi-View Hand Mesh Benchmark for Accurate 3D Hand Pose Estimation
Figure 3 for MVHM: A Large-Scale Multi-View Hand Mesh Benchmark for Accurate 3D Hand Pose Estimation
Figure 4 for MVHM: A Large-Scale Multi-View Hand Mesh Benchmark for Accurate 3D Hand Pose Estimation
Viaarxiv icon

Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos

Add code
Dec 06, 2020
Figure 1 for Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos
Figure 2 for Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos
Figure 3 for Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos
Figure 4 for Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos
Viaarxiv icon

DGGAN: Depth-image Guided Generative Adversarial Networks for Disentangling RGB and Depth Images in 3D Hand Pose Estimation

Add code
Dec 06, 2020
Figure 1 for DGGAN: Depth-image Guided Generative Adversarial Networks for Disentangling RGB and Depth Images in 3D Hand Pose Estimation
Figure 2 for DGGAN: Depth-image Guided Generative Adversarial Networks for Disentangling RGB and Depth Images in 3D Hand Pose Estimation
Figure 3 for DGGAN: Depth-image Guided Generative Adversarial Networks for Disentangling RGB and Depth Images in 3D Hand Pose Estimation
Figure 4 for DGGAN: Depth-image Guided Generative Adversarial Networks for Disentangling RGB and Depth Images in 3D Hand Pose Estimation
Viaarxiv icon

Towards Good Practices in Self-supervised Representation Learning

Add code
Dec 01, 2020
Figure 1 for Towards Good Practices in Self-supervised Representation Learning
Figure 2 for Towards Good Practices in Self-supervised Representation Learning
Figure 3 for Towards Good Practices in Self-supervised Representation Learning
Figure 4 for Towards Good Practices in Self-supervised Representation Learning
Viaarxiv icon

MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis

Add code
Oct 02, 2020
Figure 1 for MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis
Figure 2 for MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis
Figure 3 for MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis
Figure 4 for MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis
Viaarxiv icon

Generating Realistic Training Images Based on Tonality-Alignment Generative Adversarial Networks for Hand Pose Estimation

Add code
Nov 27, 2018
Figure 1 for Generating Realistic Training Images Based on Tonality-Alignment Generative Adversarial Networks for Hand Pose Estimation
Figure 2 for Generating Realistic Training Images Based on Tonality-Alignment Generative Adversarial Networks for Hand Pose Estimation
Figure 3 for Generating Realistic Training Images Based on Tonality-Alignment Generative Adversarial Networks for Hand Pose Estimation
Figure 4 for Generating Realistic Training Images Based on Tonality-Alignment Generative Adversarial Networks for Hand Pose Estimation
Viaarxiv icon

On the Generation of Medical Question-Answer Pairs

Add code
Nov 01, 2018
Figure 1 for On the Generation of Medical Question-Answer Pairs
Figure 2 for On the Generation of Medical Question-Answer Pairs
Figure 3 for On the Generation of Medical Question-Answer Pairs
Figure 4 for On the Generation of Medical Question-Answer Pairs
Viaarxiv icon