Picture for Cha Zhang

Cha Zhang

LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding

Add code
Apr 18, 2021
Figure 1 for LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding
Figure 2 for LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding
Figure 3 for LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding
Figure 4 for LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding
Viaarxiv icon

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Add code
Dec 29, 2020
Figure 1 for LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Figure 2 for LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Figure 3 for LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Figure 4 for LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Viaarxiv icon

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption

Add code
Dec 08, 2020
Figure 1 for TAP: Text-Aware Pre-training for Text-VQA and Text-Caption
Figure 2 for TAP: Text-Aware Pre-training for Text-VQA and Text-Caption
Figure 3 for TAP: Text-Aware Pre-training for Text-VQA and Text-Caption
Figure 4 for TAP: Text-Aware Pre-training for Text-VQA and Text-Caption
Viaarxiv icon

Multimodal active speaker detection and virtual cinematography for video conferencing

Add code
Feb 12, 2020
Figure 1 for Multimodal active speaker detection and virtual cinematography for video conferencing
Figure 2 for Multimodal active speaker detection and virtual cinematography for video conferencing
Figure 3 for Multimodal active speaker detection and virtual cinematography for video conferencing
Figure 4 for Multimodal active speaker detection and virtual cinematography for video conferencing
Viaarxiv icon

Improving the Adversarial Robustness of Transfer Learning via Noisy Feature Distillation

Add code
Feb 07, 2020
Figure 1 for Improving the Adversarial Robustness of Transfer Learning via Noisy Feature Distillation
Figure 2 for Improving the Adversarial Robustness of Transfer Learning via Noisy Feature Distillation
Figure 3 for Improving the Adversarial Robustness of Transfer Learning via Noisy Feature Distillation
Figure 4 for Improving the Adversarial Robustness of Transfer Learning via Noisy Feature Distillation
Viaarxiv icon

LeGR: Filter Pruning via Learned Global Ranking

Add code
Apr 28, 2019
Figure 1 for LeGR: Filter Pruning via Learned Global Ranking
Figure 2 for LeGR: Filter Pruning via Learned Global Ranking
Figure 3 for LeGR: Filter Pruning via Learned Global Ranking
Figure 4 for LeGR: Filter Pruning via Learned Global Ranking
Viaarxiv icon

RePr: Improved Training of Convolutional Filters

Add code
Nov 26, 2018
Figure 1 for RePr: Improved Training of Convolutional Filters
Figure 2 for RePr: Improved Training of Convolutional Filters
Figure 3 for RePr: Improved Training of Convolutional Filters
Figure 4 for RePr: Improved Training of Convolutional Filters
Viaarxiv icon

Layer-compensated Pruning for Resource-constrained Convolutional Neural Networks

Add code
Oct 18, 2018
Figure 1 for Layer-compensated Pruning for Resource-constrained Convolutional Neural Networks
Figure 2 for Layer-compensated Pruning for Resource-constrained Convolutional Neural Networks
Figure 3 for Layer-compensated Pruning for Resource-constrained Convolutional Neural Networks
Figure 4 for Layer-compensated Pruning for Resource-constrained Convolutional Neural Networks
Viaarxiv icon

Orthogonal and Idempotent Transformations for Learning Deep Neural Networks

Add code
Jul 19, 2017
Figure 1 for Orthogonal and Idempotent Transformations for Learning Deep Neural Networks
Figure 2 for Orthogonal and Idempotent Transformations for Learning Deep Neural Networks
Figure 3 for Orthogonal and Idempotent Transformations for Learning Deep Neural Networks
Figure 4 for Orthogonal and Idempotent Transformations for Learning Deep Neural Networks
Viaarxiv icon

Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution

Add code
Sep 24, 2016
Figure 1 for Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution
Figure 2 for Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution
Figure 3 for Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution
Figure 4 for Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution
Viaarxiv icon