Picture for Yishu Miao

Yishu Miao

Scene Text Recognition with Semantics

Add code
Oct 19, 2022
Figure 1 for Scene Text Recognition with Semantics
Figure 2 for Scene Text Recognition with Semantics
Figure 3 for Scene Text Recognition with Semantics
Figure 4 for Scene Text Recognition with Semantics
Viaarxiv icon

Contrastive Video-Language Learning with Fine-grained Frame Sampling

Add code
Oct 10, 2022
Figure 1 for Contrastive Video-Language Learning with Fine-grained Frame Sampling
Figure 2 for Contrastive Video-Language Learning with Fine-grained Frame Sampling
Figure 3 for Contrastive Video-Language Learning with Fine-grained Frame Sampling
Figure 4 for Contrastive Video-Language Learning with Fine-grained Frame Sampling
Viaarxiv icon

Logically Consistent Adversarial Attacks for Soft Theorem Provers

Add code
Apr 29, 2022
Figure 1 for Logically Consistent Adversarial Attacks for Soft Theorem Provers
Figure 2 for Logically Consistent Adversarial Attacks for Soft Theorem Provers
Figure 3 for Logically Consistent Adversarial Attacks for Soft Theorem Provers
Figure 4 for Logically Consistent Adversarial Attacks for Soft Theorem Provers
Viaarxiv icon

Kubric: A scalable dataset generator

Add code
Mar 07, 2022
Figure 1 for Kubric: A scalable dataset generator
Figure 2 for Kubric: A scalable dataset generator
Figure 3 for Kubric: A scalable dataset generator
Figure 4 for Kubric: A scalable dataset generator
Viaarxiv icon

Guiding Visual Question Generation

Add code
Oct 15, 2021
Figure 1 for Guiding Visual Question Generation
Figure 2 for Guiding Visual Question Generation
Figure 3 for Guiding Visual Question Generation
Figure 4 for Guiding Visual Question Generation
Viaarxiv icon

Cross-Modal Generative Augmentation for Visual Question Answering

Add code
May 11, 2021
Figure 1 for Cross-Modal Generative Augmentation for Visual Question Answering
Figure 2 for Cross-Modal Generative Augmentation for Visual Question Answering
Figure 3 for Cross-Modal Generative Augmentation for Visual Question Answering
Figure 4 for Cross-Modal Generative Augmentation for Visual Question Answering
Viaarxiv icon

Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation

Add code
Feb 22, 2021
Figure 1 for Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation
Figure 2 for Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation
Figure 3 for Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation
Figure 4 for Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation
Viaarxiv icon

Latent Variable Models for Visual Question Answering

Add code
Jan 16, 2021
Figure 1 for Latent Variable Models for Visual Question Answering
Figure 2 for Latent Variable Models for Visual Question Answering
Figure 3 for Latent Variable Models for Visual Question Answering
Figure 4 for Latent Variable Models for Visual Question Answering
Viaarxiv icon

Watch and Learn: Mapping Language and Noisy Real-world Videos with Self-supervision

Add code
Nov 19, 2020
Figure 1 for Watch and Learn: Mapping Language and Noisy Real-world Videos with Self-supervision
Figure 2 for Watch and Learn: Mapping Language and Noisy Real-world Videos with Self-supervision
Figure 3 for Watch and Learn: Mapping Language and Noisy Real-world Videos with Self-supervision
Figure 4 for Watch and Learn: Mapping Language and Noisy Real-world Videos with Self-supervision
Viaarxiv icon

Selective Sensor Fusion for Neural Visual-Inertial Odometry

Add code
Mar 04, 2019
Figure 1 for Selective Sensor Fusion for Neural Visual-Inertial Odometry
Figure 2 for Selective Sensor Fusion for Neural Visual-Inertial Odometry
Figure 3 for Selective Sensor Fusion for Neural Visual-Inertial Odometry
Figure 4 for Selective Sensor Fusion for Neural Visual-Inertial Odometry
Viaarxiv icon