"Text": models, code, and papers

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval

Mar 31, 2022
Mengjun Cheng, Yipeng Sun, Longchao Wang, Xiongwei Zhu, Kun Yao, Jie Chen, Guoli Song, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang

Via arXiv

Pushing the performances of ASR models on English and Spanish accents

Dec 22, 2022
Pooja Chitkara, Morgane Riviere, Jade Copet, Frank Zhang, Yatharth Saraf

Via arXiv

Multi-Scales Data Augmentation Approach In Natural Language Inference For Artifacts Mitigation And Pre-Trained Model Optimization

Jan 05, 2023
Zhenyuan Lu

Via arXiv

E2E Refined Dataset

Nov 01, 2022
Keisuke Toyama, Katsuhito Sudoh, Satoshi Nakamura

Via arXiv

High-Fidelity Guided Image Synthesis with Latent Diffusion Models

Nov 30, 2022
Jaskirat Singh, Stephen Gould, Liang Zheng

Via arXiv

AIONER: All-in-one scheme-based biomedical named entity recognition using deep learning

Nov 30, 2022
Ling Luo, Chih-Hsuan Wei, Po-Ting Lai, Robert Leaman, Qingyu Chen, Zhiyong Lu

Via arXiv

Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention

Nov 21, 2022
Zineng Tang, Jaemin Cho, Jie Lei, Mohit Bansal

Via arXiv

Learning Object-Language Alignments for Open-Vocabulary Object Detection

Nov 27, 2022
Chuang Lin, Peize Sun, Yi Jiang, Ping Luo, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan, Jianfei Cai

Via arXiv

Cluster-based Evaluation of Automatically Generated Text

Jun 01, 2022
Tiago Pimentel, Clara Meister, Ryan Cotterell

Via arXiv

To show or not to show: Redacting sensitive text from videos of electronic displays

Aug 19, 2022
Abhishek Mukhopadhyay, Shubham Agarwal, Patrick Dylan Zwick, Pradipta Biswas

Via arXiv