Alert button

"Text": models, code, and papers
Alert button

TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval

Sep 28, 2022
Xiaohan Zou, Changqiao Wu, Lele Cheng, Zhongyuan Wang

Figure 1 for TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval
Figure 2 for TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval
Figure 3 for TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval
Figure 4 for TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval
Viaarxiv icon

CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge

Nov 17, 2022
Linli Yao, Weijing Chen, Qin Jin

Figure 1 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Figure 2 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Figure 3 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Figure 4 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Viaarxiv icon

End-to-end Clinical Event Extraction from Chinese Electronic Health Record

Aug 19, 2022
Wei Feng, Ruochen Huang, Yun Yu, Huiting Sun, Yun Liu

Figure 1 for End-to-end Clinical Event Extraction from Chinese Electronic Health Record
Figure 2 for End-to-end Clinical Event Extraction from Chinese Electronic Health Record
Figure 3 for End-to-end Clinical Event Extraction from Chinese Electronic Health Record
Figure 4 for End-to-end Clinical Event Extraction from Chinese Electronic Health Record
Viaarxiv icon

Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidates

Nov 14, 2022
Etienne Labbé, Thomas Pellegrini, Julien Pinquier

Figure 1 for Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidates
Figure 2 for Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidates
Figure 3 for Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidates
Figure 4 for Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidates
Viaarxiv icon

A Simple and Effective Method to Improve Zero-Shot Cross-Lingual Transfer Learning

Oct 18, 2022
Kunbo Ding, Weijie Liu, Yuejian Fang, Weiquan Mao, Zhe Zhao, Tao Zhu, Haoyan Liu, Rong Tian, Yiren Chen

Figure 1 for A Simple and Effective Method to Improve Zero-Shot Cross-Lingual Transfer Learning
Figure 2 for A Simple and Effective Method to Improve Zero-Shot Cross-Lingual Transfer Learning
Figure 3 for A Simple and Effective Method to Improve Zero-Shot Cross-Lingual Transfer Learning
Figure 4 for A Simple and Effective Method to Improve Zero-Shot Cross-Lingual Transfer Learning
Viaarxiv icon

Efficient Speech Translation with Dynamic Latent Perceivers

Oct 28, 2022
Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussá

Figure 1 for Efficient Speech Translation with Dynamic Latent Perceivers
Figure 2 for Efficient Speech Translation with Dynamic Latent Perceivers
Figure 3 for Efficient Speech Translation with Dynamic Latent Perceivers
Figure 4 for Efficient Speech Translation with Dynamic Latent Perceivers
Viaarxiv icon

Arabic Text-To-Speech (TTS) Data Preparation

Apr 07, 2022
Hala Al Masri, Muhy Eddin Za'ter

Figure 1 for Arabic Text-To-Speech (TTS) Data Preparation
Viaarxiv icon

OLGA : An Ontology and LSTM-based approach for generating Arithmetic Word Problems (AWPs) of transfer type

Nov 22, 2022
Suresh Kumar, P Sreenivasa Kumar

Figure 1 for OLGA : An Ontology and LSTM-based approach for generating Arithmetic Word Problems (AWPs) of transfer type
Figure 2 for OLGA : An Ontology and LSTM-based approach for generating Arithmetic Word Problems (AWPs) of transfer type
Figure 3 for OLGA : An Ontology and LSTM-based approach for generating Arithmetic Word Problems (AWPs) of transfer type
Figure 4 for OLGA : An Ontology and LSTM-based approach for generating Arithmetic Word Problems (AWPs) of transfer type
Viaarxiv icon

A Deep Double Ritz Method for solving Partial Differential Equations

Nov 07, 2022
Carlos Uriarte, David Pardo, Ignacio Muga, Judit Muñoz-Matute

Figure 1 for A Deep Double Ritz Method for solving Partial Differential Equations
Figure 2 for A Deep Double Ritz Method for solving Partial Differential Equations
Figure 3 for A Deep Double Ritz Method for solving Partial Differential Equations
Figure 4 for A Deep Double Ritz Method for solving Partial Differential Equations
Viaarxiv icon

Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities

Oct 30, 2022
Khyathi Raghavi Chandu, Alborz Geramifard

Figure 1 for Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities
Figure 2 for Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities
Figure 3 for Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities
Figure 4 for Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities
Viaarxiv icon