Alert button

"Text": models, code, and papers
Alert button

Using Full-text Content of Academic Articles to Build a Methodology Taxonomy of Information Science in China

Jan 20, 2021
Heng Zhang, Chengzhi Zhang

Figure 1 for Using Full-text Content of Academic Articles to Build a Methodology Taxonomy of Information Science in China
Figure 2 for Using Full-text Content of Academic Articles to Build a Methodology Taxonomy of Information Science in China
Figure 3 for Using Full-text Content of Academic Articles to Build a Methodology Taxonomy of Information Science in China
Figure 4 for Using Full-text Content of Academic Articles to Build a Methodology Taxonomy of Information Science in China
Viaarxiv icon

MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation

Oct 22, 2020
Junkun Chen, Mingbo Ma, Renjie Zheng, Liang Huang

Figure 1 for MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation
Figure 2 for MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation
Figure 3 for MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation
Figure 4 for MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation
Viaarxiv icon

OutfitTransformer: Learning Outfit Representations for Fashion Recommendation

Apr 11, 2022
Rohan Sarkar, Navaneeth Bodla, Mariya Vasileva, Yen-Liang Lin, Anurag Beniwal, Alan Lu, Gerard Medioni

Figure 1 for OutfitTransformer: Learning Outfit Representations for Fashion Recommendation
Figure 2 for OutfitTransformer: Learning Outfit Representations for Fashion Recommendation
Figure 3 for OutfitTransformer: Learning Outfit Representations for Fashion Recommendation
Figure 4 for OutfitTransformer: Learning Outfit Representations for Fashion Recommendation
Viaarxiv icon

MURAL: Multimodal, Multitask Retrieval Across Languages

Sep 10, 2021
Aashi Jain, Mandy Guo, Krishna Srinivasan, Ting Chen, Sneha Kudugunta, Chao Jia, Yinfei Yang, Jason Baldridge

Figure 1 for MURAL: Multimodal, Multitask Retrieval Across Languages
Figure 2 for MURAL: Multimodal, Multitask Retrieval Across Languages
Figure 3 for MURAL: Multimodal, Multitask Retrieval Across Languages
Figure 4 for MURAL: Multimodal, Multitask Retrieval Across Languages
Viaarxiv icon

Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search

Jan 08, 2021
Chenyang Gao, Guanyu Cai, Xinyang Jiang, Feng Zheng, Jun Zhang, Yifei Gong, Pai Peng, Xiaowei Guo, Xing Sun

Figure 1 for Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search
Figure 2 for Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search
Figure 3 for Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search
Figure 4 for Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search
Viaarxiv icon

Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling

Apr 01, 2021
Qing He, Zhiping Xiu, Thilo Koehler, Jilong Wu

Figure 1 for Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling
Figure 2 for Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling
Figure 3 for Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling
Figure 4 for Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling
Viaarxiv icon

Evaluating Pretrained Transformer Models for Entity Linking in Task-Oriented Dialog

Dec 15, 2021
Sai Muralidhar Jayanthi, Varsha Embar, Karthik Raghunathan

Figure 1 for Evaluating Pretrained Transformer Models for Entity Linking in Task-Oriented Dialog
Figure 2 for Evaluating Pretrained Transformer Models for Entity Linking in Task-Oriented Dialog
Figure 3 for Evaluating Pretrained Transformer Models for Entity Linking in Task-Oriented Dialog
Viaarxiv icon

SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition

May 27, 2020
Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Fei Wu, Futai Zou

Figure 1 for SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition
Figure 2 for SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition
Figure 3 for SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition
Figure 4 for SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition
Viaarxiv icon

Leveraging Disentangled Representations to Improve Vision-Based Keystroke Inference Attacks Under Low Data

Apr 05, 2022
John Lim, Jan-Michael Frahm, Fabian Monrose

Figure 1 for Leveraging Disentangled Representations to Improve Vision-Based Keystroke Inference Attacks Under Low Data
Figure 2 for Leveraging Disentangled Representations to Improve Vision-Based Keystroke Inference Attacks Under Low Data
Figure 3 for Leveraging Disentangled Representations to Improve Vision-Based Keystroke Inference Attacks Under Low Data
Figure 4 for Leveraging Disentangled Representations to Improve Vision-Based Keystroke Inference Attacks Under Low Data
Viaarxiv icon

Coordinating Narratives and the Capitol Riots on Parler

Sep 02, 2021
Lynnette Hui Xian Ng, Iain Cruickshank, Kathleen M. Carley

Figure 1 for Coordinating Narratives and the Capitol Riots on Parler
Figure 2 for Coordinating Narratives and the Capitol Riots on Parler
Figure 3 for Coordinating Narratives and the Capitol Riots on Parler
Figure 4 for Coordinating Narratives and the Capitol Riots on Parler
Viaarxiv icon