Alert button

"Text": models, code, and papers
Alert button

SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training

Nov 22, 2022
Yuanze Lin, Chen Wei, Huiyu Wang, Alan Yuille, Cihang Xie

Figure 1 for SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training
Figure 2 for SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training
Figure 3 for SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training
Figure 4 for SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training
Viaarxiv icon

$S^2$-Flow: Joint Semantic and Style Editing of Facial Images

Nov 22, 2022
Krishnakant Singh, Simone Schaub-Meyer, Stefan Roth

Figure 1 for $S^2$-Flow: Joint Semantic and Style Editing of Facial Images
Figure 2 for $S^2$-Flow: Joint Semantic and Style Editing of Facial Images
Figure 3 for $S^2$-Flow: Joint Semantic and Style Editing of Facial Images
Figure 4 for $S^2$-Flow: Joint Semantic and Style Editing of Facial Images
Viaarxiv icon

PESE: Event Structure Extraction using Pointer Network based Encoder-Decoder Architecture

Nov 22, 2022
Alapan Kuila, Sudeshan Sarkar

Figure 1 for PESE: Event Structure Extraction using Pointer Network based Encoder-Decoder Architecture
Figure 2 for PESE: Event Structure Extraction using Pointer Network based Encoder-Decoder Architecture
Figure 3 for PESE: Event Structure Extraction using Pointer Network based Encoder-Decoder Architecture
Figure 4 for PESE: Event Structure Extraction using Pointer Network based Encoder-Decoder Architecture
Viaarxiv icon

Image Semantic Relation Generation

Oct 19, 2022
Mingzhe Du

Figure 1 for Image Semantic Relation Generation
Figure 2 for Image Semantic Relation Generation
Figure 3 for Image Semantic Relation Generation
Figure 4 for Image Semantic Relation Generation
Viaarxiv icon

Language Detoxification with Attribute-Discriminative Latent Space

Oct 19, 2022
Jin Myung Kwak, Minseon Kim, Sung Ju Hwang

Figure 1 for Language Detoxification with Attribute-Discriminative Latent Space
Figure 2 for Language Detoxification with Attribute-Discriminative Latent Space
Figure 3 for Language Detoxification with Attribute-Discriminative Latent Space
Figure 4 for Language Detoxification with Attribute-Discriminative Latent Space
Viaarxiv icon

LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval

Mar 10, 2022
Jie Lei, Xinlei Chen, Ning Zhang, Mengjiao Wang, Mohit Bansal, Tamara L. Berg, Licheng Yu

Figure 1 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Figure 2 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Figure 3 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Figure 4 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Viaarxiv icon

Improving Structured Text Recognition with Regular Expression Biasing

Nov 10, 2021
Baoguang Shi, Wenfeng Cheng, Yijuan Lu, Cha Zhang, Dinei Florencio

Figure 1 for Improving Structured Text Recognition with Regular Expression Biasing
Figure 2 for Improving Structured Text Recognition with Regular Expression Biasing
Figure 3 for Improving Structured Text Recognition with Regular Expression Biasing
Figure 4 for Improving Structured Text Recognition with Regular Expression Biasing
Viaarxiv icon

Ingredient Extraction from Text in the Recipe Domain

Apr 18, 2022
Arkin Dharawat, Chris Doan

Figure 1 for Ingredient Extraction from Text in the Recipe Domain
Figure 2 for Ingredient Extraction from Text in the Recipe Domain
Figure 3 for Ingredient Extraction from Text in the Recipe Domain
Figure 4 for Ingredient Extraction from Text in the Recipe Domain
Viaarxiv icon

Cross-Reality Re-Rendering: Manipulating between Digital and Physical Realities

Nov 15, 2022
Siddhartha Datta

Figure 1 for Cross-Reality Re-Rendering: Manipulating between Digital and Physical Realities
Figure 2 for Cross-Reality Re-Rendering: Manipulating between Digital and Physical Realities
Figure 3 for Cross-Reality Re-Rendering: Manipulating between Digital and Physical Realities
Figure 4 for Cross-Reality Re-Rendering: Manipulating between Digital and Physical Realities
Viaarxiv icon

Category Theory for Quantum Natural Language Processing

Dec 13, 2022
Alexis Toumi

Viaarxiv icon