Alert button

"Text": models, code, and papers
Alert button

Ranking-aware Uncertainty for Text-guided Image Retrieval

Aug 16, 2023
Junyang Chen, Hanjiang Lai

Figure 1 for Ranking-aware Uncertainty for Text-guided Image Retrieval
Figure 2 for Ranking-aware Uncertainty for Text-guided Image Retrieval
Figure 3 for Ranking-aware Uncertainty for Text-guided Image Retrieval
Figure 4 for Ranking-aware Uncertainty for Text-guided Image Retrieval
Viaarxiv icon

Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity

Sep 12, 2023
Joseph Gatto, Omar Sharif, Parker Seegmiller, Philip Bohlman, Sarah Masud Preum

Figure 1 for Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity
Figure 2 for Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity
Figure 3 for Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity
Figure 4 for Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity
Viaarxiv icon

BYOC: Personalized Few-Shot Classification with Co-Authored Class Descriptions

Oct 09, 2023
Arth Bohra, Govert Verkes, Artem Harutyunyan, Pascal Weinberger, Giovanni Campagna

Figure 1 for BYOC: Personalized Few-Shot Classification with Co-Authored Class Descriptions
Figure 2 for BYOC: Personalized Few-Shot Classification with Co-Authored Class Descriptions
Figure 3 for BYOC: Personalized Few-Shot Classification with Co-Authored Class Descriptions
Figure 4 for BYOC: Personalized Few-Shot Classification with Co-Authored Class Descriptions
Viaarxiv icon

Story Visualization by Online Text Augmentation with Context Memory

Aug 15, 2023
Daechul Ahn, Daneul Kim, Gwangmo Song, Seung Hwan Kim, Honglak Lee, Dongyeop Kang, Jonghyun Choi

Figure 1 for Story Visualization by Online Text Augmentation with Context Memory
Figure 2 for Story Visualization by Online Text Augmentation with Context Memory
Figure 3 for Story Visualization by Online Text Augmentation with Context Memory
Figure 4 for Story Visualization by Online Text Augmentation with Context Memory
Viaarxiv icon

SALMONN: Towards Generic Hearing Abilities for Large Language Models

Oct 20, 2023
Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang

Viaarxiv icon

RethinkingTMSC: An Empirical Study for Target-Oriented Multimodal Sentiment Classification

Oct 14, 2023
Junjie Ye, Jie Zhou, Junfeng Tian, Rui Wang, Qi Zhang, Tao Gui, Xuanjing Huang

Figure 1 for RethinkingTMSC: An Empirical Study for Target-Oriented Multimodal Sentiment Classification
Figure 2 for RethinkingTMSC: An Empirical Study for Target-Oriented Multimodal Sentiment Classification
Figure 3 for RethinkingTMSC: An Empirical Study for Target-Oriented Multimodal Sentiment Classification
Figure 4 for RethinkingTMSC: An Empirical Study for Target-Oriented Multimodal Sentiment Classification
Viaarxiv icon

GeRA: Label-Efficient Geometrically Regularized Alignment

Oct 07, 2023
Dustin Klebe, Tal Shnitzer, Mikhail Yurochkin, Leonid Karlinsky, Justin Solomon

Viaarxiv icon

Likelihood-Based Text-to-Image Evaluation with Patch-Level Perceptual and Semantic Credit Assignment

Aug 16, 2023
Qi Chen, Chaorui Deng, Zixiong Huang, Bowen Zhang, Mingkui Tan, Qi Wu

Figure 1 for Likelihood-Based Text-to-Image Evaluation with Patch-Level Perceptual and Semantic Credit Assignment
Figure 2 for Likelihood-Based Text-to-Image Evaluation with Patch-Level Perceptual and Semantic Credit Assignment
Figure 3 for Likelihood-Based Text-to-Image Evaluation with Patch-Level Perceptual and Semantic Credit Assignment
Figure 4 for Likelihood-Based Text-to-Image Evaluation with Patch-Level Perceptual and Semantic Credit Assignment
Viaarxiv icon

Do Language Models Learn about Legal Entity Types during Pretraining?

Oct 19, 2023
Claire Barale, Michael Rovatsos, Nehal Bhuta

Figure 1 for Do Language Models Learn about Legal Entity Types during Pretraining?
Figure 2 for Do Language Models Learn about Legal Entity Types during Pretraining?
Figure 3 for Do Language Models Learn about Legal Entity Types during Pretraining?
Figure 4 for Do Language Models Learn about Legal Entity Types during Pretraining?
Viaarxiv icon

Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code

Oct 19, 2023
Xuan Ju, Ailing Zeng, Yuxuan Bian, Shaoteng Liu, Qiang Xu

Figure 1 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Figure 2 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Figure 3 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Figure 4 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Viaarxiv icon