Alert button

"Text": models, code, and papers
Alert button

Exploiting the Textual Potential from Vision-Language Pre-training for Text-based Person Search

Mar 08, 2023
Guanshuo Wang, Fufu Yu, Junjie Li, Qiong Jia, Shouhong Ding

Figure 1 for Exploiting the Textual Potential from Vision-Language Pre-training for Text-based Person Search
Figure 2 for Exploiting the Textual Potential from Vision-Language Pre-training for Text-based Person Search
Figure 3 for Exploiting the Textual Potential from Vision-Language Pre-training for Text-based Person Search
Figure 4 for Exploiting the Textual Potential from Vision-Language Pre-training for Text-based Person Search
Viaarxiv icon

Recognizing Unseen Objects via Multimodal Intensive Knowledge Graph Propagation

Jun 14, 2023
Likang Wu, Zhi Li, Hongke Zhao, Zhefeng Wang, Qi Liu, Baoxing Huai, Nicholas Jing Yuan, Enhong Chen

Figure 1 for Recognizing Unseen Objects via Multimodal Intensive Knowledge Graph Propagation
Figure 2 for Recognizing Unseen Objects via Multimodal Intensive Knowledge Graph Propagation
Figure 3 for Recognizing Unseen Objects via Multimodal Intensive Knowledge Graph Propagation
Figure 4 for Recognizing Unseen Objects via Multimodal Intensive Knowledge Graph Propagation
Viaarxiv icon

Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models

Jan 31, 2023
Hila Chefer, Yuval Alaluf, Yael Vinker, Lior Wolf, Daniel Cohen-Or

Figure 1 for Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
Figure 2 for Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
Figure 3 for Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
Figure 4 for Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
Viaarxiv icon

From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces

May 31, 2023
Peter Shaw, Mandar Joshi, James Cohan, Jonathan Berant, Panupong Pasupat, Hexiang Hu, Urvashi Khandelwal, Kenton Lee, Kristina Toutanova

Figure 1 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Figure 2 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Figure 3 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Figure 4 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Viaarxiv icon

Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach

Jun 11, 2023
Bin Hu, Chenyang Zhao, Pu Zhang, Zihao Zhou, Yuanhang Yang, Zenglin Xu, Bin Liu

Figure 1 for Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach
Figure 2 for Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach
Figure 3 for Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach
Figure 4 for Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach
Viaarxiv icon

Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer

Mar 15, 2023
Serin Yang, Hyunmin Hwang, Jong Chul Ye

Figure 1 for Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer
Figure 2 for Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer
Figure 3 for Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer
Figure 4 for Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer
Viaarxiv icon

Computational thematics: Comparing algorithms for clustering the genres of literary fiction

May 18, 2023
Oleg Sobchuk, Artjoms Šeļa

Figure 1 for Computational thematics: Comparing algorithms for clustering the genres of literary fiction
Figure 2 for Computational thematics: Comparing algorithms for clustering the genres of literary fiction
Figure 3 for Computational thematics: Comparing algorithms for clustering the genres of literary fiction
Figure 4 for Computational thematics: Comparing algorithms for clustering the genres of literary fiction
Viaarxiv icon

Privacy-Preserving Representation Learning for Text-Attributed Networks with Simplicial Complexes

Feb 09, 2023
Huixin Zhan, Victor S. Sheng

Viaarxiv icon

Long-Context Language Decision Transformers and Exponential Tilt for Interactive Text Environments

Feb 10, 2023
Nicolas Gontier, Pau Rodriguez, Issam Laradji, David Vazquez, Christopher Pal

Figure 1 for Long-Context Language Decision Transformers and Exponential Tilt for Interactive Text Environments
Figure 2 for Long-Context Language Decision Transformers and Exponential Tilt for Interactive Text Environments
Figure 3 for Long-Context Language Decision Transformers and Exponential Tilt for Interactive Text Environments
Figure 4 for Long-Context Language Decision Transformers and Exponential Tilt for Interactive Text Environments
Viaarxiv icon

LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models

Jun 15, 2023
Peng Xu, Wenqi Shao, Kaipeng Zhang, Peng Gao, Shuo Liu, Meng Lei, Fanqing Meng, Siyuan Huang, Yu Qiao, Ping Luo

Figure 1 for LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Figure 2 for LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Figure 3 for LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Figure 4 for LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Viaarxiv icon