Alert button

"Text": models, code, and papers
Alert button

ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue

May 23, 2023
Haoqin Tu, Yitong Li, Fei Mi, Zhongliang Yang

Figure 1 for ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue
Figure 2 for ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue
Figure 3 for ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue
Figure 4 for ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue
Viaarxiv icon

Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path

May 23, 2023
Zilong Wang, Jingbo Shang

Figure 1 for Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path
Figure 2 for Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path
Figure 3 for Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path
Figure 4 for Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path
Viaarxiv icon

Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner

May 19, 2023
Zikang Liu, Sihan Chen, Longteng Guo, Handong Li, Xingjian He, Jing Liu

Figure 1 for Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner
Figure 2 for Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner
Figure 3 for Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner
Figure 4 for Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner
Viaarxiv icon

Learning Universal Policies via Text-Guided Video Generation

Feb 02, 2023
Yilun Du, Mengjiao Yang, Bo Dai, Hanjun Dai, Ofir Nachum, Joshua B. Tenenbaum, Dale Schuurmans, Pieter Abbeel

Figure 1 for Learning Universal Policies via Text-Guided Video Generation
Figure 2 for Learning Universal Policies via Text-Guided Video Generation
Figure 3 for Learning Universal Policies via Text-Guided Video Generation
Figure 4 for Learning Universal Policies via Text-Guided Video Generation
Viaarxiv icon

How learners produce data from text in classifying clickbait

Jan 28, 2023
Nicholas J. Horton, Jie Chao, Phebe Palmer, William Finzer

Viaarxiv icon

Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval

Jan 30, 2023
Yizhen Chen, Jie Wang, Lijian Lin, Zhongang Qi, Jin Ma, Ying Shan

Figure 1 for Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval
Figure 2 for Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval
Figure 3 for Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval
Figure 4 for Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval
Viaarxiv icon

Surfacing Biases in Large Language Models using Contrastive Input Decoding

May 12, 2023
Gal Yona, Or Honovich, Itay Laish, Roee Aharoni

Figure 1 for Surfacing Biases in Large Language Models using Contrastive Input Decoding
Figure 2 for Surfacing Biases in Large Language Models using Contrastive Input Decoding
Figure 3 for Surfacing Biases in Large Language Models using Contrastive Input Decoding
Figure 4 for Surfacing Biases in Large Language Models using Contrastive Input Decoding
Viaarxiv icon

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Jun 02, 2023
Zeqiu Wu, Yushi Hu, Weijia Shi, Nouha Dziri, Alane Suhr, Prithviraj Ammanabrolu, Noah A. Smith, Mari Ostendorf, Hannaneh Hajishirzi

Figure 1 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Figure 2 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Figure 3 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Figure 4 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Viaarxiv icon

AI Imagery and the Overton Window

Jun 02, 2023
Sarah K. Amer

Viaarxiv icon

Text-Visual Prompting for Efficient 2D Temporal Video Grounding

Mar 09, 2023
Yimeng Zhang, Xin Chen, Jinghan Jia, Sijia Liu, Ke Ding

Figure 1 for Text-Visual Prompting for Efficient 2D Temporal Video Grounding
Figure 2 for Text-Visual Prompting for Efficient 2D Temporal Video Grounding
Figure 3 for Text-Visual Prompting for Efficient 2D Temporal Video Grounding
Figure 4 for Text-Visual Prompting for Efficient 2D Temporal Video Grounding
Viaarxiv icon