Alert button

"Text": models, code, and papers
Alert button

Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs

Nov 15, 2022
Kyle Richardson, Ronen Tamari, Oren Sultan, Reut Tsarfaty, Dafna Shahaf, Ashish Sabharwal

Figure 1 for Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Figure 2 for Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Figure 3 for Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Figure 4 for Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Viaarxiv icon

Hate-CLIPper: Multimodal Hateful Meme Classification based on Cross-modal Interaction of CLIP Features

Oct 17, 2022
Gokul Karthik Kumar, Karthik Nandakumar

Figure 1 for Hate-CLIPper: Multimodal Hateful Meme Classification based on Cross-modal Interaction of CLIP Features
Figure 2 for Hate-CLIPper: Multimodal Hateful Meme Classification based on Cross-modal Interaction of CLIP Features
Figure 3 for Hate-CLIPper: Multimodal Hateful Meme Classification based on Cross-modal Interaction of CLIP Features
Figure 4 for Hate-CLIPper: Multimodal Hateful Meme Classification based on Cross-modal Interaction of CLIP Features
Viaarxiv icon

Task Residual for Tuning Vision-Language Models

Nov 18, 2022
Tao Yu, Zhihe Lu, Xin Jin, Zhibo Chen, Xinchao Wang

Figure 1 for Task Residual for Tuning Vision-Language Models
Figure 2 for Task Residual for Tuning Vision-Language Models
Figure 3 for Task Residual for Tuning Vision-Language Models
Figure 4 for Task Residual for Tuning Vision-Language Models
Viaarxiv icon

Automatic Text Summarization Methods: A Comprehensive Review

Mar 03, 2022
Divakar Yadav, Jalpa Desai, Arun Kumar Yadav

Figure 1 for Automatic Text Summarization Methods: A Comprehensive Review
Figure 2 for Automatic Text Summarization Methods: A Comprehensive Review
Figure 3 for Automatic Text Summarization Methods: A Comprehensive Review
Figure 4 for Automatic Text Summarization Methods: A Comprehensive Review
Viaarxiv icon

REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory

Dec 10, 2022
Ziniu Hu, Ahmet Iscen, Chen Sun, Zirui Wang, Kai-Wei Chang, Yizhou Sun, Cordelia Schmid, David A. Ross, Alireza Fathi

Figure 1 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Figure 2 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Figure 3 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Figure 4 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Viaarxiv icon

Graph Learning: A Comprehensive Survey and Future Directions

Dec 17, 2022
Shaopeng Wei, Yu Zhao

Figure 1 for Graph Learning: A Comprehensive Survey and Future Directions
Figure 2 for Graph Learning: A Comprehensive Survey and Future Directions
Figure 3 for Graph Learning: A Comprehensive Survey and Future Directions
Figure 4 for Graph Learning: A Comprehensive Survey and Future Directions
Viaarxiv icon

ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter

Oct 20, 2021
Humen Zhong, Jun Tang, Wenhai Wang, Zhibo Yang, Cong Yao, Tong Lu

Figure 1 for ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter
Figure 2 for ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter
Figure 3 for ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter
Figure 4 for ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter
Viaarxiv icon

Imagen Video: High Definition Video Generation with Diffusion Models

Oct 05, 2022
Jonathan Ho, William Chan, Chitwan Saharia, Jay Whang, Ruiqi Gao, Alexey Gritsenko, Diederik P. Kingma, Ben Poole, Mohammad Norouzi, David J. Fleet, Tim Salimans

Figure 1 for Imagen Video: High Definition Video Generation with Diffusion Models
Figure 2 for Imagen Video: High Definition Video Generation with Diffusion Models
Figure 3 for Imagen Video: High Definition Video Generation with Diffusion Models
Figure 4 for Imagen Video: High Definition Video Generation with Diffusion Models
Viaarxiv icon

Effective Cross-Task Transfer Learning for Explainable Natural Language Inference with T5

Oct 31, 2022
Irina Bigoulaeva, Rachneet Sachdeva, Harish Tayyar Madabushi, Aline Villavicencio, Iryna Gurevych

Figure 1 for Effective Cross-Task Transfer Learning for Explainable Natural Language Inference with T5
Figure 2 for Effective Cross-Task Transfer Learning for Explainable Natural Language Inference with T5
Figure 3 for Effective Cross-Task Transfer Learning for Explainable Natural Language Inference with T5
Figure 4 for Effective Cross-Task Transfer Learning for Explainable Natural Language Inference with T5
Viaarxiv icon

Does CLIP Bind Concepts? Probing Compositionality in Large Image Models

Dec 20, 2022
Martha Lewis, Qinan Yu, Jack Merullo, Ellie Pavlick

Figure 1 for Does CLIP Bind Concepts? Probing Compositionality in Large Image Models
Figure 2 for Does CLIP Bind Concepts? Probing Compositionality in Large Image Models
Figure 3 for Does CLIP Bind Concepts? Probing Compositionality in Large Image Models
Figure 4 for Does CLIP Bind Concepts? Probing Compositionality in Large Image Models
Viaarxiv icon