Alert button

"Text": models, code, and papers
Alert button

Adaptive and Personalized Exercise Generation for Online Language Learning

Jun 04, 2023
Peng Cui, Mrinmaya Sachan

Figure 1 for Adaptive and Personalized Exercise Generation for Online Language Learning
Figure 2 for Adaptive and Personalized Exercise Generation for Online Language Learning
Figure 3 for Adaptive and Personalized Exercise Generation for Online Language Learning
Figure 4 for Adaptive and Personalized Exercise Generation for Online Language Learning
Viaarxiv icon

Designing an Encoder for Fast Personalization of Text-to-Image Models

Feb 23, 2023
Rinon Gal, Moab Arar, Yuval Atzmon, Amit H. Bermano, Gal Chechik, Daniel Cohen-Or

Figure 1 for Designing an Encoder for Fast Personalization of Text-to-Image Models
Figure 2 for Designing an Encoder for Fast Personalization of Text-to-Image Models
Figure 3 for Designing an Encoder for Fast Personalization of Text-to-Image Models
Figure 4 for Designing an Encoder for Fast Personalization of Text-to-Image Models
Viaarxiv icon

ICDAR 2023 Competition on Reading the Seal Title

Apr 24, 2023
Wenwen Yu, Mingyu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas, Xiang Bai

Figure 1 for ICDAR 2023 Competition on Reading the Seal Title
Figure 2 for ICDAR 2023 Competition on Reading the Seal Title
Figure 3 for ICDAR 2023 Competition on Reading the Seal Title
Figure 4 for ICDAR 2023 Competition on Reading the Seal Title
Viaarxiv icon

GLIGEN: Open-Set Grounded Text-to-Image Generation

Jan 17, 2023
Yuheng Li, Haotian Liu, Qingyang Wu, Fangzhou Mu, Jianwei Yang, Jianfeng Gao, Chunyuan Li, Yong Jae Lee

Figure 1 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Figure 2 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Figure 3 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Figure 4 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Viaarxiv icon

Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables

Apr 28, 2023
Matthias Urban, Carsten Binnig

Figure 1 for Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables
Figure 2 for Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables
Figure 3 for Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables
Figure 4 for Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables
Viaarxiv icon

CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors

May 11, 2023
Peng Li, Tianxiang Sun, Qiong Tang, Hang Yan, Yuanbin Wu, Xuanjing Huang, Xipeng Qiu

Figure 1 for CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors
Figure 2 for CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors
Figure 3 for CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors
Figure 4 for CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors
Viaarxiv icon

Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching

Mar 01, 2023
Zheng Li, Caili Guo, Xin Wang, Zerun Feng, Zhongtian Du

Figure 1 for Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching
Figure 2 for Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching
Figure 3 for Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching
Figure 4 for Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching
Viaarxiv icon

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

May 15, 2023
Yuyang Zhao, Enze Xie, Lanqing Hong, Zhenguo Li, Gim Hee Lee

Figure 1 for Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Figure 2 for Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Figure 3 for Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Figure 4 for Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Viaarxiv icon

Evaluating the Social Impact of Generative AI Systems in Systems and Society

Jun 12, 2023
Irene Solaiman, Zeerak Talat, William Agnew, Lama Ahmad, Dylan Baker, Su Lin Blodgett, Hal Daumé III, Jesse Dodge, Ellie Evans, Sara Hooker, Yacine Jernite, Alexandra Sasha Luccioni, Alberto Lusoli, Margaret Mitchell, Jessica Newman, Marie-Therese Png, Andrew Strait, Apostol Vassilev

Viaarxiv icon

Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Jun 12, 2023
Hang Zhang, Xin Li, Lidong Bing

Figure 1 for Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Figure 2 for Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Figure 3 for Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Figure 4 for Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Viaarxiv icon