Alert button

"Text": models, code, and papers
Alert button

Long-Range Transformer Architectures for Document Understanding

Sep 11, 2023
Thibault Douzon, Stefan Duffner, Christophe Garcia, Jérémy Espinas

Viaarxiv icon

AMPLIFY:Attention-based Mixup for Performance Improvement and Label Smoothing in Transformer

Sep 22, 2023
Leixin Yang, Yaping Zhang, Haoyu Xiong, Yu Xiang

Figure 1 for AMPLIFY:Attention-based Mixup for Performance Improvement and Label Smoothing in Transformer
Figure 2 for AMPLIFY:Attention-based Mixup for Performance Improvement and Label Smoothing in Transformer
Figure 3 for AMPLIFY:Attention-based Mixup for Performance Improvement and Label Smoothing in Transformer
Figure 4 for AMPLIFY:Attention-based Mixup for Performance Improvement and Label Smoothing in Transformer
Viaarxiv icon

Neuro Symbolic Reasoning for Planning: Counterexample Guided Inductive Synthesis using Large Language Models and Satisfiability Solving

Sep 28, 2023
Sumit Kumar Jha, Susmit Jha, Patrick Lincoln, Nathaniel D. Bastian, Alvaro Velasquez, Rickard Ewetz, Sandeep Neema

Figure 1 for Neuro Symbolic Reasoning for Planning: Counterexample Guided Inductive Synthesis using Large Language Models and Satisfiability Solving
Figure 2 for Neuro Symbolic Reasoning for Planning: Counterexample Guided Inductive Synthesis using Large Language Models and Satisfiability Solving
Figure 3 for Neuro Symbolic Reasoning for Planning: Counterexample Guided Inductive Synthesis using Large Language Models and Satisfiability Solving
Figure 4 for Neuro Symbolic Reasoning for Planning: Counterexample Guided Inductive Synthesis using Large Language Models and Satisfiability Solving
Viaarxiv icon

Masked Diffusion with Task-awareness for Procedure Planning in Instructional Videos

Sep 14, 2023
Fen Fang, Yun Liu, Ali Koksal, Qianli Xu, Joo-Hwee Lim

Figure 1 for Masked Diffusion with Task-awareness for Procedure Planning in Instructional Videos
Figure 2 for Masked Diffusion with Task-awareness for Procedure Planning in Instructional Videos
Figure 3 for Masked Diffusion with Task-awareness for Procedure Planning in Instructional Videos
Figure 4 for Masked Diffusion with Task-awareness for Procedure Planning in Instructional Videos
Viaarxiv icon

PatFig: Generating Short and Long Captions for Patent Figures

Sep 15, 2023
Dana Aubakirova, Kim Gerdes, Lufei Liu

Viaarxiv icon

AlbNER: A Corpus for Named Entity Recognition in Albanian

Sep 15, 2023
Erion Çano

Viaarxiv icon

Multilingual context-based pronunciation learning for Text-to-Speech

Jul 31, 2023
Giulia Comini, Manuel Sam Ribeiro, Fan Yang, Heereen Shim, Jaime Lorenzo-Trueba

Figure 1 for Multilingual context-based pronunciation learning for Text-to-Speech
Figure 2 for Multilingual context-based pronunciation learning for Text-to-Speech
Figure 3 for Multilingual context-based pronunciation learning for Text-to-Speech
Figure 4 for Multilingual context-based pronunciation learning for Text-to-Speech
Viaarxiv icon

Calibrating LLM-Based Evaluator

Sep 23, 2023
Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

Viaarxiv icon

Dream the Impossible: Outlier Imagination with Diffusion Models

Sep 23, 2023
Xuefeng Du, Yiyou Sun, Xiaojin Zhu, Yixuan Li

Viaarxiv icon

Contrastive Feature Masking Open-Vocabulary Vision Transformer

Sep 02, 2023
Dahun Kim, Anelia Angelova, Weicheng Kuo

Figure 1 for Contrastive Feature Masking Open-Vocabulary Vision Transformer
Figure 2 for Contrastive Feature Masking Open-Vocabulary Vision Transformer
Figure 3 for Contrastive Feature Masking Open-Vocabulary Vision Transformer
Figure 4 for Contrastive Feature Masking Open-Vocabulary Vision Transformer
Viaarxiv icon