Alert button

"Text": models, code, and papers
Alert button

Rank-Aware Negative Training for Semi-Supervised Text Classification

Jun 13, 2023
Ahmed Murtadha, Shengfeng Pan, Wen Bo, Jianlin Su, Xinxin Cao, Wenze Zhang, Yunfeng Liu

Figure 1 for Rank-Aware Negative Training for Semi-Supervised Text Classification
Figure 2 for Rank-Aware Negative Training for Semi-Supervised Text Classification
Figure 3 for Rank-Aware Negative Training for Semi-Supervised Text Classification
Figure 4 for Rank-Aware Negative Training for Semi-Supervised Text Classification
Viaarxiv icon

An Overview of Challenges in Egocentric Text-Video Retrieval

Jun 07, 2023
Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo Hwee Lim

Figure 1 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 2 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 3 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 4 for An Overview of Challenges in Egocentric Text-Video Retrieval
Viaarxiv icon

Uncertainty Estimation of Transformers' Predictions via Topological Analysis of the Attention Matrices

Aug 22, 2023
Elizaveta Kostenok, Daniil Cherniavskii, Alexey Zaytsev

Figure 1 for Uncertainty Estimation of Transformers' Predictions via Topological Analysis of the Attention Matrices
Figure 2 for Uncertainty Estimation of Transformers' Predictions via Topological Analysis of the Attention Matrices
Figure 3 for Uncertainty Estimation of Transformers' Predictions via Topological Analysis of the Attention Matrices
Figure 4 for Uncertainty Estimation of Transformers' Predictions via Topological Analysis of the Attention Matrices
Viaarxiv icon

bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents

Aug 22, 2023
Imam Mohammad Zulkarnain, Shayekh Bin Islam, Md. Zami Al Zunaed Farabe, Md. Mehedi Hasan Shawon, Jawaril Munshad Abedin, Beig Rajibul Hasan, Marsia Haque, Istiak Shihab, Syed Mobassir, MD. Nazmuddoha Ansary, Asif Sushmit, Farig Sadeque

Figure 1 for bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents
Figure 2 for bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents
Figure 3 for bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents
Figure 4 for bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents
Viaarxiv icon

Imbalanced Multi-label Classification for Business-related Text with Moderately Large Label Spaces

Jun 12, 2023
Muhammad Arslan, Christophe Cruz

Figure 1 for Imbalanced Multi-label Classification for Business-related Text with Moderately Large Label Spaces
Figure 2 for Imbalanced Multi-label Classification for Business-related Text with Moderately Large Label Spaces
Viaarxiv icon

UNITE: A Unified Benchmark for Text-to-SQL Evaluation

May 26, 2023
Wuwei Lan, Zhiguo Wang, Anuj Chauhan, Henghui Zhu, Alexander Li, Jiang Guo, Sheng Zhang, Chung-Wei Hang, Joseph Lilien, Yiqun Hu, Lin Pan, Mingwen Dong, Jun Wang, Jiarong Jiang, Stephen Ash, Vittorio Castelli, Patrick Ng, Bing Xiang

Figure 1 for UNITE: A Unified Benchmark for Text-to-SQL Evaluation
Figure 2 for UNITE: A Unified Benchmark for Text-to-SQL Evaluation
Figure 3 for UNITE: A Unified Benchmark for Text-to-SQL Evaluation
Viaarxiv icon

ContrastNet: A Contrastive Learning Framework for Few-Shot Text Classification

May 16, 2023
Junfan Chen, Richong Zhang, Yongyi Mao, Jie Xu

Figure 1 for ContrastNet: A Contrastive Learning Framework for Few-Shot Text Classification
Figure 2 for ContrastNet: A Contrastive Learning Framework for Few-Shot Text Classification
Figure 3 for ContrastNet: A Contrastive Learning Framework for Few-Shot Text Classification
Figure 4 for ContrastNet: A Contrastive Learning Framework for Few-Shot Text Classification
Viaarxiv icon

FISEdit: Accelerating Text-to-image Editing via Cache-enabled Sparse Diffusion Inference

May 27, 2023
Zihao Yu, Haoyang Li, Fangcheng Fu, Xupeng Miao, Bin Cui

Figure 1 for FISEdit: Accelerating Text-to-image Editing via Cache-enabled Sparse Diffusion Inference
Figure 2 for FISEdit: Accelerating Text-to-image Editing via Cache-enabled Sparse Diffusion Inference
Figure 3 for FISEdit: Accelerating Text-to-image Editing via Cache-enabled Sparse Diffusion Inference
Figure 4 for FISEdit: Accelerating Text-to-image Editing via Cache-enabled Sparse Diffusion Inference
Viaarxiv icon

LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding

Jun 09, 2023
Yi Tu, Ya Guo, Huan Chen, Jinyang Tang

Figure 1 for LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding
Figure 2 for LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding
Figure 3 for LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding
Figure 4 for LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding
Viaarxiv icon

InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4

Aug 23, 2023
Lai Wei, Zihao Jiang, Weiran Huang, Lichao Sun

Figure 1 for InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4
Figure 2 for InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4
Figure 3 for InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4
Figure 4 for InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4
Viaarxiv icon