Alert button

"Text": models, code, and papers
Alert button

Context Perception Parallel Decoder for Scene Text Recognition

Jul 23, 2023
Yongkun Du, Zhineng Chen, Caiyan Jia, Xiaoting Yin, Chenxia Li, Yuning Du, Yu-Gang Jiang

Figure 1 for Context Perception Parallel Decoder for Scene Text Recognition
Figure 2 for Context Perception Parallel Decoder for Scene Text Recognition
Figure 3 for Context Perception Parallel Decoder for Scene Text Recognition
Figure 4 for Context Perception Parallel Decoder for Scene Text Recognition
Viaarxiv icon

GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue

Sep 08, 2023
Yanrui Du, Sendong Zhao, Yuhan Chen, Rai Bai, Jing Liu, Hua Wu, Haifeng Wang, Bing Qin

Figure 1 for GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue
Figure 2 for GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue
Figure 3 for GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue
Figure 4 for GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue
Viaarxiv icon

Exploring Speech Enhancement for Low-resource Speech Synthesis

Sep 19, 2023
Zhaoheng Ni, Sravya Popuri, Ning Dong, Kohei Saijo, Xiaohui Zhang, Gael Le Lan, Yangyang Shi, Vikas Chandra, Changhan Wang

Figure 1 for Exploring Speech Enhancement for Low-resource Speech Synthesis
Figure 2 for Exploring Speech Enhancement for Low-resource Speech Synthesis
Figure 3 for Exploring Speech Enhancement for Low-resource Speech Synthesis
Figure 4 for Exploring Speech Enhancement for Low-resource Speech Synthesis
Viaarxiv icon

LLM4Jobs: Unsupervised occupation extraction and standardization leveraging Large Language Models

Sep 19, 2023
Nan Li, Bo Kang, Tijl De Bie

Figure 1 for LLM4Jobs: Unsupervised occupation extraction and standardization leveraging Large Language Models
Figure 2 for LLM4Jobs: Unsupervised occupation extraction and standardization leveraging Large Language Models
Figure 3 for LLM4Jobs: Unsupervised occupation extraction and standardization leveraging Large Language Models
Figure 4 for LLM4Jobs: Unsupervised occupation extraction and standardization leveraging Large Language Models
Viaarxiv icon

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Jul 10, 2023
Yuwei Guo, Ceyuan Yang, Anyi Rao, Yaohui Wang, Yu Qiao, Dahua Lin, Bo Dai

Figure 1 for AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Figure 2 for AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Figure 3 for AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Figure 4 for AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Viaarxiv icon

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages

Aug 23, 2023
Jinyi Hu, Yuan Yao, Chongyi Wang, Shan Wang, Yinxu Pan, Qianyu Chen, Tianyu Yu, Hanghao Wu, Yue Zhao, Haoye Zhang, Xu Han, Yankai Lin, Jiao Xue, Dahai Li, Zhiyuan Liu, Maosong Sun

Figure 1 for Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages
Figure 2 for Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages
Figure 3 for Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages
Figure 4 for Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages
Viaarxiv icon

Analysing Gender Bias in Text-to-Image Models using Object Detection

Jul 16, 2023
Harvey Mannering

Figure 1 for Analysing Gender Bias in Text-to-Image Models using Object Detection
Figure 2 for Analysing Gender Bias in Text-to-Image Models using Object Detection
Figure 3 for Analysing Gender Bias in Text-to-Image Models using Object Detection
Viaarxiv icon

Text + Sketch: Image Compression at Ultra Low Rates

Jul 04, 2023
Eric Lei, Yiğit Berkay Uslu, Hamed Hassani, Shirin Saeedi Bidokhti

Figure 1 for Text + Sketch: Image Compression at Ultra Low Rates
Figure 2 for Text + Sketch: Image Compression at Ultra Low Rates
Figure 3 for Text + Sketch: Image Compression at Ultra Low Rates
Figure 4 for Text + Sketch: Image Compression at Ultra Low Rates
Viaarxiv icon

Probabilistic Linguistic Knowledge and Token-level Text Augmentation

Jun 29, 2023
Zhengxiang Wang

Figure 1 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Figure 2 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Figure 3 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Figure 4 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Viaarxiv icon

Language Modeling Is Compression

Sep 19, 2023
Grégoire Delétang, Anian Ruoss, Paul-Ambroise Duquenne, Elliot Catt, Tim Genewein, Christopher Mattern, Jordi Grau-Moya, Li Kevin Wenliang, Matthew Aitchison, Laurent Orseau, Marcus Hutter, Joel Veness

Viaarxiv icon