Alert button

"Text": models, code, and papers
Alert button

Entity Recognition from Colloquial Text

Jan 09, 2024
Tamara Babaian, Jennifer Xu

Viaarxiv icon

Using Large Language Model for End-to-End Chinese ASR and NER

Jan 21, 2024
Yuang Li, Jiawei Yu, Yanqing Zhao, Min Zhang, Mengxin Ren, Xiaofeng Zhao, Xiaosong Qiao, Chang Su, Miaomiao Ma, Hao Yang

Viaarxiv icon

Balanced SNR-Aware Distillation for Guided Text-to-Audio Generation

Dec 25, 2023
Bingzhi Liu, Yin Cao, Haohe Liu, Yi Zhou

Viaarxiv icon

A survey on recent advances in named entity recognition

Jan 19, 2024
Imed Keraghel, Stanislas Morbieu, Mohamed Nadif

Viaarxiv icon

Compositional Generalization for Multi-label Text Classification: A Data-Augmentation Approach

Dec 20, 2023
Yuyang Chai, Zhuang Li, Jiahui Liu, Lei Chen, Fei Li, Donghong Ji, Chong Teng

Viaarxiv icon

Community-based Behavioral Understanding of Crisis Activity Concerns using Social Media Data: A Study on the 2023 Canadian Wildfires in New York City

Jan 22, 2024
Khondhaker Al Momin, Md Sami Hasnine, Arif Mohaimin Sadri

Viaarxiv icon

Leveraging Social Media Data to Identify Factors Influencing Public Attitude Towards Accessibility, Socioeconomic Disparity and Public Transportation

Jan 22, 2024
Khondhaker Al Momin, Arif Mohaimin Sadri, Md Sami Hasnine

Viaarxiv icon

IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition

Dec 19, 2023
Xiaomeng Yang, Zhi Qiao, Yu Zhou, Weiping Wang

Viaarxiv icon

CLAPP: Contrastive Language-Audio Pre-training in Passive Underwater Vessel Classification

Jan 15, 2024
Zeyu Li, Jingsheng Gao, Tong Yu, Suncheng Xiang, Jiacheng Ruan, Ting Liu, Yuzhuo Fu

Viaarxiv icon

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Jan 15, 2024
Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai

Viaarxiv icon