Alert button

"Text": models, code, and papers
Alert button

Gaussian Adaptive Attention is All You Need: Robust Contextual Representations Across Multiple Modalities

Jan 31, 2024
Georgios Ioannides, Aman Chadha, Aaron Elkins

Viaarxiv icon

Multi-Task Learning for Front-End Text Processing in TTS

Jan 12, 2024
Wonjune Kang, Yun Wang, Shun Zhang, Arthur Hinsvark, Qing He

Viaarxiv icon

FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition

Feb 05, 2024
Xiaohu Huang, Hao Zhou, Kun Yao, Kai Han

Viaarxiv icon

Linguistic features for sentence difficulty prediction in ABSA

Feb 05, 2024
Adrian-Gabriel Chifu, Sébastien Fournier

Viaarxiv icon

EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain

Feb 05, 2024
Wei Zhang, Miaoxin Cai, Tong Zhang, Yin Zhuang, Xuerui Mao

Viaarxiv icon

Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech

Jan 19, 2024
Abhinav Garg, Jiyeon Kim, Sushil Khyalia, Chanwoo Kim, Dhananjaya Gowda

Viaarxiv icon

Explaining latent representations of generative models with large multimodal models

Feb 02, 2024
Mengdan Zhu, Zhenke Liu, Bo Pan, Abhinav Angirekula, Liang Zhao

Viaarxiv icon

UNSEE: Unsupervised Non-contrastive Sentence Embeddings

Feb 02, 2024
Ömer Veysel Çağatan

Viaarxiv icon

Bloom-epistemic and sentiment analysis hierarchical classification in course discussion forums

Jan 26, 2024
H. Toba, Y. T. Hernita, M. Ayub, M. C. Wijanto

Viaarxiv icon

Intensive Vision-guided Network for Radiology Report Generation

Feb 06, 2024
Fudan Zheng, Mengfei Li, Ying Wang, Weijiang Yu, Ruixuan Wang, Zhiguang Chen, Nong Xiao, Yutong Lu

Viaarxiv icon