"Text": models, code, and papers

A text-dependent speaker verification application framework based on Chinese numerical string corpus

Dec 04, 2023
Litong Zheng, Feng Hong, Weijie Xu

Instruct-Imagen: Image Generation with Multi-modal Instruction

Jan 03, 2024
Hexiang Hu, Kelvin C. K. Chan, Yu-Chuan Su, Wenhu Chen, Yandong Li, Kihyuk Sohn, Yang Zhao, Xue Ben, Boqing Gong, William Cohen, Ming-Wei Chang, Xuhui Jia

Detours for Navigating Instructional Videos

Jan 03, 2024
Kumar Ashutosh, Zihui Xue, Tushar Nagarajan, Kristen Grauman

Theory of Hallucinations based on Equivariance

Jan 04, 2024
Hisaichi Shibata

Improving Natural Language Understanding with Computation-Efficient Retrieval Representation Fusion

Jan 04, 2024
Shangyu Wu, Ying Xiong, Yufei Cui, Xue Liu, Buzhou Tang, Tei-Wei Kuo, Chun Jason Xue

Compositional Generalization for Data-to-Text Generation

Dec 05, 2023
Xinnuo Xu, Ivan Titov, Mirella Lapata

Speech and Text-Based Emotion Recognizer

Dec 10, 2023
Varun Sharma

Large OCR Model: An Empirical Study of Scaling Law for OCR

Jan 02, 2024
Miao Rang, Zhenni Bi, Chuanjian Liu, Yunhe Wang, Kai Han

Transfer the linguistic representations from TTS to accent conversion with non-parallel data

Jan 07, 2024
Xi Chen, Jiakun Pei, Liumeng Xue, Mingyang Zhang

Towards Online Sign Language Recognition and Translation

Jan 10, 2024
Ronglai Zuo, Fangyun Wei, Brian Mak
