"speech": models, code, and papers

MCMChaos: Improvising Rap Music with MCMC Methods and Chaos Theory

Jan 15, 2024
Robert G. Kimelman

Single-channel speech enhancement using learnable loss mixup

Dec 20, 2023
Oscar Chang, Dung N. Tran, Kazuhito Koishida

DSNet: Disentangled Siamese Network with Neutral Calibration for Speech Emotion Recognition

Dec 25, 2023
Chengxin Chen, Pengyuan Zhang

Performance Assessment of ChatGPT vs Bard in Detecting Alzheimer's Dementia

Jan 30, 2024
Balamurali B T, Jer-Ming Chen

Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement

Dec 14, 2023
George Close, William Ravenscroft, Thomas Hain, Stefan Goetze

An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition

Dec 06, 2023
Yukiya Hono, Koh Mitsuda, Tianyu Zhao, Kentaro Mitsui, Toshiaki Wakatsuki, Kei Sawada

Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization

Jan 23, 2024
Prachi Singh, Sriram Ganapathy

On Robustness to Missing Video for Audiovisual Speech Recognition

Dec 19, 2023
Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan

Embedding-based search in JetBrains IDEs

Jan 26, 2024
Evgeny Abramov, Nikolai Palchikov

A RAG-based Question Answering System Proposal for Understanding Islam: MufassirQAS LLM

Jan 31, 2024
Ahmet Yusuf Alan, Enis Karaarslan, Ömer Aydin
