Picture for Hossein Zeinali

Hossein Zeinali

Amirkabir University of Technology

Leveraging Visemes for Better Visual Speech Representation and Lip Reading

Add code
Jul 19, 2023
Figure 1 for Leveraging Visemes for Better Visual Speech Representation and Lip Reading
Figure 2 for Leveraging Visemes for Better Visual Speech Representation and Lip Reading
Figure 3 for Leveraging Visemes for Better Visual Speech Representation and Lip Reading
Viaarxiv icon

Word-level Persian Lipreading Dataset

Add code
Apr 08, 2023
Figure 1 for Word-level Persian Lipreading Dataset
Figure 2 for Word-level Persian Lipreading Dataset
Figure 3 for Word-level Persian Lipreading Dataset
Figure 4 for Word-level Persian Lipreading Dataset
Viaarxiv icon

ArmanTTS single-speaker Persian dataset

Add code
Apr 07, 2023
Figure 1 for ArmanTTS single-speaker Persian dataset
Figure 2 for ArmanTTS single-speaker Persian dataset
Figure 3 for ArmanTTS single-speaker Persian dataset
Figure 4 for ArmanTTS single-speaker Persian dataset
Viaarxiv icon

A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset

Add code
Jan 21, 2023
Figure 1 for A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset
Figure 2 for A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset
Figure 3 for A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset
Figure 4 for A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset
Viaarxiv icon

ArmanEmo: A Persian Dataset for Text-based Emotion Detection

Add code
Jul 24, 2022
Figure 1 for ArmanEmo: A Persian Dataset for Text-based Emotion Detection
Figure 2 for ArmanEmo: A Persian Dataset for Text-based Emotion Detection
Figure 3 for ArmanEmo: A Persian Dataset for Text-based Emotion Detection
Figure 4 for ArmanEmo: A Persian Dataset for Text-based Emotion Detection
Viaarxiv icon

Lip reading using external viseme decoding

Add code
Apr 10, 2021
Figure 1 for Lip reading using external viseme decoding
Figure 2 for Lip reading using external viseme decoding
Figure 3 for Lip reading using external viseme decoding
Figure 4 for Lip reading using external viseme decoding
Viaarxiv icon

Short-duration Speaker Verification (SdSV) Challenge 2020: the Challenge Evaluation Plan

Add code
Jan 10, 2020
Figure 1 for Short-duration Speaker Verification (SdSV) Challenge 2020: the Challenge Evaluation Plan
Viaarxiv icon

A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database

Add code
Dec 08, 2019
Figure 1 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Figure 2 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Figure 3 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Figure 4 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Viaarxiv icon

BUT System Description to VoxCeleb Speaker Recognition Challenge 2019

Add code
Oct 16, 2019
Figure 1 for BUT System Description to VoxCeleb Speaker Recognition Challenge 2019
Figure 2 for BUT System Description to VoxCeleb Speaker Recognition Challenge 2019
Figure 3 for BUT System Description to VoxCeleb Speaker Recognition Challenge 2019
Viaarxiv icon

BUT VOiCES 2019 System Description

Add code
Jul 13, 2019
Figure 1 for BUT VOiCES 2019 System Description
Figure 2 for BUT VOiCES 2019 System Description
Figure 3 for BUT VOiCES 2019 System Description
Viaarxiv icon