Alert button
Picture for Jinhan Wang

Jinhan Wang

Alert button

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Add code
Bookmark button
Alert button
Jan 26, 2024
Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran

Viaarxiv icon

Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals

Add code
Bookmark button
Alert button
Jun 06, 2023
Jinhan Wang, Vijay Ravi, Abeer Alwan

Figure 1 for Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals
Figure 2 for Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals
Figure 3 for Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals
Figure 4 for Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals
Viaarxiv icon

Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR

Add code
Bookmark button
Alert button
Apr 28, 2023
Ruchao Fan, Yunzheng Zhu, Jinhan Wang, Abeer Alwan

Figure 1 for Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR
Figure 2 for Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR
Figure 3 for Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR
Figure 4 for Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR
Viaarxiv icon

A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement

Add code
Bookmark button
Alert button
Jun 29, 2022
Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan

Figure 1 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Figure 2 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Figure 3 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Figure 4 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Viaarxiv icon

Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals

Add code
Bookmark button
Alert button
Jun 27, 2022
Jinhan Wang, Vijay Ravi, Jonathan Flint, Abeer Alwan

Figure 1 for Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals
Figure 2 for Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals
Figure 3 for Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals
Figure 4 for Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals
Viaarxiv icon

VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition

Add code
Bookmark button
Alert button
Feb 22, 2022
Jinhan Wang, Xiaosu Tong, Jinxi Guo, Di He, Roland Maas

Figure 1 for VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition
Figure 2 for VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition
Figure 3 for VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition
Figure 4 for VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition
Viaarxiv icon

FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals

Add code
Bookmark button
Alert button
Feb 11, 2022
Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan

Figure 1 for FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals
Figure 2 for FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals
Figure 3 for FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals
Figure 4 for FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals
Viaarxiv icon

Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System

Add code
Bookmark button
Alert button
Jun 18, 2021
Jinhan Wang, Yunzheng Zhu, Ruchao Fan, Wei Chu, Abeer Alwan

Figure 1 for Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System
Figure 2 for Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System
Figure 3 for Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System
Viaarxiv icon