Alert button
Picture for Bhiksha Raj

Bhiksha Raj

Alert button

Language Technologies Institute, Carnegie Mellon University, Mohammed bin Zayed University of AI

Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session

Add code
Bookmark button
Alert button
Feb 20, 2023
Laurie M. Heller, Benjamin Elizalde, Bhiksha Raj, Soham Deshmuk

Viaarxiv icon

PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement

Add code
Bookmark button
Alert button
Feb 16, 2023
Muqiao Yang, Joseph Konan, David Bick, Yunyang Zeng, Shuo Han, Anurag Kumar, Shinji Watanabe, Bhiksha Raj

Figure 1 for PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement
Figure 2 for PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement
Figure 3 for PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement
Viaarxiv icon

TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement

Add code
Bookmark button
Alert button
Feb 16, 2023
Yunyang Zeng, Joseph Konan, Shuo Han, David Bick, Muqiao Yang, Anurag Kumar, Shinji Watanabe, Bhiksha Raj

Figure 1 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Figure 2 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Figure 3 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Figure 4 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Viaarxiv icon

SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning

Add code
Bookmark button
Alert button
Jan 26, 2023
Hao Chen, Ran Tao, Yue Fan, Yidong Wang, Jindong Wang, Bernt Schiele, Xing Xie, Bhiksha Raj, Marios Savvides

Figure 1 for SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning
Figure 2 for SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning
Figure 3 for SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning
Figure 4 for SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning
Viaarxiv icon

Understanding Political Polarisation using Language Models: A dataset and method

Add code
Bookmark button
Alert button
Jan 02, 2023
Samiran Gode, Supreeth Bare, Bhiksha Raj, Hyungon Yoo

Figure 1 for Understanding Political Polarisation using Language Models: A dataset and method
Figure 2 for Understanding Political Polarisation using Language Models: A dataset and method
Figure 3 for Understanding Political Polarisation using Language Models: A dataset and method
Figure 4 for Understanding Political Polarisation using Language Models: A dataset and method
Viaarxiv icon

VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

Add code
Bookmark button
Alert button
Nov 28, 2022
Kashu Yamazaki, Khoa Vo, Sang Truong, Bhiksha Raj, Ngan Le

Figure 1 for VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Figure 2 for VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Figure 3 for VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Figure 4 for VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Viaarxiv icon

Panoramic Video Salient Object Detection with Ambisonic Audio Guidance

Add code
Bookmark button
Alert button
Nov 26, 2022
Xiang Li, Haoyuan Cao, Shijie Zhao, Junlin Li, Li Zhang, Bhiksha Raj

Figure 1 for Panoramic Video Salient Object Detection with Ambisonic Audio Guidance
Figure 2 for Panoramic Video Salient Object Detection with Ambisonic Audio Guidance
Figure 3 for Panoramic Video Salient Object Detection with Ambisonic Audio Guidance
Figure 4 for Panoramic Video Salient Object Detection with Ambisonic Audio Guidance
Viaarxiv icon

An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning

Add code
Bookmark button
Alert button
Nov 20, 2022
Hao Chen, Yue Fan, Yidong Wang, Jindong Wang, Bernt Schiele, Xing Xie, Marios Savvides, Bhiksha Raj

Figure 1 for An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning
Figure 2 for An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning
Figure 3 for An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning
Figure 4 for An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning
Viaarxiv icon

Describing emotions with acoustic property prompts for speech emotion recognition

Add code
Bookmark button
Alert button
Nov 14, 2022
Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh

Figure 1 for Describing emotions with acoustic property prompts for speech emotion recognition
Figure 2 for Describing emotions with acoustic property prompts for speech emotion recognition
Figure 3 for Describing emotions with acoustic property prompts for speech emotion recognition
Figure 4 for Describing emotions with acoustic property prompts for speech emotion recognition
Viaarxiv icon

XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers

Add code
Bookmark button
Alert button
Oct 29, 2022
Roshan Sharma, Bhiksha Raj

Figure 1 for XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers
Figure 2 for XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers
Figure 3 for XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers
Figure 4 for XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers
Viaarxiv icon