Alert button
Picture for Siddique Latif

Siddique Latif

Alert button

Sparks of Large Audio Models: A Survey and Outlook

Add code
Bookmark button
Alert button
Sep 03, 2023
Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Björn W. Schuller

Figure 1 for Sparks of Large Audio Models: A Survey and Outlook
Figure 2 for Sparks of Large Audio Models: A Survey and Outlook
Figure 3 for Sparks of Large Audio Models: A Survey and Outlook
Figure 4 for Sparks of Large Audio Models: A Survey and Outlook
Viaarxiv icon

Cross-Language Speech Emotion Recognition Using Multimodal Dual Attention Transformers

Add code
Bookmark button
Alert button
Jul 14, 2023
Syed Aun Muhammad Zaidi, Siddique Latif, Junaid Qadir

Figure 1 for Cross-Language Speech Emotion Recognition Using Multimodal Dual Attention Transformers
Figure 2 for Cross-Language Speech Emotion Recognition Using Multimodal Dual Attention Transformers
Figure 3 for Cross-Language Speech Emotion Recognition Using Multimodal Dual Attention Transformers
Figure 4 for Cross-Language Speech Emotion Recognition Using Multimodal Dual Attention Transformers
Viaarxiv icon

Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers

Add code
Bookmark button
Alert button
Jul 12, 2023
Siddique Latif, Muhammad Usama, Mohammad Ibrahim Malik, Björn W. Schuller

Figure 1 for Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers
Figure 2 for Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers
Figure 3 for Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers
Figure 4 for Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers
Viaarxiv icon

A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model

Add code
Bookmark button
Alert button
May 19, 2023
Ibrahim Malik, Siddique Latif, Raja Jurdak, Björn Schuller

Figure 1 for A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model
Figure 2 for A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model
Figure 3 for A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model
Figure 4 for A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model
Viaarxiv icon

Emotions Beyond Words: Non-Speech Audio Emotion Recognition With Edge Computing

Add code
Bookmark button
Alert button
May 01, 2023
Ibrahim Malik, Siddique Latif, Sanaullah Manzoor, Muhammad Usama, Junaid Qadir, Raja Jurdak

Figure 1 for Emotions Beyond Words: Non-Speech Audio Emotion Recognition With Edge Computing
Figure 2 for Emotions Beyond Words: Non-Speech Audio Emotion Recognition With Edge Computing
Figure 3 for Emotions Beyond Words: Non-Speech Audio Emotion Recognition With Edge Computing
Figure 4 for Emotions Beyond Words: Non-Speech Audio Emotion Recognition With Edge Computing
Viaarxiv icon

Lightweight Toxicity Detection in Spoken Language: A Transformer-based Approach for Edge Devices

Add code
Bookmark button
Alert button
Apr 22, 2023
Ahlam Husni Abu Nada, Siddique Latif, Junaid Qadir

Figure 1 for Lightweight Toxicity Detection in Spoken Language: A Transformer-based Approach for Edge Devices
Figure 2 for Lightweight Toxicity Detection in Spoken Language: A Transformer-based Approach for Edge Devices
Figure 3 for Lightweight Toxicity Detection in Spoken Language: A Transformer-based Approach for Edge Devices
Figure 4 for Lightweight Toxicity Detection in Spoken Language: A Transformer-based Approach for Edge Devices
Viaarxiv icon

MESAHA-Net: Multi-Encoders based Self-Adaptive Hard Attention Network with Maximum Intensity Projections for Lung Nodule Segmentation in CT Scan

Add code
Bookmark button
Alert button
Apr 04, 2023
Muhammad Usman, Azka Rehman, Abdullah Shahid, Siddique Latif, Shi Sub Byon, Sung Hyun Kim, Tariq Mahmood Khan, Yeong Gil Shin

Figure 1 for MESAHA-Net: Multi-Encoders based Self-Adaptive Hard Attention Network with Maximum Intensity Projections for Lung Nodule Segmentation in CT Scan
Figure 2 for MESAHA-Net: Multi-Encoders based Self-Adaptive Hard Attention Network with Maximum Intensity Projections for Lung Nodule Segmentation in CT Scan
Figure 3 for MESAHA-Net: Multi-Encoders based Self-Adaptive Hard Attention Network with Maximum Intensity Projections for Lung Nodule Segmentation in CT Scan
Figure 4 for MESAHA-Net: Multi-Encoders based Self-Adaptive Hard Attention Network with Maximum Intensity Projections for Lung Nodule Segmentation in CT Scan
Viaarxiv icon

Transformers in Speech Processing: A Survey

Add code
Bookmark button
Alert button
Mar 21, 2023
Siddique Latif, Aun Zaidi, Heriberto Cuayahuitl, Fahad Shamshad, Moazzam Shoukat, Junaid Qadir

Figure 1 for Transformers in Speech Processing: A Survey
Figure 2 for Transformers in Speech Processing: A Survey
Figure 3 for Transformers in Speech Processing: A Survey
Figure 4 for Transformers in Speech Processing: A Survey
Viaarxiv icon