Picture for Shiwan Zhao

Shiwan Zhao

PB-LRDWWS System for the SLT 2024 Low-Resource Dysarthria Wake-Up Word Spotting Challenge

Add code
Sep 07, 2024
Figure 1 for PB-LRDWWS System for the SLT 2024 Low-Resource Dysarthria Wake-Up Word Spotting Challenge
Figure 2 for PB-LRDWWS System for the SLT 2024 Low-Resource Dysarthria Wake-Up Word Spotting Challenge
Figure 3 for PB-LRDWWS System for the SLT 2024 Low-Resource Dysarthria Wake-Up Word Spotting Challenge
Figure 4 for PB-LRDWWS System for the SLT 2024 Low-Resource Dysarthria Wake-Up Word Spotting Challenge
Viaarxiv icon

Strategic Chain-of-Thought: Guiding Accurate Reasoning in LLMs through Strategy Elicitation

Add code
Sep 05, 2024
Figure 1 for Strategic Chain-of-Thought: Guiding Accurate Reasoning in LLMs through Strategy Elicitation
Figure 2 for Strategic Chain-of-Thought: Guiding Accurate Reasoning in LLMs through Strategy Elicitation
Figure 3 for Strategic Chain-of-Thought: Guiding Accurate Reasoning in LLMs through Strategy Elicitation
Figure 4 for Strategic Chain-of-Thought: Guiding Accurate Reasoning in LLMs through Strategy Elicitation
Viaarxiv icon

Uncertainty-Aware Mean Opinion Score Prediction

Add code
Aug 23, 2024
Viaarxiv icon

Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives

Add code
Aug 13, 2024
Figure 1 for Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives
Figure 2 for Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives
Figure 3 for Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives
Figure 4 for Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives
Viaarxiv icon

Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition

Add code
Aug 01, 2024
Figure 1 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Figure 2 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Figure 3 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Figure 4 for Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition
Viaarxiv icon

Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation

Add code
Jul 26, 2024
Figure 1 for Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation
Figure 2 for Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation
Figure 3 for Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation
Figure 4 for Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation
Viaarxiv icon

Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework

Add code
Jul 12, 2024
Figure 1 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Figure 2 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Figure 3 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Figure 4 for Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Viaarxiv icon

Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs

Add code
Jul 12, 2024
Figure 1 for Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs
Figure 2 for Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs
Figure 3 for Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs
Figure 4 for Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs
Viaarxiv icon

Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores

Add code
Jun 06, 2024
Figure 1 for Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores
Figure 2 for Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores
Figure 3 for Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores
Figure 4 for Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores
Viaarxiv icon

kNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels

Add code
Dec 21, 2023
Viaarxiv icon