Mark Hasegawa-Johnson

That's Deprecated! Understanding, Detecting, and Steering Knowledge Conflicts in Language Models for Code Generation

Oct 21, 2025

The Interspeech 2025 Speech Accessibility Project Challenge

Jul 29, 2025

ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models

Jul 27, 2025

ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization

Jun 12, 2025

SyncDiff: Diffusion-based Talking Head Synthesis with Bottlenecked Temporal Visual Prior for Improved Synchronization

Mar 17, 2025

Robust Cross-Etiology and Speaker-Independent Dysarthric Speech Recognition

Jan 25, 2025

R2I-rPPG: A Robust Region of Interest Selection Method for Remote Photoplethysmography to Extract Heart Rate

Oct 21, 2024

Fine-Tuning Automatic Speech Recognition for People with Parkinson's: An Effective Strategy for Enhancing Speech Technology Accessibility

Sep 29, 2024

Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue

Sep 07, 2024

LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition

Aug 11, 2024