Picture for Joseph Keshet

Joseph Keshet

Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits

Add code
Feb 17, 2026
Viaarxiv icon

Analyzing and Guiding Zero-Shot Posterior Sampling in Diffusion Models

Add code
Feb 07, 2026
Viaarxiv icon

CarelessWhisper: Turning Whisper into a Causal Streaming Model

Add code
Aug 17, 2025
Viaarxiv icon

How Does a Deep Neural Network Look at Lexical Stress?

Add code
Aug 10, 2025
Viaarxiv icon

Keyword Spotting with Hyper-Matched Filters for Small Footprint Devices

Add code
Aug 06, 2025
Figure 1 for Keyword Spotting with Hyper-Matched Filters for Small Footprint Devices
Figure 2 for Keyword Spotting with Hyper-Matched Filters for Small Footprint Devices
Figure 3 for Keyword Spotting with Hyper-Matched Filters for Small Footprint Devices
Figure 4 for Keyword Spotting with Hyper-Matched Filters for Small Footprint Devices
Viaarxiv icon

UmbraTTS: Adapting Text-to-Speech to Environmental Contexts with Flow Matching

Add code
Jun 11, 2025
Viaarxiv icon

FlowTSE: Target Speaker Extraction with Flow Matching

Add code
May 20, 2025
Viaarxiv icon

Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASR

Add code
Sep 24, 2024
Figure 1 for Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASR
Figure 2 for Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASR
Figure 3 for Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASR
Figure 4 for Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASR
Viaarxiv icon

WhisperNER: Unified Open Named Entity and Speech Recognition

Add code
Sep 12, 2024
Figure 1 for WhisperNER: Unified Open Named Entity and Speech Recognition
Figure 2 for WhisperNER: Unified Open Named Entity and Speech Recognition
Figure 3 for WhisperNER: Unified Open Named Entity and Speech Recognition
Figure 4 for WhisperNER: Unified Open Named Entity and Speech Recognition
Viaarxiv icon

HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing

Add code
Jul 10, 2024
Viaarxiv icon