Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Masking Kernel for Learning Energy-Efficient Speech Representation

Feb 08, 2023

Apiwat Ditthapron, Emmanuel O. Agu, Adam C. Lammert

Figure 1 for Masking Kernel for Learning Energy-Efficient Speech Representation

Figure 2 for Masking Kernel for Learning Energy-Efficient Speech Representation

Figure 3 for Masking Kernel for Learning Energy-Efficient Speech Representation

Figure 4 for Masking Kernel for Learning Energy-Efficient Speech Representation

Share this with someone who'll enjoy it:

Abstract:Modern smartphones are equipped with powerful audio hardware and processors, allowing them to acquire and perform on-device speech processing at high sampling rates. However, energy consumption remains a concern, especially for resource-intensive DNNs. Prior mobile speech processing reduced computational complexity by compacting the model or reducing input dimensions via hyperparameter tuning, which reduced accuracy or required more training iterations. This paper proposes gradient descent for optimizing energy-efficient speech recording format (length and sampling rate). The goal is to reduce the input size, which reduces data collection and inference energy. For a backward pass, a masking function with non-zero derivatives (Gaussian, Hann, and Hamming) is used as a windowing function and a lowpass filter. An energy-efficient penalty is introduced to incentivize the reduction of the input size. The proposed masking outperformed baselines by 8.7% in speaker recognition and traumatic brain injury detection using 49% shorter duration, sampled at a lower frequency.

View paper on

Share this with someone who'll enjoy it:

Title:Masking Kernel for Learning Energy-Efficient Speech Representation

Paper and Code