Sourav Bhattacharya

Fast Inference Through The Reuse Of Attention Maps In Diffusion Models

Dec 13, 2023
Rosco Hunter, Łukasz Dudziak, Mohamed S. Abdelfattah, Abhinav Mehrotra, Sourav Bhattacharya, Hongkai Wen

Sumformer: A Linear-Complexity Alternative to Self-Attention for Speech Recognition

Jul 12, 2023
Titouan Parcollet, Rogier van Dalen, Shucong Zhang, Sourav Bhattacharya

Cross-Attention is all you need: Real-Time Streaming Transformers for Personalised Speech Enhancement

Nov 08, 2022
Shucong Zhang, Malcolm Chadwick, Alberto Gil C. P. Ramos, Sourav Bhattacharya

Defensive Tensorization

Oct 26, 2021
Adrian Bulat, Jean Kossaifi, Sourav Bhattacharya, Yannis Panagakis, Timothy Hospedales, Georgios Tzimiropoulos, Nicholas D Lane, Maja Pantic

Bunched LPCNet : Vocoder for Low-cost Neural Text-To-Speech Systems

Aug 11, 2020
Ravichander Vipperla, Sangjun Park, Kihyun Choo, Samin Ishtiaq, Kyoungbo Min, Sourav Bhattacharya, Abhinav Mehrotra, Alberto Gil C. P. Ramos, Nicholas D. Lane

Iterative Compression of End-to-End ASR Model using AutoML

Aug 06, 2020
Abhinav Mehrotra, Łukasz Dudziak, Jinsu Yeo, Young-yoon Lee, Ravichander Vipperla, Mohamed S. Abdelfattah, Sourav Bhattacharya, Samin Ishtiaq, Alberto Gil C. P. Ramos, SangJeong Lee, Daehyun Kim, Nicholas D. Lane

MobiSR: Efficient On-Device Super-Resolution through Heterogeneous Mobile Processors

Aug 21, 2019
Royson Lee, Stylianos I. Venieris, Łukasz Dudziak, Sourav Bhattacharya, Nicholas D. Lane

Cross-modal Recurrent Models for Weight Objective Prediction from Multimodal Time-series Data

Nov 29, 2017
Petar Veličković, Laurynas Karazija, Nicholas D. Lane, Sourav Bhattacharya, Edgar Liberis, Pietro Liò, Angela Chieh, Otmane Bellahsen, Matthieu Vegreville

Towards Using Unlabeled Data in a Sparse-coding Framework for Human Activity Recognition

Jul 23, 2014
Sourav Bhattacharya, Petteri Nurmi, Nils Hammerla, Thomas Plötz
