VocalSound: A Dataset for Improving Human Vocal Sounds Recognition



Yuan Gong, Jin Yu, James Glass

* Accepted at ICASSP 2022. Dataset and code at https://github.com/YuanGongND/vocalsound, with an interactive Colab demo at https://colab.research.google.com/github/YuanGongND/vocalsound/blob/main/colab/VocalSound.ipynb


Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment



Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James Glass

* Accepted at ICASSP 2022. Code at https://github.com/YuanGongND/gopt, with an interactive Colab demo at https://colab.research.google.com/github/YuanGongND/gopt/blob/master/colab/GOPT_GPU.ipynb


Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network



Shanshan Lao, Yuan Gong, Shuwei Shi, Sidi Yang, Tianhe Wu, Jiahao Wang, Weihao Xia, Yujiu Yang



MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment



Sidi Yang, Tianhe Wu, Shuwei Shi, Shanshan Lao, Yuan Gong, Mingdeng Cao, Jiahao Wang, Yujiu Yang



CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification



Yuan Gong, Sameer Khurana, Andrew Rouditchenko, James Glass



Focal and Global Knowledge Distillation for Detectors



Zhendong Yang, Zhe Li, Xiaohu Jiang, Yuan Gong, Zehuan Yuan, Danpei Zhao, Chun Yuan

* 10 pages, 6 figures, 7 tables 


SSAST: Self-Supervised Audio Spectrogram Transformer



Yuan Gong, Cheng-I Jeff Lai, Yu-An Chung, James Glass



AST: Audio Spectrogram Transformer



Yuan Gong, Yu-An Chung, James Glass



PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation



Yuan Gong, Yu-An Chung, James Glass


