Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

A Single Self-Supervised Model for Many Speech Modalities Enables Zero-Shot Modality Transfer


Jul 14, 2022
Wei-Ning Hsu , Bowen Shi


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Open-Domain Sign Language Translation Learned from Online Video


May 25, 2022
Bowen Shi , Diane Brentari , Greg Shakhnarovich , Karen Livescu

* 17 pages 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT


May 15, 2022
Bowen Shi , Abdelrahman Mohamed , Wei-Ning Hsu

* Submitted to Interspeech 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Searching for fingerspelled content in American Sign Language


Mar 24, 2022
Bowen Shi , Diane Brentari , Greg Shakhnarovich , Karen Livescu

* ACL 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Robust Self-Supervised Audio-Visual Speech Recognition


Jan 05, 2022
Bowen Shi , Wei-Ning Hsu , Abdelrahman Mohamed


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction


Jan 05, 2022
Bowen Shi , Wei-Ning Hsu , Kushal Lakhotia , Abdelrahman Mohamed


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Hierarchical Graph Networks for 3D Human Pose Estimation


Nov 23, 2021
Han Li , Bowen Shi , Wenrui Dai , Yabo Chen , Botao Wang , Yu Sun , Min Guo , Chenlin Li , Junni Zou , Hongkai Xiong

* accepted by BMVC 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Multi-dataset Pretraining: A Unified Model for Semantic Segmentation


Jun 08, 2021
Bowen Shi , Xiaopeng Zhang , Haohang Xu , Wenrui Dai , Junni Zou , Hongkai Xiong , Qi Tian


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Fingerspelling Detection in American Sign Language


Apr 03, 2021
Bowen Shi , Diane Brentari , Greg Shakhnarovich , Karen Livescu

* CVPR 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings


Jul 01, 2020
Bowen Shi , Shane Settle , Karen Livescu


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
>>