Yongqiang Wang

SLM: Bridge the thin gap between speech and text foundation models

Sep 30, 2023

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

Sep 14, 2023

Undergraduate Research of Decentralized Localization of Roombas Through Usage of Wall-Finding Software

Sep 11, 2023

Fast and High-Performance Learned Image Compression With Improved Checkerboard Context Model, Deformable Residual Module, and Knowledge Distillation

Sep 05, 2023

Enhanced Residual SwinV2 Transformer for Learned Image Compression

Aug 23, 2023

Microvasculature Segmentation in Human BioMolecular Atlas Program (HuBMAP)

Aug 06, 2023

MFMAN-YOLO: A Method for Detecting Pole-like Obstacles in Complex Environment

Jul 24, 2023

Locally Differentially Private Distributed Online Learning with Guaranteed Optimality

Jun 25, 2023

AudioPaLM: A Large Language Model That Can Speak and Listen

Jun 22, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Mar 03, 2023