Picture for He Huang

He Huang

A Trustworthy AIoT-enabled Localization System via Federated Learning and Blockchain

Add code
Jul 08, 2024
Viaarxiv icon

Less is More: Accurate Speech Recognition & Translation without Web-Scale Data

Add code
Jun 28, 2024
Viaarxiv icon

BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5

Add code
Jun 28, 2024
Viaarxiv icon

DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment

Add code
Jun 27, 2024
Viaarxiv icon

Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation

Add code
Oct 18, 2023
Figure 1 for Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation
Figure 2 for Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation
Figure 3 for Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation
Figure 4 for Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation
Viaarxiv icon

The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System

Add code
Oct 18, 2023
Viaarxiv icon

SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation

Add code
Oct 13, 2023
Figure 1 for SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation
Figure 2 for SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation
Figure 3 for SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation
Figure 4 for SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation
Viaarxiv icon

AdaPose: Towards Cross-Site Device-Free Human Pose Estimation with Commodity WiFi

Add code
Sep 29, 2023
Viaarxiv icon

Practical Parallel Algorithms for Non-Monotone Submodular Maximization

Add code
Aug 21, 2023
Figure 1 for Practical Parallel Algorithms for Non-Monotone Submodular Maximization
Figure 2 for Practical Parallel Algorithms for Non-Monotone Submodular Maximization
Figure 3 for Practical Parallel Algorithms for Non-Monotone Submodular Maximization
Figure 4 for Practical Parallel Algorithms for Non-Monotone Submodular Maximization
Viaarxiv icon

Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling

Add code
Jul 13, 2023
Figure 1 for Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling
Figure 2 for Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling
Figure 3 for Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling
Figure 4 for Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling
Viaarxiv icon