Picture for Ho-Lam Chung

Ho-Lam Chung

Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents

Add code
Feb 27, 2024
Viaarxiv icon

Towards audio language modeling -- an overview

Add code
Feb 20, 2024
Figure 1 for Towards audio language modeling -- an overview
Figure 2 for Towards audio language modeling -- an overview
Figure 3 for Towards audio language modeling -- an overview
Figure 4 for Towards audio language modeling -- an overview
Viaarxiv icon

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models

Add code
Feb 20, 2024
Figure 1 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Figure 2 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Figure 3 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Figure 4 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Viaarxiv icon

GSQA: An End-to-End Model for Generative Spoken Question Answering

Add code
Dec 25, 2023
Viaarxiv icon

Towards General-Purpose Text-Instruction-Guided Voice Conversion

Add code
Sep 25, 2023
Figure 1 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Figure 2 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Figure 3 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Figure 4 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Viaarxiv icon

ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

Add code
May 18, 2023
Figure 1 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Figure 2 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Figure 3 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Viaarxiv icon

T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5

Add code
Nov 01, 2022
Figure 1 for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
Figure 2 for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
Figure 3 for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
Figure 4 for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
Viaarxiv icon

DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering

Add code
Mar 26, 2022
Figure 1 for DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering
Figure 2 for DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering
Figure 3 for DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering
Figure 4 for DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering
Viaarxiv icon

Improving Controllability of Educational Question Generation by Keyword Provision

Add code
Dec 02, 2021
Figure 1 for Improving Controllability of Educational Question Generation by Keyword Provision
Figure 2 for Improving Controllability of Educational Question Generation by Keyword Provision
Figure 3 for Improving Controllability of Educational Question Generation by Keyword Provision
Figure 4 for Improving Controllability of Educational Question Generation by Keyword Provision
Viaarxiv icon

A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies

Add code
Oct 12, 2020
Figure 1 for A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies
Figure 2 for A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies
Figure 3 for A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies
Figure 4 for A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies
Viaarxiv icon