Picture for Vicky Zayats

Vicky Zayats

Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities

Add code
May 29, 2024
Viaarxiv icon

Robust Preference Optimization through Reward Model Distillation

Add code
May 29, 2024
Viaarxiv icon

AudioPaLM: A Large Language Model That Can Speak and Listen

Add code
Jun 22, 2023
Figure 1 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 2 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 3 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 4 for AudioPaLM: A Large Language Model That Can Speak and Listen
Viaarxiv icon

MultiTurnCleanup: A Benchmark for Multi-Turn Spoken Conversational Transcript Cleanup

Add code
May 19, 2023
Figure 1 for MultiTurnCleanup: A Benchmark for Multi-Turn Spoken Conversational Transcript Cleanup
Figure 2 for MultiTurnCleanup: A Benchmark for Multi-Turn Spoken Conversational Transcript Cleanup
Figure 3 for MultiTurnCleanup: A Benchmark for Multi-Turn Spoken Conversational Transcript Cleanup
Figure 4 for MultiTurnCleanup: A Benchmark for Multi-Turn Spoken Conversational Transcript Cleanup
Viaarxiv icon

Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection

Add code
May 02, 2022
Figure 1 for Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection
Figure 2 for Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection
Figure 3 for Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection
Figure 4 for Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection
Viaarxiv icon

Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech

Add code
Sep 14, 2021
Figure 1 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Figure 2 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Figure 3 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Figure 4 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Viaarxiv icon

Disfluency Detection with Unlabeled Data and Small BERT Models

Add code
Apr 21, 2021
Figure 1 for Disfluency Detection with Unlabeled Data and Small BERT Models
Figure 2 for Disfluency Detection with Unlabeled Data and Small BERT Models
Figure 3 for Disfluency Detection with Unlabeled Data and Small BERT Models
Figure 4 for Disfluency Detection with Unlabeled Data and Small BERT Models
Viaarxiv icon

Representations for Question Answering from Documents with Tables and Text

Add code
Jan 26, 2021
Figure 1 for Representations for Question Answering from Documents with Tables and Text
Figure 2 for Representations for Question Answering from Documents with Tables and Text
Figure 3 for Representations for Question Answering from Documents with Tables and Text
Figure 4 for Representations for Question Answering from Documents with Tables and Text
Viaarxiv icon

Disfluencies and Human Speech Transcription Errors

Add code
Apr 08, 2019
Figure 1 for Disfluencies and Human Speech Transcription Errors
Figure 2 for Disfluencies and Human Speech Transcription Errors
Figure 3 for Disfluencies and Human Speech Transcription Errors
Figure 4 for Disfluencies and Human Speech Transcription Errors
Viaarxiv icon

Giving Attention to the Unexpected: Using Prosody Innovations in Disfluency Detection

Add code
Apr 08, 2019
Figure 1 for Giving Attention to the Unexpected: Using Prosody Innovations in Disfluency Detection
Figure 2 for Giving Attention to the Unexpected: Using Prosody Innovations in Disfluency Detection
Figure 3 for Giving Attention to the Unexpected: Using Prosody Innovations in Disfluency Detection
Figure 4 for Giving Attention to the Unexpected: Using Prosody Innovations in Disfluency Detection
Viaarxiv icon