Picture for David Chiang

David Chiang

Probability Distributions Computed by Hard-Attention Transformers

Add code
Oct 31, 2025
Viaarxiv icon

Frustratingly Easy Data Augmentation for Low-Resource ASR

Add code
Sep 18, 2025
Viaarxiv icon

Using Source-Side Confidence Estimation for Reliable Translation into Unfamiliar Languages

Add code
Mar 30, 2025
Figure 1 for Using Source-Side Confidence Estimation for Reliable Translation into Unfamiliar Languages
Figure 2 for Using Source-Side Confidence Estimation for Reliable Translation into Unfamiliar Languages
Figure 3 for Using Source-Side Confidence Estimation for Reliable Translation into Unfamiliar Languages
Figure 4 for Using Source-Side Confidence Estimation for Reliable Translation into Unfamiliar Languages
Viaarxiv icon

Simulating Hard Attention Using Soft Attention

Add code
Dec 13, 2024
Figure 1 for Simulating Hard Attention Using Soft Attention
Figure 2 for Simulating Hard Attention Using Soft Attention
Viaarxiv icon

Improving Rare Word Translation With Dictionaries and Attention Masking

Add code
Aug 17, 2024
Figure 1 for Improving Rare Word Translation With Dictionaries and Attention Masking
Figure 2 for Improving Rare Word Translation With Dictionaries and Attention Masking
Figure 3 for Improving Rare Word Translation With Dictionaries and Attention Masking
Figure 4 for Improving Rare Word Translation With Dictionaries and Attention Masking
Viaarxiv icon

Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't

Add code
Jun 13, 2024
Figure 1 for Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't
Figure 2 for Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't
Figure 3 for Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't
Figure 4 for Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't
Viaarxiv icon

PILA: A Historical-Linguistic Dataset of Proto-Italic and Latin

Add code
Apr 25, 2024
Viaarxiv icon

Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information

Add code
Apr 23, 2024
Figure 1 for Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information
Figure 2 for Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information
Figure 3 for Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information
Figure 4 for Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information
Viaarxiv icon

Nostra Domina at EvaLatin 2024: Improving Latin Polarity Detection through Data Augmentation

Add code
Apr 11, 2024
Figure 1 for Nostra Domina at EvaLatin 2024: Improving Latin Polarity Detection through Data Augmentation
Figure 2 for Nostra Domina at EvaLatin 2024: Improving Latin Polarity Detection through Data Augmentation
Figure 3 for Nostra Domina at EvaLatin 2024: Improving Latin Polarity Detection through Data Augmentation
Figure 4 for Nostra Domina at EvaLatin 2024: Improving Latin Polarity Detection through Data Augmentation
Viaarxiv icon

We're Calling an Intervention: Taking a Closer Look at Language Model Adaptation to Different Types of Linguistic Variation

Add code
Apr 10, 2024
Figure 1 for We're Calling an Intervention: Taking a Closer Look at Language Model Adaptation to Different Types of Linguistic Variation
Figure 2 for We're Calling an Intervention: Taking a Closer Look at Language Model Adaptation to Different Types of Linguistic Variation
Figure 3 for We're Calling an Intervention: Taking a Closer Look at Language Model Adaptation to Different Types of Linguistic Variation
Figure 4 for We're Calling an Intervention: Taking a Closer Look at Language Model Adaptation to Different Types of Linguistic Variation
Viaarxiv icon