Alert button
Picture for David M. Chan

David M. Chan

Alert button

ALOHa: A New Measure for Hallucination in Captioning Models

Add code
Bookmark button
Alert button
Apr 03, 2024
Suzanne Petryk, David M. Chan, Anish Kachinthaya, Haodi Zou, John Canny, Joseph E. Gonzalez, Trevor Darrell

Viaarxiv icon

ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video

Add code
Bookmark button
Alert button
Jan 10, 2024
Kevin Cai, Chonghua Liu, David M. Chan

Viaarxiv icon

Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jan 04, 2024
David M. Chan, Shalini Ghosh, Hitesh Tulsiani, Ariya Rastrow, Björn Hoffmeister

Viaarxiv icon

Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Add code
Bookmark button
Alert button
Dec 22, 2023
Anirudh S. Sundar, Chao-Han Huck Yang, David M. Chan, Shalini Ghosh, Venkatesh Ravichandran, Phani Sankar Nidadavolu

Figure 1 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 2 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 3 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 4 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Viaarxiv icon

$IC^3$: Image Captioning by Committee Consensus

Add code
Bookmark button
Alert button
Feb 16, 2023
David M. Chan, Austin Myers, Sudheendra Vijayanarasimhan, David A. Ross, John Canny

Figure 1 for $IC^3$: Image Captioning by Committee Consensus
Figure 2 for $IC^3$: Image Captioning by Committee Consensus
Figure 3 for $IC^3$: Image Captioning by Committee Consensus
Figure 4 for $IC^3$: Image Captioning by Committee Consensus
Viaarxiv icon

Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition

Add code
Bookmark button
Alert button
Jan 06, 2023
David M. Chan, Shalini Ghosh, Ariya Rastrow, Björn Hoffmeister

Figure 1 for Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition
Figure 2 for Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition
Figure 3 for Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition
Figure 4 for Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition
Viaarxiv icon

Towards Understanding How Machines Can Learn Causal Overhypotheses

Add code
Bookmark button
Alert button
Jun 16, 2022
Eliza Kosoy, David M. Chan, Adrian Liu, Jasmine Collins, Bryanna Kaufmann, Sandy Han Huang, Jessica B. Hamrick, John Canny, Nan Rosemary Ke, Alison Gopnik

Figure 1 for Towards Understanding How Machines Can Learn Causal Overhypotheses
Figure 2 for Towards Understanding How Machines Can Learn Causal Overhypotheses
Figure 3 for Towards Understanding How Machines Can Learn Causal Overhypotheses
Figure 4 for Towards Understanding How Machines Can Learn Causal Overhypotheses
Viaarxiv icon

Content-Context Factorized Representations for Automated Speech Recognition

Add code
Bookmark button
Alert button
May 19, 2022
David M. Chan, Shalini Ghosh

Figure 1 for Content-Context Factorized Representations for Automated Speech Recognition
Figure 2 for Content-Context Factorized Representations for Automated Speech Recognition
Figure 3 for Content-Context Factorized Representations for Automated Speech Recognition
Figure 4 for Content-Context Factorized Representations for Automated Speech Recognition
Viaarxiv icon

What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics

Add code
Bookmark button
Alert button
May 12, 2022
David M. Chan, Austin Myers, Sudheendra Vijayanarasimhan, David A. Ross, Bryan Seybold, John F. Canny

Figure 1 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Figure 2 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Figure 3 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Figure 4 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Viaarxiv icon