Yale Song

Contrastive Learning of Global and Local Audio-Visual Representations

Apr 07, 2021
Shuang Ma, Zhaoyang Zeng, Daniel McDuff, Yale Song

DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents

Feb 14, 2021
Tsu-Jui Fu, William Yang Wang, Daniel McDuff, Yale Song

Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning

Jan 26, 2021
Sangho Lee, Jiwan Chung, Youngjae Yu, Gunhee Kim, Thomas Breuel, Gal Chechik, Yale Song

Learning to Transfer Visual Effects from Videos to Images

Dec 17, 2020
Christopher Thomas, Yale Song, Adriana Kovashka

Parameter Efficient Multimodal Transformers for Video Representation Learning

Dec 08, 2020
Sangho Lee, Youngjae Yu, Gunhee Kim, Thomas Breuel, Jan Kautz, Yale Song

Learning Audio-Visual Representations with Active Contrastive Coding

Aug 31, 2020
Shuang Ma, Zhaoyang Zeng, Daniel McDuff, Yale Song

Multi-Reference Neural TTS Stylization with Adversarial Cycle Consistency

Oct 25, 2019
Matt Whitehill, Shuang Ma, Daniel McDuff, Yale Song

Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck

Aug 19, 2019
Shuang Ma, Daniel McDuff, Yale Song

Image to Video Domain Adaptation Using Web Supervision

Aug 05, 2019
Andrew Kae, Yale Song

Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval

Jul 17, 2019
Yale Song, Mohammad Soleymani
