DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents

Feb 14, 2021
Tsu-Jui Fu, William Yang Wang, Daniel McDuff, Yale Song



Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning

Jan 26, 2021
Sangho Lee, Jiwan Chung, Youngjae Yu, Gunhee Kim, Thomas Breuel, Gal Chechik, Yale Song



Learning to Transfer Visual Effects from Videos to Images

Dec 17, 2020
Christopher Thomas, Yale Song, Adriana Kovashka



Parameter Efficient Multimodal Transformers for Video Representation Learning

Dec 08, 2020
Sangho Lee, Youngjae Yu, Gunhee Kim, Thomas Breuel, Jan Kautz, Yale Song



Learning Audio-Visual Representations with Active Contrastive Coding

Aug 31, 2020
Shuang Ma, Zhaoyang Zeng, Daniel McDuff, Yale Song



Multi-Reference Neural TTS Stylization with Adversarial Cycle Consistency

Oct 25, 2019
Matt Whitehill, Shuang Ma, Daniel McDuff, Yale Song



Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck

Aug 19, 2019
Shuang Ma, Daniel McDuff, Yale Song

* ICCV 2019 


Image to Video Domain Adaptation Using Web Supervision

Aug 05, 2019
Andrew Kae, Yale Song



Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval

Jul 17, 2019
Yale Song, Mohammad Soleymani

* CVPR 2019. Includes supplementary material and updated results on TGIF and MRW.


M3D-GAN: Multi-Modal Multi-Domain Translation with Universal Attention

Jul 09, 2019
Shuang Ma, Daniel McDuff, Yale Song



Video Prediction with Appearance and Motion Conditions

Jul 07, 2018
Yunseok Jang, Gunhee Kim, Yale Song

* Accepted paper at ICML 2018. Project page: http://vision.snu.ac.kr/projects/amc-gan 


Cross-Modal Retrieval with Implicit Concept Association

Apr 25, 2018
Yale Song, Mohammad Soleymani



Image2GIF: Generating Cinemagraphs using Recurrent Deep Q-Networks

Jan 27, 2018
Yipin Zhou, Yale Song, Tamara L. Berg

* WACV 2018


ElasticPlay: Interactive Video Summarization with Dynamic Time Budgets

Aug 23, 2017
Haojian Jin, Yale Song, Koji Yatani

* ACM Multimedia 2017 preprint 


Improving Pairwise Ranking for Multi-label Image Classification

Jun 01, 2017
Yuncheng Li, Yale Song, Jiebo Luo

* CVPR 2017


Learning from Noisy Labels with Distillation

Apr 07, 2017
Yuncheng Li, Jianchao Yang, Yale Song, Liangliang Cao, Jiebo Luo, Li-Jia Li



Real-Time Video Highlights for Yahoo Esports

Nov 27, 2016
Yale Song



Video2GIF: Automatic Generation of Animated GIFs from Video

May 16, 2016
Michael Gygli, Yale Song, Liangliang Cao

* Accepted to CVPR 2016 


Balancing Appearance and Context in Sketch Interpretation

Apr 25, 2016
Yale Song, Randall Davis, Kaichen Ma, Dana L. Penny



TGIF: A New Dataset and Benchmark on Animated GIF Description

Apr 12, 2016
Yuncheng Li, Yale Song, Liangliang Cao, Joel Tetreault, Larry Goldberg, Alejandro Jaimes, Jiebo Luo

* CVPR 2016 Camera Ready 
