Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Tiny Transfer Learning: Towards Memory-Efficient On-Device Learning

Jul 28, 2020
Han Cai, Chuang Gan, Ligeng Zhu, Song Han


  Access Model/Code and Paper
Noisy Agents: Self-supervised Exploration by Predicting Auditory Events

Jul 27, 2020
Chuang Gan, Xiaoyu Chen, Phillip Isola, Antonio Torralba, Joshua B. Tenenbaum

* Project page: http://noisy-agent.csail.mit.edu 

  Access Model/Code and Paper
Foley Music: Learning to Generate Music from Videos

Jul 21, 2020
Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, Antonio Torralba

* ECCV 2020. Project page: http://foley-music.csail.mit.edu 

  Access Model/Code and Paper
MCUNet: Tiny Deep Learning on IoT Devices

Jul 20, 2020
Ji Lin, Wei-Ming Chen, Yujun Lin, John Cohn, Chuang Gan, Song Han

* Demo video available here: https://youtu.be/YvioBgtec4U 

  Access Model/Code and Paper
Generating Visually Aligned Sound from Videos

Jul 14, 2020
Peihao Chen, Yang Zhang, Mingkui Tan, Hongdong Xiao, Deng Huang, Chuang Gan

* Published in IEEE Transactions on Image Processing, 2020. Code, pre-trained models and demo video: https://github.com/PeihaoChen/regnet 

  Access Model/Code and Paper
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation

Jul 09, 2020
Chuang Gan, Jeremy Schwartz, Seth Alter, Martin Schrimpf, James Traer, Julian De Freitas, Jonas Kubilius, Abhishek Bhandwaldar, Nick Haber, Megumi Sano, Kuno Kim, Elias Wang, Damian Mrowca, Michael Lingelbach, Aidan Curtis, Kevin Feigelis, Daniel M. Bear, Dan Gutfreund, David Cox, James J. DiCarlo, Josh McDermott, Joshua B. Tenenbaum, Daniel L. K. Yamins

* Project page: http://www.threedworld.org 

  Access Model/Code and Paper
Language Guided Networks for Cross-modal Moment Retrieval

Jun 18, 2020
Kun Liu, Xun Yang, Tat-seng Chua, Huadong Ma, Chuang Gan


  Access Model/Code and Paper
A Real-time Action Representation with Temporal Encoding and Deep Compression

Jun 17, 2020
Kun Liu, Wu Liu, Huadong Ma, Mingkui Tan, Chuang Gan


  Access Model/Code and Paper
HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

May 28, 2020
Hanrui Wang, Zhanghao Wu, Zhijian Liu, Han Cai, Ligeng Zhu, Chuang Gan, Song Han

* Accepted to ACL 2020. 14 pages, 12 figures. Code available at http://github.com/mit-han-lab/hardware-aware-transformers.git 

  Access Model/Code and Paper
Music Gesture for Visual Sound Separation

Apr 20, 2020
Chuang Gan, Deng Huang, Hang Zhao, Joshua B. Tenenbaum, Antonio Torralba

* CVPR 2020. Project page: http://music-gesture.csail.mit.edu 

  Access Model/Code and Paper
Dense Regression Network for Video Grounding

Apr 07, 2020
Runhao Zeng, Haoming Xu, Wenbing Huang, Peihao Chen, Mingkui Tan, Chuang Gan

* CVPR 2020 

  Access Model/Code and Paper
Visual Concept-Metaconcept Learning

Feb 04, 2020
Chi Han, Jiayuan Mao, Chuang Gan, Joshua B. Tenenbaum, Jiajun Wu

* NeurIPS 2019. First two authors contributed equally. Project page: http://vcml.csail.mit.edu/ 

  Access Model/Code and Paper
Look, Listen, and Act: Towards Audio-Visual Embodied Navigation

Dec 25, 2019
Chuang Gan, Yiwei Zhang, Jiajun Wu, Boqing Gong, Joshua B. Tenenbaum


  Access Model/Code and Paper
Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement

Nov 18, 2019
Chao Yang, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Huaping Liu, Junzhou Huang, Chuang Gan

* Accepted to NeurIPS 2019 as a spotlight. Chao Yang and Xiaojian Ma contributed equally to this work 

  Access Model/Code and Paper
Self-supervised Moving Vehicle Tracking with Stereo Sound

Oct 25, 2019
Chuang Gan, Hang Zhao, Peihao Chen, David Cox, Antonio Torralba

* To appear at ICCV 2019. Project page: http://sound-track.csail.mit.edu 

  Access Model/Code and Paper
TruNet: Short Videos Generation from Long Videos via Story-Preserving Truncation

Oct 14, 2019
Fan Yang, Xiao Liu, Dongliang He, Chuang Gan, Jian Wang, Chao Li, Fu Li, Shilei Wen

* ICCV intelligent short video workshop 

  Access Model/Code and Paper
CLEVRER: CoLlision Events for Video REpresentation and Reasoning

Oct 03, 2019
Kexin Yi, Chuang Gan, Yunzhu Li, Pushmeet Kohli, Jiajun Wu, Antonio Torralba, Joshua B. Tenenbaum

* The first two authors contributed equally to this work. Project page: http://clevrer.csail.mit.edu/ 

  Access Model/Code and Paper
Training Kinetics in 15 Minutes: Large-scale Distributed Training on Videos

Oct 01, 2019
Ji Lin, Chuang Gan, Song Han


  Access Model/Code and Paper
Graph Convolutional Networks for Temporal Action Localization

Sep 07, 2019
Runhao Zeng, Wenbing Huang, Mingkui Tan, Yu Rong, Peilin Zhao, Junzhou Huang, Chuang Gan

* ICCV 2019 

  Access Model/Code and Paper
Once for All: Train One Network and Specialize it for Efficient Deployment

Aug 26, 2019
Han Cai, Chuang Gan, Song Han


  Access Model/Code and Paper
Deep Concept-wise Temporal Convolutional Networks for Action Localization

Aug 26, 2019
Xin Li, Tianwei Lin, Xiao Liu, Chuang Gan, Wangmeng Zuo, Chao Li, Xiang Long, Dongliang He, Fu Li, Shilei Wen


  Access Model/Code and Paper
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision

Apr 26, 2019
Jiayuan Mao, Chuang Gan, Pushmeet Kohli, Joshua B. Tenenbaum, Jiajun Wu

* ICLR 2019 (Oral). Project page: http://nscl.csail.mit.edu/ 

  Access Model/Code and Paper
Self-Supervised Audio-Visual Co-Segmentation

Apr 18, 2019
Andrew Rouditchenko, Hang Zhao, Chuang Gan, Josh McDermott, Antonio Torralba

* Accepted to ICASSP 2019 

  Access Model/Code and Paper
Defensive Quantization: When Efficiency Meets Robustness

Apr 17, 2019
Ji Lin, Chuang Gan, Song Han


  Access Model/Code and Paper
The Sound of Motions

Apr 11, 2019
Hang Zhao, Chuang Gan, Wei-Chiu Ma, Antonio Torralba


  Access Model/Code and Paper
Interpreting Adversarial Examples by Activation Promotion and Suppression

Apr 03, 2019
Kaidi Xu, Sijia Liu, Gaoyuan Zhang, Mengshu Sun, Pu Zhao, Quanfu Fan, Chuang Gan, Xue Lin


  Access Model/Code and Paper
Weakly Supervised Dense Event Captioning in Videos

Dec 10, 2018
Xuguang Duan, Wenbing Huang, Chuang Gan, Jingdong Wang, Wenwu Zhu, Junzhou Huang

* NeurIPS 2018 

  Access Model/Code and Paper