Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics


May 12, 2022
David M. Chan , Austin Myers , Sudheendra Vijayanarasimhan , David A. Ross , Bryan Seybold , John F. Canny

* The 1st Workshop on Vision Datasets Understanding, IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Learn to Dance with AIST++: Music Conditioned 3D Dance Generation


Feb 02, 2021
Ruilong Li , Shan Yang , David A. Ross , Angjoo Kanazawa

* Project page: https://google.github.io/aichoreographer/; Dataset page: https://google.github.io/aistplusplus_dataset/ 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Active Learning for Video Description With Cluster-Regularized Ensemble Ranking


Jul 29, 2020
David M. Chan , Sudheendra Vijayanarasimhan , David A. Ross , John Canny


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Learning Video Representations from Textual Web Supervision


Jul 29, 2020
Jonathan C. Stroud , David A. Ross , Chen Sun , Jia Deng , Rahul Sukthankar , Cordelia Schmid


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

The AVA-Kinetics Localized Human Actions Video Dataset


May 20, 2020
Ang Li , Meghana Thotakuri , David A. Ross , João Carreira , Alexander Vostrikov , Andrew Zisserman

* 8 pages, 8 figures 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

D3D: Distilled 3D Networks for Video Action Recognition


Dec 19, 2018
Jonathan C. Stroud , David A. Ross , Chen Sun , Jia Deng , Rahul Sukthankar


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions


Apr 30, 2018
Chunhui Gu , Chen Sun , David A. Ross , Carl Vondrick , Caroline Pantofaru , Yeqing Li , Sudheendra Vijayanarasimhan , George Toderici , Susanna Ricco , Rahul Sukthankar , Cordelia Schmid , Jitendra Malik

* To appear in CVPR 2018. Check dataset page https://research.google.com/ava/ for details 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
>>