Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Vladimir Iashin

Taming Visually Guided Sound Generation


Oct 17, 2021
Vladimir Iashin, Esa Rahtu

* Accepted as an oral presentation for the BMVC 2021. Code: https://github.com/v-iashin/SpecVQGAN Project page: https://v-iashin.github.io/SpecVQGAN 

  Access Paper or Ask Questions

Multi-modal estimation of the properties of containers and their content: survey and evaluation


Jul 27, 2021
Alessio Xompero, Santiago Donaher, Vladimir Iashin, Francesca Palermo, Gökhan Solak, Claudio Coppola, Reina Ishikawa, Yuichi Nagao, Ryo Hachiuma, Qi Liu, Fan Feng, Chuanlin Lan, Rosa H. M. Chan, Guilherme Christmann, Jyun-Ting Song, Gonuguntla Neeharika, Chinnakotla Krishna Teja Reddy, Dinesh Jain, Bakhtawar Ur Rehman, Andrea Cavallaro

* 13 pages, 9 tables, 5 figures, submitted to IEEE Transactions on Multimedia 

  Access Paper or Ask Questions

Top-1 CORSMAL Challenge 2020 Submission: Filling Mass Estimation Using Multi-modal Observations of Human-robot Handovers


Dec 02, 2020
Vladimir Iashin, Francesca Palermo, Gökhan Solak, Claudio Coppola

* Code: https://github.com/v-iashin/CORSMAL Docker: https://hub.docker.com/r/iashin/corsmal 

  Access Paper or Ask Questions

A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer


May 17, 2020
Vladimir Iashin, Esa Rahtu

* Project page is available on https://v-iashin.github.io/bmt 

  Access Paper or Ask Questions

Multi-modal Dense Video Captioning


Mar 17, 2020
Vladimir Iashin, Esa Rahtu

* 13 pages, 6 figures 

  Access Paper or Ask Questions