Vladimir Iashin

Synchformer: Efficient Synchronization from Sparse Cues

Jan 29, 2024
Vladimir Iashin, Weidi Xie, Esa Rahtu, Andrew Zisserman

Via arXiv

Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors

Oct 13, 2022
Vladimir Iashin, Weidi Xie, Esa Rahtu, Andrew Zisserman

Via arXiv

Taming Visually Guided Sound Generation

Oct 17, 2021
Vladimir Iashin, Esa Rahtu

Via arXiv

Multi-modal estimation of the properties of containers and their content: survey and evaluation

Jul 27, 2021
Alessio Xompero, Santiago Donaher, Vladimir Iashin, Francesca Palermo, Gökhan Solak, Claudio Coppola, Reina Ishikawa, Yuichi Nagao, Ryo Hachiuma, Qi Liu, Fan Feng, Chuanlin Lan, Rosa H. M. Chan, Guilherme Christmann, Jyun-Ting Song, Gonuguntla Neeharika, Chinnakotla Krishna Teja Reddy, Dinesh Jain, Bakhtawar Ur Rehman, Andrea Cavallaro

Via arXiv

Top-1 CORSMAL Challenge 2020 Submission: Filling Mass Estimation Using Multi-modal Observations of Human-robot Handovers

Dec 02, 2020
Vladimir Iashin, Francesca Palermo, Gökhan Solak, Claudio Coppola

Via arXiv

A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer

May 17, 2020
Vladimir Iashin, Esa Rahtu

Via arXiv

Multi-modal Dense Video Captioning

Mar 17, 2020
Vladimir Iashin, Esa Rahtu

Via arXiv