Alert button
Picture for Vladimir Iashin

Vladimir Iashin

Alert button

Synchformer: Efficient Synchronization from Sparse Cues

Jan 29, 2024
Vladimir Iashin, Weidi Xie, Esa Rahtu, Andrew Zisserman

Viaarxiv icon

Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors

Oct 13, 2022
Vladimir Iashin, Weidi Xie, Esa Rahtu, Andrew Zisserman

Figure 1 for Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors
Figure 2 for Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors
Figure 3 for Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors
Figure 4 for Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors
Viaarxiv icon

Taming Visually Guided Sound Generation

Oct 17, 2021
Vladimir Iashin, Esa Rahtu

Figure 1 for Taming Visually Guided Sound Generation
Figure 2 for Taming Visually Guided Sound Generation
Figure 3 for Taming Visually Guided Sound Generation
Figure 4 for Taming Visually Guided Sound Generation
Viaarxiv icon

Multi-modal estimation of the properties of containers and their content: survey and evaluation

Jul 27, 2021
Alessio Xompero, Santiago Donaher, Vladimir Iashin, Francesca Palermo, Gökhan Solak, Claudio Coppola, Reina Ishikawa, Yuichi Nagao, Ryo Hachiuma, Qi Liu, Fan Feng, Chuanlin Lan, Rosa H. M. Chan, Guilherme Christmann, Jyun-Ting Song, Gonuguntla Neeharika, Chinnakotla Krishna Teja Reddy, Dinesh Jain, Bakhtawar Ur Rehman, Andrea Cavallaro

Figure 1 for Multi-modal estimation of the properties of containers and their content: survey and evaluation
Figure 2 for Multi-modal estimation of the properties of containers and their content: survey and evaluation
Figure 3 for Multi-modal estimation of the properties of containers and their content: survey and evaluation
Figure 4 for Multi-modal estimation of the properties of containers and their content: survey and evaluation
Viaarxiv icon

Top-1 CORSMAL Challenge 2020 Submission: Filling Mass Estimation Using Multi-modal Observations of Human-robot Handovers

Dec 02, 2020
Vladimir Iashin, Francesca Palermo, Gökhan Solak, Claudio Coppola

Figure 1 for Top-1 CORSMAL Challenge 2020 Submission: Filling Mass Estimation Using Multi-modal Observations of Human-robot Handovers
Figure 2 for Top-1 CORSMAL Challenge 2020 Submission: Filling Mass Estimation Using Multi-modal Observations of Human-robot Handovers
Figure 3 for Top-1 CORSMAL Challenge 2020 Submission: Filling Mass Estimation Using Multi-modal Observations of Human-robot Handovers
Figure 4 for Top-1 CORSMAL Challenge 2020 Submission: Filling Mass Estimation Using Multi-modal Observations of Human-robot Handovers
Viaarxiv icon

A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer

May 17, 2020
Vladimir Iashin, Esa Rahtu

Figure 1 for A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
Figure 2 for A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
Figure 3 for A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
Figure 4 for A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
Viaarxiv icon

Multi-modal Dense Video Captioning

Mar 17, 2020
Vladimir Iashin, Esa Rahtu

Figure 1 for Multi-modal Dense Video Captioning
Figure 2 for Multi-modal Dense Video Captioning
Figure 3 for Multi-modal Dense Video Captioning
Figure 4 for Multi-modal Dense Video Captioning
Viaarxiv icon