Alert button
Picture for Alexander Kunitsyn

Alexander Kunitsyn

Alert button

VLRM: Vision-Language Models act as Reward Models for Image Captioning

Add code
Bookmark button
Alert button
Apr 02, 2024
Maksim Dzabraev, Alexander Kunitsyn, Andrei Ivaniuta

Viaarxiv icon

MDMMT-2: Multidomain Multimodal Transformer for Video Retrieval, One More Step Towards Generalization

Add code
Bookmark button
Alert button
Mar 14, 2022
Alexander Kunitsyn, Maksim Kalashnikov, Maksim Dzabraev, Andrei Ivaniuta

Figure 1 for MDMMT-2: Multidomain Multimodal Transformer for Video Retrieval, One More Step Towards Generalization
Figure 2 for MDMMT-2: Multidomain Multimodal Transformer for Video Retrieval, One More Step Towards Generalization
Figure 3 for MDMMT-2: Multidomain Multimodal Transformer for Video Retrieval, One More Step Towards Generalization
Figure 4 for MDMMT-2: Multidomain Multimodal Transformer for Video Retrieval, One More Step Towards Generalization
Viaarxiv icon