Alert button
Picture for Minesh Mathew

Minesh Mathew

Alert button

Understanding Video Scenes through Text: Insights from Text-based Video Question Answering

Sep 11, 2023
Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar

Figure 1 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Figure 2 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Figure 3 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Figure 4 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Viaarxiv icon

Reading Between the Lanes: Text VideoQA on the Road

Jul 08, 2023
George Tom, Minesh Mathew, Sergi Garcia, Dimosthenis Karatzas, C. V. Jawahar

Figure 1 for Reading Between the Lanes: Text VideoQA on the Road
Figure 2 for Reading Between the Lanes: Text VideoQA on the Road
Figure 3 for Reading Between the Lanes: Text VideoQA on the Road
Figure 4 for Reading Between the Lanes: Text VideoQA on the Road
Viaarxiv icon

Watching the News: Towards VideoQA Models that can Read

Nov 10, 2022
Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar

Figure 1 for Watching the News: Towards VideoQA Models that can Read
Figure 2 for Watching the News: Towards VideoQA Models that can Read
Figure 3 for Watching the News: Towards VideoQA Models that can Read
Figure 4 for Watching the News: Towards VideoQA Models that can Read
Viaarxiv icon

An empirical study of CTC based models for OCR of Indian languages

May 13, 2022
Minesh Mathew, CV Jawahar

Figure 1 for An empirical study of CTC based models for OCR of Indian languages
Figure 2 for An empirical study of CTC based models for OCR of Indian languages
Figure 3 for An empirical study of CTC based models for OCR of Indian languages
Figure 4 for An empirical study of CTC based models for OCR of Indian languages
Viaarxiv icon

ICDAR 2021 Competition on Document VisualQuestion Answering

Nov 10, 2021
Rubèn Tito, Minesh Mathew, C. V. Jawahar, Ernest Valveny, Dimosthenis Karatzas

Figure 1 for ICDAR 2021 Competition on Document VisualQuestion Answering
Figure 2 for ICDAR 2021 Competition on Document VisualQuestion Answering
Figure 3 for ICDAR 2021 Competition on Document VisualQuestion Answering
Figure 4 for ICDAR 2021 Competition on Document VisualQuestion Answering
Viaarxiv icon

Asking questions on handwritten document collections

Oct 02, 2021
Minesh Mathew, Lluis Gomez, Dimosthenis Karatzas, CV Jawahar

Figure 1 for Asking questions on handwritten document collections
Figure 2 for Asking questions on handwritten document collections
Figure 3 for Asking questions on handwritten document collections
Figure 4 for Asking questions on handwritten document collections
Viaarxiv icon

InfographicVQA

Apr 26, 2021
Minesh Mathew, Viraj Bagal, Rubèn Pérez Tito, Dimosthenis Karatzas, Ernest Valveny, C. V Jawahar

Figure 1 for InfographicVQA
Figure 2 for InfographicVQA
Figure 3 for InfographicVQA
Figure 4 for InfographicVQA
Viaarxiv icon

Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam

Apr 09, 2021
Minesh Mathew, Mohit Jain, CV Jawahar

Figure 1 for Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
Figure 2 for Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
Figure 3 for Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
Figure 4 for Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
Viaarxiv icon

MMBERT: Multimodal BERT Pretraining for Improved Medical VQA

Apr 03, 2021
Yash Khare, Viraj Bagal, Minesh Mathew, Adithi Devi, U Deva Priyakumar, CV Jawahar

Figure 1 for MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Figure 2 for MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Figure 3 for MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Figure 4 for MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Viaarxiv icon