Lucrezia Tosato

Sentinel2Cap: A Human-Annotated Benchmark Dataset for Multimodal Remote Sensing Image Captioning

May 04, 2026
Via arXiv

Visual Question Answering on Multiple Remote Sensing Image Modalities

May 21, 2025
Via arXiv

SAR Strikes Back: A New Hope for RSVQA

Jan 14, 2025
Via arXiv

Exploiting temporal information to detect conversational groups in videos and predict the next speaker

Aug 29, 2024
Via arXiv

Can SAR improve RSVQA performance?

Aug 28, 2024
Via arXiv

Segmentation-guided Attention for Visual Question Answering from Remote Sensing Images

Jul 11, 2024
Via arXiv