Picture for Sushant Gautam

Sushant Gautam

Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust MedVQA in Gastrointestinal Endoscopy

Add code
Jun 11, 2025
Viaarxiv icon

Point, Detect, Count: Multi-Task Medical Image Understanding with Instruction-Tuned Vision-Language Models

Add code
May 22, 2025
Viaarxiv icon

SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding

Add code
May 22, 2025
Viaarxiv icon

Prompt to Polyp: Medical Text-Conditioned Image Synthesis with Diffusion Models

Add code
May 12, 2025
Viaarxiv icon

Prompt to Polyp: Clinically-Aware Medical Image Synthesis with Diffusion Models

Add code
May 08, 2025
Viaarxiv icon

X-DECODE: EXtreme Deblurring with Curriculum Optimization and Domain Equalization

Add code
Apr 10, 2025
Viaarxiv icon

Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis

Add code
Nov 20, 2024
Figure 1 for Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis
Figure 2 for Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis
Figure 3 for Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis
Figure 4 for Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis
Viaarxiv icon

Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case Study

Add code
Sep 26, 2024
Viaarxiv icon

Kvasir-VQA: A Text-Image Pair GI Tract Dataset

Add code
Sep 02, 2024
Figure 1 for Kvasir-VQA: A Text-Image Pair GI Tract Dataset
Figure 2 for Kvasir-VQA: A Text-Image Pair GI Tract Dataset
Figure 3 for Kvasir-VQA: A Text-Image Pair GI Tract Dataset
Figure 4 for Kvasir-VQA: A Text-Image Pair GI Tract Dataset
Viaarxiv icon

PLayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips

Add code
Jul 22, 2024
Figure 1 for PLayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips
Figure 2 for PLayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips
Figure 3 for PLayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips
Figure 4 for PLayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips
Viaarxiv icon