Picture for Tan-Hanh Pham

Tan-Hanh Pham

RARL: Improving Medical VLM Reasoning and Generalization with Reinforcement Learning and LoRA under Data and Hardware Constraints

Add code
Jun 07, 2025
Viaarxiv icon

IQBench: How "Smart'' Are Vision-Language Models? A Study with Human IQ Tests

Add code
May 17, 2025
Viaarxiv icon

Missing Data Estimation for MR Spectroscopic Imaging via Mask-Free Deep Learning Methods

Add code
May 11, 2025
Viaarxiv icon

SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging

Add code
Apr 14, 2025
Viaarxiv icon

Predicting Space Tourism Demand Using Explainable AI

Add code
Mar 05, 2025
Viaarxiv icon

A Wearable Device Dataset for Mental Health Assessment Using Laser Doppler Flowmetry and Fluorescence Spectroscopy Sensors

Add code
Feb 03, 2025
Figure 1 for A Wearable Device Dataset for Mental Health Assessment Using Laser Doppler Flowmetry and Fluorescence Spectroscopy Sensors
Figure 2 for A Wearable Device Dataset for Mental Health Assessment Using Laser Doppler Flowmetry and Fluorescence Spectroscopy Sensors
Figure 3 for A Wearable Device Dataset for Mental Health Assessment Using Laser Doppler Flowmetry and Fluorescence Spectroscopy Sensors
Figure 4 for A Wearable Device Dataset for Mental Health Assessment Using Laser Doppler Flowmetry and Fluorescence Spectroscopy Sensors
Viaarxiv icon

SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization

Add code
Dec 21, 2024
Viaarxiv icon

Adaptive Compensation for Robotic Joint Failures Using Partially Observable Reinforcement Learning

Add code
Sep 22, 2024
Viaarxiv icon

MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder

Add code
Sep 21, 2024
Figure 1 for MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder
Figure 2 for MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder
Figure 3 for MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder
Figure 4 for MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder
Viaarxiv icon

wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech

Add code
Aug 08, 2024
Viaarxiv icon