Picture for Robby T. Tan

Robby T. Tan

NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

Add code
Apr 19, 2025
Viaarxiv icon

Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation

Add code
Apr 04, 2025
Viaarxiv icon

Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors

Add code
Mar 26, 2025
Viaarxiv icon

3DSwapping: Texture Swapping For 3D Object From Single Reference Image

Add code
Mar 24, 2025
Viaarxiv icon

EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection

Add code
Oct 31, 2024
Figure 1 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Figure 2 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Figure 3 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Figure 4 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Viaarxiv icon

VoiceBench: Benchmarking LLM-Based Voice Assistants

Add code
Oct 22, 2024
Viaarxiv icon

Representation Learning of Structured Data for Medical Foundation Models

Add code
Oct 17, 2024
Figure 1 for Representation Learning of Structured Data for Medical Foundation Models
Figure 2 for Representation Learning of Structured Data for Medical Foundation Models
Figure 3 for Representation Learning of Structured Data for Medical Foundation Models
Figure 4 for Representation Learning of Structured Data for Medical Foundation Models
Viaarxiv icon

Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models

Add code
Sep 27, 2024
Figure 1 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 2 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 3 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 4 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Viaarxiv icon

uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization

Add code
Aug 22, 2024
Figure 1 for uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization
Figure 2 for uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization
Figure 3 for uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization
Figure 4 for uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization
Viaarxiv icon

SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation

Add code
Aug 17, 2024
Figure 1 for SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation
Figure 2 for SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation
Figure 3 for SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation
Figure 4 for SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation
Viaarxiv icon