Picture for Robby T. Tan

Robby T. Tan

Rethinking Human-Object Interaction Evaluation for both Vision-Language Models and HOI-Specific Methods

Add code
Aug 26, 2025
Viaarxiv icon

Bridging Annotation Gaps: Transferring Labels to Align Object Detection Datasets

Add code
Jun 06, 2025
Viaarxiv icon

NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

Add code
Apr 19, 2025
Viaarxiv icon

Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation

Add code
Apr 04, 2025
Viaarxiv icon

Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors

Add code
Mar 26, 2025
Viaarxiv icon

3DSwapping: Texture Swapping For 3D Object From Single Reference Image

Add code
Mar 24, 2025
Viaarxiv icon

EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection

Add code
Oct 31, 2024
Figure 1 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Figure 2 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Figure 3 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Figure 4 for EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
Viaarxiv icon

VoiceBench: Benchmarking LLM-Based Voice Assistants

Add code
Oct 22, 2024
Figure 1 for VoiceBench: Benchmarking LLM-Based Voice Assistants
Figure 2 for VoiceBench: Benchmarking LLM-Based Voice Assistants
Figure 3 for VoiceBench: Benchmarking LLM-Based Voice Assistants
Figure 4 for VoiceBench: Benchmarking LLM-Based Voice Assistants
Viaarxiv icon

Representation Learning of Structured Data for Medical Foundation Models

Add code
Oct 17, 2024
Figure 1 for Representation Learning of Structured Data for Medical Foundation Models
Figure 2 for Representation Learning of Structured Data for Medical Foundation Models
Figure 3 for Representation Learning of Structured Data for Medical Foundation Models
Figure 4 for Representation Learning of Structured Data for Medical Foundation Models
Viaarxiv icon

Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models

Add code
Sep 27, 2024
Figure 1 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 2 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 3 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 4 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Viaarxiv icon