Picture for Jaden Park

Jaden Park

MAOAM: Unified Object and Material Selection with Vision-Language Models

Add code
Jun 02, 2026
Viaarxiv icon

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Add code
Apr 14, 2026
Viaarxiv icon

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

Add code
Nov 05, 2025
Viaarxiv icon

Decomposing Complex Visual Comprehension into Atomic Visual Skills for Vision Language Models

Add code
May 26, 2025
Viaarxiv icon

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Add code
Oct 15, 2024
Figure 1 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Figure 2 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Figure 3 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Figure 4 for TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Viaarxiv icon