Picture for Winston H. Hsu

Winston H. Hsu

FineBench: Benchmarking and Enhancing Vision-Language Models for Fine-grained Human Activity Understanding

Add code
May 20, 2026
Viaarxiv icon

SPATIOROUTE: Dynamic Prompt Routing for Zero-Shot Spatial Reasoning

Add code
May 18, 2026
Viaarxiv icon

SceneFunRI: Reasoning the Invisible for Task-Driven Functional Object Localization

Add code
May 14, 2026
Viaarxiv icon

VLN-NF: Feasibility-Aware Vision-and-Language Navigation with False-Premise Instructions

Add code
Apr 12, 2026
Viaarxiv icon

Affordance-Guided Coarse-to-Fine Exploration for Base Placement in Open-Vocabulary Mobile Manipulation

Add code
Nov 09, 2025
Figure 1 for Affordance-Guided Coarse-to-Fine Exploration for Base Placement in Open-Vocabulary Mobile Manipulation
Figure 2 for Affordance-Guided Coarse-to-Fine Exploration for Base Placement in Open-Vocabulary Mobile Manipulation
Figure 3 for Affordance-Guided Coarse-to-Fine Exploration for Base Placement in Open-Vocabulary Mobile Manipulation
Figure 4 for Affordance-Guided Coarse-to-Fine Exploration for Base Placement in Open-Vocabulary Mobile Manipulation
Viaarxiv icon

MovieCORE: COgnitive REasoning in Movies

Add code
Aug 26, 2025
Figure 1 for MovieCORE: COgnitive REasoning in Movies
Figure 2 for MovieCORE: COgnitive REasoning in Movies
Figure 3 for MovieCORE: COgnitive REasoning in Movies
Figure 4 for MovieCORE: COgnitive REasoning in Movies
Viaarxiv icon

Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features

Add code
Feb 04, 2025
Viaarxiv icon

Leveraging Content and Context Cues for Low-Light Image Enhancement

Add code
Dec 10, 2024
Figure 1 for Leveraging Content and Context Cues for Low-Light Image Enhancement
Figure 2 for Leveraging Content and Context Cues for Low-Light Image Enhancement
Figure 3 for Leveraging Content and Context Cues for Low-Light Image Enhancement
Figure 4 for Leveraging Content and Context Cues for Low-Light Image Enhancement
Viaarxiv icon

Attention Tracker: Detecting Prompt Injection Attacks in LLMs

Add code
Nov 01, 2024
Figure 1 for Attention Tracker: Detecting Prompt Injection Attacks in LLMs
Figure 2 for Attention Tracker: Detecting Prompt Injection Attacks in LLMs
Figure 3 for Attention Tracker: Detecting Prompt Injection Attacks in LLMs
Figure 4 for Attention Tracker: Detecting Prompt Injection Attacks in LLMs
Viaarxiv icon

Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses

Add code
Sep 22, 2024
Figure 1 for Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses
Figure 2 for Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses
Figure 3 for Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses
Figure 4 for Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses
Viaarxiv icon