Boyang Li

PointArena: Probing Multimodal Grounding Through Language-Guided Pointing

May 15, 2025

Reach-Avoid-Stabilize Using Admissible Control Sets

May 14, 2025

Solving Reach- and Stabilize-Avoid Problems Using Discounted Reachability

May 14, 2025

CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging

May 14, 2025

Enhancing Vision-Language Compositional Understanding with Multimodal Synthetic Data

Mar 03, 2025

Task Arithmetic in Trust Region: A Training-Free Model Merging Approach to Navigate Knowledge Conflicts

Jan 25, 2025

SPHERE: A Hierarchical Evaluation on Spatial Perception and Reasoning for Vision-Language Models

Dec 17, 2024

Barking Up The Syntactic Tree: Enhancing VLM Training with Syntactic Losses

Dec 11, 2024

Black Swan: Abductive and Defeasible Video Reasoning in Unpredictable Events

Dec 07, 2024

Paint Outside the Box: Synthesizing and Selecting Training Data for Visual Grounding

Dec 01, 2024