Picture for Yuan Qing

Yuan Qing

Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck

Add code
May 30, 2025
Viaarxiv icon

Multi-modal Instance Refinement for Cross-domain Action Recognition

Add code
Nov 24, 2023
Figure 1 for Multi-modal Instance Refinement for Cross-domain Action Recognition
Figure 2 for Multi-modal Instance Refinement for Cross-domain Action Recognition
Figure 3 for Multi-modal Instance Refinement for Cross-domain Action Recognition
Figure 4 for Multi-modal Instance Refinement for Cross-domain Action Recognition
Viaarxiv icon