Yizhang Jin

Improving General Role-Playing Agents via Psychology-Grounded Reasoning and Role-Aware Policy Optimization

Jun 25, 2026
Via arXiv

Improving Search Agent with One Line of Code

Mar 10, 2026
Via arXiv

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description

Aug 09, 2024
Via arXiv

Efficient Multimodal Large Language Models: A Survey

May 17, 2024
Via arXiv

Generalized Category Discovery in Semantic Segmentation

Nov 20, 2023
Via arXiv