Zhiyong Li

Cross-modal Context-aware Learning for Visual Prompt Guided Multimodal Image Understanding in Remote Sensing

Dec 12, 2025
Via arXiv

AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery

Nov 14, 2025
Via arXiv

AVAM: Universal Training-free Adaptive Visual Anchoring Embedded into Multimodal Large Language Model for Multi-image Question Answering

Aug 25, 2025
Via arXiv

Panoramic Out-of-Distribution Segmentation

May 06, 2025
Via arXiv

HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors

Mar 10, 2025
Via arXiv

TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping

Mar 04, 2025
Via arXiv

Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts

Mar 04, 2025
Via arXiv

Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance

Mar 04, 2025
Via arXiv

One-Shot Affordance Grounding of Deformable Objects in Egocentric Organizing Scenes

Mar 03, 2025
Via arXiv

Multi-Keypoint Affordance Representation for Functional Dexterous Grasping

Feb 27, 2025
Via arXiv