Picture for Zhiyong Li

Zhiyong Li

BLaDA: Bridging Language to Functional Dexterous Actions within 3DGS Fields

Add code
Apr 09, 2026
Viaarxiv icon

Scene Graph-guided SegCaptioning Transformer with Fine-grained Alignment for Controllable Video Segmentation and Captioning

Add code
Mar 21, 2026
Viaarxiv icon

O3N: Omnidirectional Open-Vocabulary Occupancy Prediction

Add code
Mar 12, 2026
Viaarxiv icon

PanoAffordanceNet: Towards Holistic Affordance Grounding in 360° Indoor Environments

Add code
Mar 10, 2026
Viaarxiv icon

TRACER: Texture-Robust Affordance Chain-of-Thought for Deformable-Object Refinement

Add code
Jan 28, 2026
Viaarxiv icon

Learning Fine-Grained Correspondence with Cross-Perspective Perception for Open-Vocabulary 6D Object Pose Estimation

Add code
Jan 20, 2026
Viaarxiv icon

Cross-modal Context-aware Learning for Visual Prompt Guided Multimodal Image Understanding in Remote Sensing

Add code
Dec 12, 2025
Viaarxiv icon

AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery

Add code
Nov 14, 2025
Viaarxiv icon

AVAM: Universal Training-free Adaptive Visual Anchoring Embedded into Multimodal Large Language Model for Multi-image Question Answering

Add code
Aug 25, 2025
Viaarxiv icon

Panoramic Out-of-Distribution Segmentation

Add code
May 06, 2025
Viaarxiv icon