Picture for Yang Zhan

Yang Zhan

UAVBench and UAVIT-1M: Benchmarking and Enhancing MLLMs for Low-Altitude UAV Vision-Language Understanding

Add code
Mar 15, 2026
Viaarxiv icon

ShipTraj-R1: Reinforcing Ship Trajectory Prediction in Large Language Models via Group Relative Policy Optimization

Add code
Mar 03, 2026
Viaarxiv icon

From Atoms to Trees: Building a Structured Feature Forest with Hierarchical Sparse Autoencoders

Add code
Feb 12, 2026
Viaarxiv icon

SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model

Add code
Jan 18, 2024
Figure 1 for SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model
Figure 2 for SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model
Figure 3 for SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model
Figure 4 for SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model
Viaarxiv icon

Mono3DVG: 3D Visual Grounding in Monocular Images

Add code
Dec 13, 2023
Viaarxiv icon

Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval

Add code
Aug 24, 2023
Figure 1 for Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval
Figure 2 for Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval
Figure 3 for Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval
Figure 4 for Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval
Viaarxiv icon

RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data

Add code
Oct 23, 2022
Figure 1 for RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data
Figure 2 for RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data
Figure 3 for RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data
Figure 4 for RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data
Viaarxiv icon