Picture for Yang Gao

Yang Gao

Harry

FUSE-RSVLM: Feature Fusion Vision-Language Model for Remote Sensing

Add code
Dec 30, 2025
Viaarxiv icon

LibContinual: A Comprehensive Library towards Realistic Continual Learning

Add code
Dec 26, 2025
Viaarxiv icon

Point What You Mean: Visually Grounded Instruction Policy

Add code
Dec 22, 2025
Figure 1 for Point What You Mean: Visually Grounded Instruction Policy
Figure 2 for Point What You Mean: Visually Grounded Instruction Policy
Figure 3 for Point What You Mean: Visually Grounded Instruction Policy
Figure 4 for Point What You Mean: Visually Grounded Instruction Policy
Viaarxiv icon

Translating Flow to Policy via Hindsight Online Imitation

Add code
Dec 22, 2025
Viaarxiv icon

MAGIC: Achieving Superior Model Merging via Magnitude Calibration

Add code
Dec 22, 2025
Figure 1 for MAGIC: Achieving Superior Model Merging via Magnitude Calibration
Figure 2 for MAGIC: Achieving Superior Model Merging via Magnitude Calibration
Figure 3 for MAGIC: Achieving Superior Model Merging via Magnitude Calibration
Figure 4 for MAGIC: Achieving Superior Model Merging via Magnitude Calibration
Viaarxiv icon

A Benchmark for Ultra-High-Resolution Remote Sensing MLLMs

Add code
Dec 19, 2025
Viaarxiv icon

ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation

Add code
Dec 18, 2025
Figure 1 for ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation
Figure 2 for ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation
Figure 3 for ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation
Figure 4 for ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation
Viaarxiv icon

Entropy-Guided Reasoning Compression

Add code
Nov 18, 2025
Figure 1 for Entropy-Guided Reasoning Compression
Figure 2 for Entropy-Guided Reasoning Compression
Figure 3 for Entropy-Guided Reasoning Compression
Figure 4 for Entropy-Guided Reasoning Compression
Viaarxiv icon

GateFuseNet: An Adaptive 3D Multimodal Neuroimaging Fusion Network for Parkinson's Disease Diagnosis

Add code
Oct 26, 2025
Viaarxiv icon

VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation

Add code
Oct 10, 2025
Viaarxiv icon