Xian Sun

Duke University

Is Your Driving World Model an All-Around Player?

May 11, 2026

Masked Generative Transformer Is What You Need for Image Editing

May 11, 2026

RingMo-Agent: A Unified Remote Sensing Foundation Model for Multi-Platform and Multi-Modal Reasoning

Jul 28, 2025

ViRefSAM: Visual Reference-Guided Segment Anything Model for Remote Sensing Segmentation

Jul 03, 2025

A Complex-valued SAR Foundation Model Based on Physically Inspired Representation Learning

Apr 16, 2025

RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation

Apr 04, 2025

SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World

Mar 20, 2025

SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing

Dec 17, 2024

RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model

Nov 27, 2024

NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation

Nov 13, 2024