Picture for Fengxiang Wang

Fengxiang Wang

OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data

Add code
May 29, 2025
Viaarxiv icon

GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution

Add code
May 27, 2025
Viaarxiv icon

TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series

Add code
May 13, 2025
Viaarxiv icon

Self-Supervised Enhancement of Forward-Looking Sonar Images: Bridging Cross-Modal Degradation Gaps through Feature Space Transformation and Multi-Frame Fusion

Add code
Apr 16, 2025
Viaarxiv icon

XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?

Add code
Mar 31, 2025
Viaarxiv icon

RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing

Add code
Mar 13, 2025
Viaarxiv icon

Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency

Add code
Jan 09, 2025
Figure 1 for Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency
Figure 2 for Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency
Figure 3 for Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency
Figure 4 for Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency
Viaarxiv icon

MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue

Add code
Nov 06, 2024
Figure 1 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Figure 2 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Figure 3 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Figure 4 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Viaarxiv icon

Scaling Efficient Masked Autoencoder Learning on Large Remote Sensing Dataset

Add code
Jun 17, 2024
Viaarxiv icon

Efficient Prompt Tuning of Large Vision-Language Model for Fine-Grained Ship Classification

Add code
Mar 13, 2024
Viaarxiv icon