Hongchen Wei

RSFAKE-1M: A Large-Scale Dataset for Detecting Diffusion-Generated Remote Sensing Forgeries

May 29, 2025
Via arXiv

Training-Free Reasoning and Reflection in MLLMs

May 22, 2025
Via arXiv

LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models

Feb 21, 2025
Via arXiv

Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model

Feb 19, 2025
Via arXiv

Visual Context Window Extension: A New Perspective for Long Video Understanding

Sep 30, 2024
Via arXiv

Improving Generalization of Image Captioning with Unsupervised Prompt Learning

Aug 05, 2023
Via arXiv

Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning

Oct 28, 2021
Via arXiv