Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Runze Xia

Improving Brain-to-Image Reconstruction via Fine-Grained Text Bridging

May 29, 2025

Runze Xia, Shuo Feng, Renzhi Wang, Congchi Yin, Xuyun Wen, Piji Li

Abstract:Brain-to-Image reconstruction aims to recover visual stimuli perceived by humans from brain activity. However, the reconstructed visual stimuli often missing details and semantic inconsistencies, which may be attributed to insufficient semantic information. To address this issue, we propose an approach named Fine-grained Brain-to-Image reconstruction (FgB2I), which employs fine-grained text as bridge to improve image reconstruction. FgB2I comprises three key stages: detail enhancement, decoding fine-grained text descriptions, and text-bridged brain-to-image reconstruction. In the detail-enhancement stage, we leverage large vision-language models to generate fine-grained captions for visual stimuli and experimentally validate its importance. We propose three reward metrics (object accuracy, text-image semantic similarity, and image-image semantic similarity) to guide the language model in decoding fine-grained text descriptions from fMRI signals. The fine-grained text descriptions can be integrated into existing reconstruction methods to achieve fine-grained Brain-to-Image reconstruction.

* CogSci2025

Via

Access Paper or Ask Questions

Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information

Sep 30, 2024

Runze Xia, Congchi Yin, Piji Li

Figure 1 for Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information

Figure 2 for Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information

Figure 3 for Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information

Figure 4 for Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information

Abstract:The human visual system is capable of processing continuous streams of visual information, but how the brain encodes and retrieves recent visual memories during continuous visual processing remains unexplored. This study investigates the capacity of working memory to retain past information under continuous visual stimuli. And then we propose a new task Memory Disentangling, which aims to extract and decode past information from fMRI signals. To address the issue of interference from past memory information, we design a disentangled contrastive learning method inspired by the phenomenon of proactive interference. This method separates the information between adjacent fMRI signals into current and past components and decodes them into image descriptions. Experimental results demonstrate that this method effectively disentangles the information within fMRI signals. This research could advance brain-computer interfaces and mitigate the problem of low temporal resolution in fMRI.

* Accepted at Main Conference of EMNLP 2024

Via

Access Paper or Ask Questions