Marie-Francine Moens

NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties

Feb 02, 2024

Jingyuan Sun, Mingxiao Li, Zijiao Chen, Marie-Francine Moens

Abstract: In the pursuit of understanding the intricacies of the human brain's visual processing, reconstructing dynamic visual experiences from brain activity emerges as a challenging yet fascinating endeavor. While recent advances have succeeded in reconstructing static images from non-invasive brain recordings, translating continuous brain activity into video remains underexplored. In this work, we introduce NeuroCine, a novel dual-phase framework targeting the inherent challenges of decoding fMRI data, such as noise, spatial redundancy and temporal lags. The framework proposes spatial masking and temporal interpolation-based augmentation for contrastive learning of fMRI representations, and a diffusion model enhanced by dependent prior noise for video generation. Tested on a publicly available fMRI dataset, our method shows promising results, outperforming the previous state-of-the-art models by notable margins of 20.97%, 31.00% and 12.30%, respectively, on decoding the brain activity of the three subjects in the fMRI dataset, as measured by SSIM. Additionally, our attention analysis suggests that the model aligns with existing brain structures and functions, indicating its biological plausibility and interpretability.

* under review
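
The two augmentations named in the abstract above, spatial masking and temporal interpolation, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names, the flat list-of-voxels frame layout, the zero-fill masking, and the linear blend between neighbouring frames are hypothetical and are not taken from the NeuroCine implementation.

```python
import random


def spatial_mask(frame, mask_ratio, rng):
    """Zero out a random subset of voxels in one fMRI frame (assumed masking scheme)."""
    n_masked = int(len(frame) * mask_ratio)
    masked_idx = set(rng.sample(range(len(frame)), n_masked))
    return [0.0 if i in masked_idx else v for i, v in enumerate(frame)]


def temporal_interpolate(frames, alpha):
    """Blend each frame with its successor: (1 - alpha) * x_t + alpha * x_{t+1}."""
    return [
        [(1 - alpha) * a + alpha * b for a, b in zip(cur, nxt)]
        for cur, nxt in zip(frames, frames[1:])
    ]


# Toy data: 3 time steps, 4 voxels each (hypothetical values).
rng = random.Random(0)
frames = [[1.0, 2.0, 3.0, 4.0], [3.0, 4.0, 5.0, 6.0], [5.0, 6.0, 7.0, 8.0]]

# Two augmented "views" of the same recording, as a contrastive learner might consume.
view_a = [spatial_mask(f, 0.5, rng) for f in frames]
view_b = temporal_interpolate(frames, 0.5)
```

In a contrastive setup, views derived from the same recording would be treated as a positive pair; how the paper pairs and weights them is not stated in the abstract.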

Explicitly Representing Syntax Improves Sentence-to-layout Prediction of Unexpected Situations

Jan 25, 2024

Wolf Nuyts, Ruben Cartuyvels, Marie-Francine Moens

Abstract: Recognizing visual entities in a natural language sentence and arranging them in a 2D spatial layout require a compositional understanding of language and space. This task of layout prediction is valuable in text-to-image synthesis as it allows localized and controlled in-painting of the image. In this comparative study, it is shown that we can predict layouts from language representations that implicitly or explicitly encode sentence syntax, if the sentences mention entity relationships similar to the ones seen during training. To test compositional understanding, we collect a test set of grammatically correct sentences and layouts describing compositions of entities and relations that are unlikely to have been seen during training. Performance on this test set drops substantially, showing that current models rely on correlations in the training data and have difficulties in understanding the structure of the input sentences. We propose a novel structural loss function that better enforces the syntactic structure of the input sentence and show large performance gains in the task of 2D spatial layout prediction conditioned on text. The loss has the potential to be used in other generation tasks where a tree-like structure underlies the conditioning modality. Code, trained models and the USCOCO evaluation set will be made available via GitHub.

Interpretation modeling: Social grounding of sentences by reasoning over their implicit moral judgments

Nov 27, 2023

Liesbeth Allein, Maria Mihaela Truşcǎ, Marie-Francine Moens

Abstract: The social and implicit nature of human communication ramifies readers' understandings of written sentences. Single gold-standard interpretations rarely exist, challenging conventional assumptions in natural language processing. This work introduces the interpretation modeling (IM) task, which involves modeling several interpretations of a sentence's underlying semantics to unearth layers of implicit meaning. To obtain these, IM is guided by multiple annotations of social relation and common ground, in this work approximated by reader attitudes towards the author and their understanding of moral judgments subtly embedded in the sentence. We propose a number of modeling strategies that rely on one-to-one and one-to-many generation methods that take inspiration from the philosophical study of interpretation. A first-of-its-kind IM dataset is curated to support experiments and analyses. The modeling results, coupled with scrutiny of the dataset, underline the challenges of IM, as conflicting and complex interpretations are socially plausible. This interplay of diverse readings is affirmed by automated and human evaluations of the generated interpretations. Finally, toxicity analyses of the generated interpretations demonstrate the importance of IM for refining content filters and assisting content moderators in safeguarding the safety of online discourse.

Text Augmentations with R-drop for Classification of Tweets Self Reporting Covid-19

Nov 06, 2023

Sumam Francis, Marie-Francine Moens

Abstract: This paper presents models created for the Social Media Mining for Health 2023 shared task. Our team addressed the first task, classifying tweets that self-report a Covid-19 diagnosis. Our approach involves a classification model that incorporates diverse textual augmentations and utilizes R-drop to augment data and mitigate overfitting, boosting model efficacy. Our leading model, enhanced with R-drop and augmentations like synonym substitution, reserved words, and back translations, outperforms the task mean and median scores. Our system achieves an F1 score of 0.877 on the test set.

* This paper has been peer-reviewed and accepted for presentation at SMM4H'23 at AMIA 2023 Annual Symposium
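
For readers unfamiliar with R-drop, the regularizer mentioned in the abstract above can be sketched as follows. This is a generic illustration of the R-drop objective as commonly described (two dropout-perturbed forward passes of the same input, the sum of their cross-entropy losses, plus a symmetric KL term weighted by `alpha`), not the authors' code. The function names and the `alpha` default are assumptions, and the two logit vectors stand in for the two forward passes.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Negative log-likelihood of the gold class index."""
    return -math.log(probs[label])


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions with full support."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


def r_drop_loss(logits_a, logits_b, label, alpha=1.0):
    """Cross-entropy of both passes plus alpha times the symmetric KL between them."""
    p, q = softmax(logits_a), softmax(logits_b)
    ce = cross_entropy(p, label) + cross_entropy(q, label)
    sym_kl = 0.5 * (kl_divergence(p, q) + kl_divergence(q, p))
    return ce + alpha * sym_kl


# Two passes that disagree slightly are penalised more than two identical passes.
loss_same = r_drop_loss([2.0, 0.5], [2.0, 0.5], label=0)
loss_diff = r_drop_loss([2.0, 0.5], [1.0, 1.5], label=0)
```

Some implementations average the two cross-entropy terms instead of summing them; the sketch uses the sum.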

Injecting Categorical Labels and Syntactic Information into Biomedical NER

Nov 06, 2023

Sumam Francis, Marie-Francine Moens

Abstract: We present a simple approach to improve biomedical named entity recognition (NER) by injecting categorical labels and part-of-speech (POS) information into the model. We use two approaches. In the first approach, we first train a sequence-level classifier to classify the sentences into categories to obtain the sentence-level tags (categorical labels). The sequence classifier is modeled as an entailment problem by rewriting the labels as a natural language template. This helps to improve the accuracy of the classifier. Further, this label information is injected into the NER model. In this paper, we demonstrate effective ways to represent and inject these labels and POS attributes into the NER model. In the second approach, we jointly learn the categorical labels and NER labels. Here we also inject the POS tags into the model to increase the syntactic context of the model. Experiments on three benchmark datasets show that incorporating categorical label information with syntactic context is quite useful and outperforms baseline BERT-based models.

* Proceedings of the 18th Conference on Computational Intelligence Methods for Bioinformatics & Biostatistics (CIBB 2023)
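
The idea of casting sentence classification as entailment by turning labels into a natural language template can be sketched roughly as below. The template wording, the label names, the example sentence, and the `build_entailment_pairs` helper are all hypothetical; the abstract does not give the actual template or label set.

```python
def build_entailment_pairs(sentence, gold_label, labels,
                           template="This sentence is about {}."):
    """Pair the sentence (premise) with one templated hypothesis per candidate label.

    The hypothesis built from the gold label is marked "entailment";
    all other hypotheses are marked "not_entailment".
    """
    return [
        (
            sentence,
            template.format(label),
            "entailment" if label == gold_label else "not_entailment",
        )
        for label in labels
    ]


# Hypothetical category set and input sentence.
labels = ["disease", "chemical", "gene"]
pairs = build_entailment_pairs(
    "Aspirin reduces the risk of stroke.", gold_label="chemical", labels=labels
)
```

Each triple can then be fed to any premise/hypothesis classifier, and the label whose hypothesis receives the highest entailment score becomes the sentence-level tag.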

CORE: A Few-Shot Company Relation Classification Dataset for Robust Domain Adaptation

Oct 18, 2023

Philipp Borchert, Jochen De Weerdt, Kristof Coussement, Arno De Caigny, Marie-Francine Moens

Abstract: We introduce CORE, a dataset for few-shot relation classification (RC) focused on company relations and business entities. CORE includes 4,708 instances of 12 relation types with corresponding textual evidence extracted from company Wikipedia pages. Company names and business entities pose a challenge for few-shot RC models due to the rich and diverse information associated with them. For example, a company name may represent the legal entity, products, people, or business divisions depending on the context. Therefore, deriving the relation type between entities is highly dependent on textual context. To evaluate the performance of state-of-the-art RC models on the CORE dataset, we conduct experiments in the few-shot domain adaptation setting. Our results reveal substantial performance gaps, confirming that models trained on different domains struggle to adapt to CORE. Interestingly, we find that models trained on CORE showcase improved out-of-domain performance, which highlights the importance of high-quality data for robust domain adaptation. Specifically, the information richness embedded in business entities allows models to focus on contextual nuances, reducing their reliance on superficial clues such as relation-specific verbs. In addition to the dataset, we provide relevant code snippets to facilitate reproducibility and encourage further research in the field.

* Accepted to EMNLP 2023 main conference

Tuning In to Neural Encoding: Linking Human Brain and Artificial Supervised Representations of Language

Oct 05, 2023

Jingyuan Sun, Xiaohan Zhang, Marie-Francine Moens

Abstract: To understand the algorithm that supports the human brain's language representation, previous research has attempted to predict neural responses to linguistic stimuli using embeddings generated by artificial neural networks (ANNs), a process known as neural encoding. However, most of these studies have focused on probing neural representations of Germanic languages, such as English, with unsupervised ANNs. In this paper, we propose to bridge the gap between human brain and supervised ANN representations of the Chinese language. Specifically, we investigate how task tuning influences a pretrained Transformer for neural encoding and which tasks lead to the best encoding performance. We generate supervised representations on eight Natural Language Understanding (NLU) tasks using prompt-tuning, a technique that is seldom explored in neural encoding for language. We demonstrate that prompt-tuning yields representations that better predict neural responses to Chinese stimuli than traditional fine-tuning on four tasks. Furthermore, we discover that tasks that require fine-grained processing of concepts and entities lead to representations that are most predictive of brain activation patterns. Additionally, we reveal that the proportion of tuned parameters highly influences the neural encoding performance of fine-tuned models. Overall, our experimental findings could help us better understand the relationship between supervised artificial and brain language representations.

* ECAI 2023

Fine-tuned vs. Prompt-tuned Supervised Representations: Which Better Account for Brain Language Representations?

Oct 03, 2023

Jingyuan Sun, Marie-Francine Moens

Abstract: To decipher the algorithm underlying the human brain's language representation, previous work probed brain responses to language input with pre-trained artificial neural network (ANN) models fine-tuned on NLU tasks. However, full fine-tuning generally updates the entire parametric space and distorts pre-trained features, which is cognitively inconsistent with the brain's robust multi-task learning ability. Prompt-tuning, in contrast, protects pre-trained weights and learns task-specific embeddings to fit a task. Could prompt-tuning generate representations that better account for the brain's language representations than fine-tuning? If so, what kind of NLU task leads a pre-trained model to better decode the information represented in the human brain? We investigate these questions by comparing prompt-tuned and fine-tuned representations in neural decoding, that is, predicting the linguistic stimulus from the brain activity evoked by the stimulus. We find that full fine-tuning does not significantly outperform prompt-tuning in neural decoding on any of the 10 NLU tasks, implying that a more brain-consistent tuning method yields representations that better correlate with brain data. Moreover, we identify that tasks dealing with fine-grained concept meaning yield representations that better decode brain activation patterns than other tasks, especially the syntactic chunking task. This indicates that our brain encodes more fine-grained concept information than shallow syntactic information when representing languages.

* IJCAI 2023

Generating Explanations in Medical Question-Answering by Expectation Maximization Inference over Evidence

Oct 02, 2023

Wei Sun, Mingxiao Li, Damien Sileo, Jesse Davis, Marie-Francine Moens

Abstract: Medical Question Answering (medical QA) systems play an essential role in assisting healthcare workers in finding answers to their questions. However, it is not sufficient for medical QA systems to merely provide answers, because users might want explanations, that is, more analytic statements in natural language that describe the elements and context that support the answer. To this end, we propose a novel approach for generating natural language explanations for answers predicted by medical QA systems. As high-quality medical explanations require additional medical knowledge, our system extracts knowledge from medical textbooks to enhance the quality of explanations during the explanation generation process. Concretely, we designed an expectation-maximization approach that makes inferences about the evidence found in these texts, offering an efficient way to focus attention on lengthy evidence passages. Experimental results, conducted on two datasets, MQAE-diag and MQAE, demonstrate the effectiveness of our framework for reasoning with textual evidence. Our approach outperforms state-of-the-art models, achieving a significant improvement of 6.86 and 9.43 percentage points on the Rouge-1 score and 8.23 and 7.82 percentage points on the Bleu-4 score on the respective datasets.

Decoding Realistic Images from Brain Activity with Contrastive Self-supervision and Latent Diffusion

Sep 30, 2023

Jingyuan Sun, Mingxiao Li, Marie-Francine Moens

Abstract: Reconstructing visual stimuli from human brain activity provides a promising opportunity to advance our understanding of the brain's visual system and its connection with computer vision models. Although deep generative models have been employed for this task, the challenge of generating high-quality images with accurate semantics persists due to the intricate underlying representations of brain signals and the limited availability of parallel data. In this paper, we propose a two-phase framework named Contrast and Diffuse (CnD) to decode realistic images from functional magnetic resonance imaging (fMRI) recordings. In the first phase, we acquire representations of fMRI data through self-supervised contrastive learning. In the second phase, the encoded fMRI representations condition the diffusion model to reconstruct the visual stimulus through our proposed concept-aware conditioning method. Experimental results show that CnD reconstructs highly plausible images on challenging benchmarks. We also provide a quantitative interpretation of the connection between the latent diffusion model (LDM) components and the human brain's visual system. In summary, we present an effective approach for reconstructing visual stimuli based on human brain activity and offer a novel framework to understand the relationship between the diffusion model and the human brain's visual system.

* 8 pages, 5 figures
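
The first phase described in the abstract above relies on self-supervised contrastive learning. The abstract does not say which objective is used, so the sketch below shows a common choice, an InfoNCE-style loss over cosine similarities, purely as an assumed illustration. The function names, the temperature value, and the toy embeddings are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(view_a, view_b, temperature=0.1):
    """Mean InfoNCE loss: view_a[i] should match view_b[i] against all other view_b[j]."""
    losses = []
    for i, anchor in enumerate(view_a):
        logits = [cosine(anchor, other) / temperature for other in view_b]
        m = max(logits)
        # log-sum-exp of all logits minus the logit of the positive pair
        log_denominator = m + math.log(sum(math.exp(x - m) for x in logits))
        losses.append(log_denominator - logits[i])
    return sum(losses) / len(losses)


# Toy embeddings of two augmented views of two fMRI samples (hypothetical values).
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned = info_nce(anchors, [[0.9, 0.1], [0.1, 0.9]])
shuffled = info_nce(anchors, [[0.1, 0.9], [0.9, 0.1]])
```

The loss is small when each sample's two views are closest to each other (`aligned`) and large when the pairing is broken (`shuffled`), which is the property a contrastive encoder is trained to exploit.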
