Alert button
Picture for Lixing Zhu

Lixing Zhu

Alert button

Are NLP Models Good at Tracing Thoughts: An Overview of Narrative Understanding

Oct 28, 2023
Lixing Zhu, Runcong Zhao, Lin Gui, Yulan He

Narrative understanding involves capturing the author's cognitive processes, providing insights into their knowledge, intentions, beliefs, and desires. Although large language models (LLMs) excel in generating grammatically coherent text, their ability to comprehend the author's thoughts remains uncertain. This limitation hinders the practical applications of narrative understanding. In this paper, we conduct a comprehensive survey of narrative understanding tasks, thoroughly examining their key features, definitions, taxonomy, associated datasets, training objectives, evaluation metrics, and limitations. Furthermore, we explore the potential of expanding the capabilities of modularized LLMs to address novel narrative understanding tasks. By framing narrative understanding as the retrieval of the author's imaginative cues that outline the narrative structure, our study introduces a fresh perspective on enhancing narrative comprehension.

Viaarxiv icon

NarrativePlay: Interactive Narrative Understanding

Oct 02, 2023
Runcong Zhao, Wenjia Zhang, Jiazheng Li, Lixing Zhu, Yanran Li, Yulan He, Lin Gui

Figure 1 for NarrativePlay: Interactive Narrative Understanding
Figure 2 for NarrativePlay: Interactive Narrative Understanding
Figure 3 for NarrativePlay: Interactive Narrative Understanding
Figure 4 for NarrativePlay: Interactive Narrative Understanding

In this paper, we introduce NarrativePlay, a novel system that allows users to role-play a fictional character and interact with other characters in narratives such as novels in an immersive environment. We leverage Large Language Models (LLMs) to generate human-like responses, guided by personality traits extracted from narratives. The system incorporates auto-generated visual display of narrative settings, character portraits, and character speech, greatly enhancing user experience. Our approach eschews predefined sandboxes, focusing instead on main storyline events extracted from narratives from the perspective of a user-selected character. NarrativePlay has been evaluated on two types of narratives, detective and adventure stories, where users can either explore the world or improve their favorability with the narrative characters through conversations.

Viaarxiv icon

Can Prompt Learning Benefit Radiology Report Generation?

Aug 30, 2023
Jun Wang, Lixing Zhu, Abhir Bhalerao, Yulan He

Figure 1 for Can Prompt Learning Benefit Radiology Report Generation?
Figure 2 for Can Prompt Learning Benefit Radiology Report Generation?
Figure 3 for Can Prompt Learning Benefit Radiology Report Generation?
Figure 4 for Can Prompt Learning Benefit Radiology Report Generation?

Radiology report generation aims to automatically provide clinically meaningful descriptions of radiology images such as MRI and X-ray. Although great success has been achieved in natural scene image captioning tasks, radiology report generation remains challenging and requires prior medical knowledge. In this paper, we propose PromptRRG, a method that utilizes prompt learning to activate a pretrained model and incorporate prior knowledge. Since prompt learning for radiology report generation has not been explored before, we begin with investigating prompt designs and categorise them based on varying levels of knowledge: common, domain-specific and disease-enriched prompts. Additionally, we propose an automatic prompt learning mechanism to alleviate the burden of manual prompt engineering. This is the first work to systematically examine the effectiveness of prompt learning for radiology report generation. Experimental results on the largest radiology report generation benchmark, MIMIC-CXR, demonstrate that our proposed method achieves state-of-the-art performance. Code will be available upon the acceptance.

* 8 pages with 6 pages supplementary file 
Viaarxiv icon

PANACEA: An Automated Misinformation Detection System on COVID-19

Feb 28, 2023
Runcong Zhao, Miguel Arana-Catania, Lixing Zhu, Elena Kochkina, Lin Gui, Arkaitz Zubiaga, Rob Procter, Maria Liakata, Yulan He

Figure 1 for PANACEA: An Automated Misinformation Detection System on COVID-19
Figure 2 for PANACEA: An Automated Misinformation Detection System on COVID-19
Figure 3 for PANACEA: An Automated Misinformation Detection System on COVID-19
Figure 4 for PANACEA: An Automated Misinformation Detection System on COVID-19

In this demo, we introduce a web-based misinformation detection system PANACEA on COVID-19 related claims, which has two modules, fact-checking and rumour detection. Our fact-checking module, which is supported by novel natural language inference methods with a self-attention network, outperforms state-of-the-art approaches. It is also able to give automated veracity assessment and ranked supporting evidence with the stance towards the claim to be checked. In addition, PANACEA adapts the bi-directional graph convolutional networks model, which is able to detect rumours based on comment networks of related tweets, instead of relying on the knowledge base. This rumour detection module assists by warning the users in the early stages when a knowledge base may not be available.

Viaarxiv icon

Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media

May 06, 2022
Lixing Zhu, Zheng Fang, Gabriele Pergola, Rob Procter, Yulan He

Figure 1 for Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media
Figure 2 for Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media
Figure 3 for Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media
Figure 4 for Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media

Building models to detect vaccine attitudes on social media is challenging because of the composite, often intricate aspects involved, and the limited availability of annotated data. Existing approaches have relied heavily on supervised training that requires abundant annotations and pre-defined aspect categories. Instead, with the aim of leveraging the large amount of unannotated data now available on vaccination, we propose a novel semi-supervised approach for vaccine attitude detection, called VADet. A variational autoencoding architecture based on language models is employed to learn from unlabelled data the topical information of the domain. Then, the model is fine-tuned with a few manually annotated examples of user attitudes. We validate the effectiveness of VADet on our annotated data and also on an existing vaccination corpus annotated with opinions on vaccines. Our results show that VADet is able to learn disentangled stance and aspect topics, and outperforms existing aspect-based sentiment analysis models on both stance detection and tweet clustering.

Viaarxiv icon

Optimal prediction for kernel-based semi-functional linear regression

Oct 29, 2021
Keli Guo, Jun Fan, Lixing Zhu

Figure 1 for Optimal prediction for kernel-based semi-functional linear regression
Figure 2 for Optimal prediction for kernel-based semi-functional linear regression
Figure 3 for Optimal prediction for kernel-based semi-functional linear regression

In this paper, we establish minimax optimal rates of convergence for prediction in a semi-functional linear model that consists of a functional component and a less smooth nonparametric component. Our results reveal that the smoother functional component can be learned with the minimax rate as if the nonparametric component were known. More specifically, a double-penalized least squares method is adopted to estimate both the functional and nonparametric components within the framework of reproducing kernel Hilbert spaces. By virtue of the representer theorem, an efficient algorithm that requires no iterations is proposed to solve the corresponding optimization problem, where the regularization parameters are selected by the generalized cross validation criterion. Numerical studies are provided to demonstrate the effectiveness of the method and to verify the theoretical analysis.

Viaarxiv icon

Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection

Jun 02, 2021
Lixing Zhu, Gabriele Pergola, Lin Gui, Deyu Zhou, Yulan He

Figure 1 for Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection
Figure 2 for Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection
Figure 3 for Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection
Figure 4 for Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection

Emotion detection in dialogues is challenging as it often requires the identification of thematic topics underlying a conversation, the relevant commonsense knowledge, and the intricate transition patterns between the affective states. In this paper, we propose a Topic-Driven Knowledge-Aware Transformer to handle the challenges above. We firstly design a topic-augmented language model (LM) with an additional layer specialized for topic detection. The topic-augmented LM is then combined with commonsense statements derived from a knowledge base based on the dialogue contextual information. Finally, a transformer-based encoder-decoder architecture fuses the topical and commonsense information, and performs the emotion label sequence prediction. The model has been experimented on four datasets in dialogue emotion detection, demonstrating its superiority empirically over the existing state-of-the-art approaches. Quantitative and qualitative results show that the model can discover topics which help in distinguishing emotion categories.

Viaarxiv icon

A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings

Aug 11, 2020
Lixing Zhu, Yulan He, Deyu Zhou

Figure 1 for A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings
Figure 2 for A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings
Figure 3 for A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings
Figure 4 for A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings

We propose a novel generative model to explore both local and global context for joint learning topics and topic-specific word embeddings. In particular, we assume that global latent topics are shared across documents, a word is generated by a hidden semantic vector encoding its contextual semantic meaning, and its context words are generated conditional on both the hidden semantic vector and global latent topics. Topics are trained jointly with the word embeddings. The trained model maps words to topic-dependent embeddings, which naturally addresses the issue of word polysemy. Experimental results show that the proposed model outperforms the word-level embedding methods in both word similarity evaluation and word sense disambiguation. Furthermore, the model also extracts more coherent topics compared with existing neural topic models or other models for joint learning of topics and word embeddings. Finally, the model can be easily integrated with existing deep contextualized word embedding learning methods to further improve the performance of downstream tasks such as sentiment classification.

Viaarxiv icon

Neural Temporal Opinion Modelling for Opinion Prediction on Twitter

May 27, 2020
Lixing Zhu, Yulan He, Deyu Zhou

Figure 1 for Neural Temporal Opinion Modelling for Opinion Prediction on Twitter
Figure 2 for Neural Temporal Opinion Modelling for Opinion Prediction on Twitter
Figure 3 for Neural Temporal Opinion Modelling for Opinion Prediction on Twitter
Figure 4 for Neural Temporal Opinion Modelling for Opinion Prediction on Twitter

Opinion prediction on Twitter is challenging due to the transient nature of tweet content and neighbourhood context. In this paper, we model users' tweet posting behaviour as a temporal point process to jointly predict the posting time and the stance label of the next tweet given a user's historical tweet sequence and tweets posted by their neighbours. We design a topic-driven attention mechanism to capture the dynamic topic shifts in the neighbourhood context. Experimental results show that the proposed model predicts both the posting time and the stance labels of future tweets more accurately compared to a number of competitive baselines.

* To appear at ACL2020 
Viaarxiv icon