Abstract:Grammatical Error Correction (GEC) and grammatical acceptability judgment (COLA) are core tasks in natural language processing, sharing foundational grammatical knowledge yet typically evolving independently. This paper introduces COLA-GEC, a novel bidirectional framework that enhances both tasks through mutual knowledge transfer. First, we augment grammatical acceptability models using GEC datasets, significantly improving their performance across multiple languages. Second, we integrate grammatical acceptability signals into GEC model training via a dynamic loss function, effectively guiding corrections toward grammatically acceptable outputs. Our approach achieves state-of-the-art results on several multilingual benchmarks. Comprehensive error analysis highlights remaining challenges, particularly in punctuation error correction, providing insights for future improvements in grammatical modeling.
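As a rough illustration of the dynamic loss idea described above, the sketch below combines a standard GEC sequence loss with a penalty from a grammatical acceptability classifier, with the penalty weight ramped up over training. This is an assumed formulation for illustration only, not the authors' released code; all function and argument names are hypothetical.

```python
# Hypothetical sketch: GEC seq2seq cross-entropy plus a dynamically weighted
# acceptability penalty, as the abstract describes at a high level.
import torch
import torch.nn.functional as F

def combined_gec_loss(lm_logits, target_ids, acceptability_logits, step, total_steps,
                      pad_id=0, max_weight=0.5):
    """Token-level GEC cross-entropy plus an acceptability penalty whose
    weight is ramped up over training (one possible 'dynamic' schedule)."""
    # Standard seq2seq cross-entropy over the corrected sentence.
    ce = F.cross_entropy(
        lm_logits.view(-1, lm_logits.size(-1)),
        target_ids.view(-1),
        ignore_index=pad_id,
    )
    # Probability that the model's output is grammatically acceptable,
    # as judged by a frozen acceptability classifier (class 1 = acceptable).
    p_acceptable = F.softmax(acceptability_logits, dim=-1)[:, 1]
    acceptability_penalty = (1.0 - p_acceptable).mean()
    # Dynamic weight: grows linearly from 0 to max_weight during training.
    lam = max_weight * min(1.0, step / max(1, total_steps))
    return ce + lam * acceptability_penalty
```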
Abstract:This paper introduces DualReward, a novel reinforcement learning framework for automatic distractor generation in cloze tests. Unlike conventional approaches that rely primarily on supervised learning or static generative models, our method employs a dual reward structure with adaptive scaling that differentiates between human-created gold standard distractors and model-generated candidates. The framework dynamically adjusts reward signal intensity based on model performance and confidence. We evaluate our approach on both passage-level (CLOTH-F) and sentence-level (MCQ) cloze test datasets, demonstrating consistent improvements over state-of-the-art baselines. Experimental results show that our adaptive reward scaling mechanism provides modest but consistent benefits on homogeneous datasets (CLOTH-F) and more substantial improvements (3.48-3.86% in P@1) on diverse, cross-domain data (MCQ), suggesting its particular effectiveness for handling varied question types and domains. Our work offers a flexible framework that effectively balances learning from reliable human examples while exploring novel, high-quality distractors for automated test generation.
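The adaptive dual reward can be pictured as follows: a fixed-strength reward for human-created gold distractors and a performance-scaled reward for model-generated candidates. The scaling rule and all names here are assumptions sketched from the abstract, not the paper's exact formulation.

```python
# Illustrative dual reward with adaptive scaling: gold (human) distractors and
# model-generated candidates receive separate reward signals, and the
# generated-candidate reward is scaled by recent model performance.
def dual_reward(is_gold: bool, quality_score: float, recent_accuracy: float,
                gold_weight: float = 1.0, base_gen_weight: float = 0.5) -> float:
    """quality_score in [0, 1], e.g. a discriminator or ranking score;
    recent_accuracy in [0, 1] tracks how well the model has been doing."""
    if is_gold:
        # Reliable human-created distractors get a fixed, full-strength reward.
        return gold_weight * quality_score
    # Model-generated candidates: scale the reward down while the model is
    # still weak, and up as its recent performance improves (exploration).
    adaptive_weight = base_gen_weight * (0.5 + 0.5 * recent_accuracy)
    return adaptive_weight * quality_score
```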
Abstract:Accurate recognition of personally identifiable information (PII) is central to automated text anonymization. This paper investigates the effectiveness of cross-domain model transfer, multi-domain data fusion, and sample-efficient learning for PII recognition. Using annotated corpora from healthcare (I2B2), legal (TAB), and biography (Wikipedia), we evaluate models across four dimensions: in-domain performance, cross-domain transferability, fusion, and few-shot learning. Results show legal-domain data transfers well to biographical texts, while medical domains resist incoming transfer. Fusion benefits are domain-specific, and high-quality recognition is achievable with only 10% of training data in low-specialization domains.
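A minimal sketch of the kind of in-domain / cross-domain / sample-efficiency grid the abstract describes is given below; `train_tagger` and `evaluate_f1` stand in for any NER-style training and scoring routine and are hypothetical placeholders.

```python
# Sketch of a transfer/few-shot evaluation grid over PII corpora: train a
# tagger on one domain (or a fraction of it) and evaluate on every domain.
import random

def transfer_grid(corpora, train_tagger, evaluate_f1, fractions=(0.1, 1.0)):
    """corpora: dict like {'I2B2': [...], 'TAB': [...], 'Wikipedia': [...]}."""
    results = {}
    for src, src_data in corpora.items():
        for frac in fractions:
            sample = random.sample(src_data, max(1, int(frac * len(src_data))))
            model = train_tagger(sample)
            for tgt, tgt_data in corpora.items():
                # in-domain when src == tgt, cross-domain transfer otherwise
                results[(src, tgt, frac)] = evaluate_f1(model, tgt_data)
    return results
```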
Abstract:Multi-level sentence simplification generates simplified sentences with varying language proficiency levels. We propose Label Confidence Weighted Learning (LCWL), a novel approach that incorporates a label confidence weighting scheme into the training loss of the encoder-decoder model, setting it apart from existing confidence-weighting methods primarily designed for classification. Experiments on an English grade-level simplification dataset show that LCWL outperforms state-of-the-art unsupervised baselines. Fine-tuning the LCWL model on in-domain data and combining it with Symmetric Cross Entropy (SCE) loss consistently delivers better simplifications than strong supervised methods. Our results highlight the effectiveness of label confidence weighting techniques for text simplification tasks with encoder-decoder architectures.
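A minimal sketch of a label-confidence-weighted training loss for an encoder-decoder, assuming each training pair carries a confidence score in [0, 1] that scales its cross-entropy contribution (the exact weighting scheme in LCWL may differ):

```python
# Assumed formulation: per-example label confidence scales the sequence loss.
import torch
import torch.nn.functional as F

def lcw_loss(lm_logits, target_ids, label_confidence, pad_id=0):
    """lm_logits: (batch, seq_len, vocab); target_ids: (batch, seq_len);
    label_confidence: (batch,) per-example confidence weights."""
    vocab = lm_logits.size(-1)
    token_ce = F.cross_entropy(
        lm_logits.view(-1, vocab), target_ids.view(-1),
        ignore_index=pad_id, reduction="none",
    ).view_as(target_ids).float()
    mask = (target_ids != pad_id).float()
    # Average cross-entropy per example, then weight by its label confidence.
    per_example = (token_ce * mask).sum(dim=1) / mask.sum(dim=1).clamp(min=1)
    return (label_confidence * per_example).mean()
```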
Abstract:This system report presents our approaches and results for the Chinese Essay Fluency Evaluation (CEFE) task at CCL-2024. For Track 1, we optimized predictions for challenging fine-grained error types using binary classification models and trained coarse-grained models on the Chinese Learner 4W corpus. In Track 2, we enhanced performance by constructing a pseudo-dataset with multiple error types per sentence. For Track 3, where we achieved first place, we generated fluency-rated pseudo-data via back-translation for pre-training and used an NSP-based strategy with Symmetric Cross Entropy loss to capture context and mitigate long-range dependency issues. Our methods effectively address key challenges in Chinese Essay Fluency Evaluation.
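For reference, the commonly used Symmetric Cross Entropy formulation combines standard cross-entropy with a reverse cross-entropy term that is robust to noisy labels; the sketch below uses that standard recipe with assumed default hyperparameters, not necessarily the exact settings of this system.

```python
# Standard Symmetric Cross Entropy sketch: CE plus reverse CE with a
# finite constant in place of log(0) on the label side.
import torch
import torch.nn.functional as F

def symmetric_cross_entropy(logits, labels, alpha=0.1, beta=1.0, clamp=-4.0):
    num_classes = logits.size(-1)
    ce = F.cross_entropy(logits, labels)
    pred = F.softmax(logits, dim=-1).clamp(min=1e-7, max=1.0)
    # Reverse CE: swap the roles of prediction and (one-hot) label,
    # replacing log(0) on the label side with a finite constant.
    one_hot = F.one_hot(labels, num_classes).float()
    log_labels = torch.clamp(torch.log(one_hot.clamp(min=1e-7)), min=clamp)
    rce = (-pred * log_labels).sum(dim=-1).mean()
    return alpha * ce + beta * rce
```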
Abstract:Cross-lingual word alignment plays a crucial role in various natural language processing tasks, particularly for low-resource languages. A recent study proposed a BiLSTM-based encoder-decoder model that outperforms pre-trained language models in low-resource settings. However, that model only considers the similarity of word embedding spaces and does not explicitly model the differences between word embeddings. To address this limitation, we propose incorporating contrastive learning into the BiLSTM-based encoder-decoder framework. Our approach introduces a multi-view negative sampling strategy to learn the differences between word pairs in the shared cross-lingual embedding space. We evaluate our model on five bilingual aligned datasets spanning four ASEAN languages: Lao, Vietnamese, Thai, and Indonesian. Experimental results demonstrate that integrating contrastive learning consistently improves word alignment accuracy across all datasets, confirming the effectiveness of the proposed method in low-resource scenarios. We will release our dataset and code to support future research on word alignment for ASEAN and other low-resource languages.
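One way to realize such a contrastive objective is an InfoNCE-style loss over aligned word pairs; the sketch below simplifies the multi-view negative sampling to in-batch negatives in both alignment directions, and the temperature and names are assumptions rather than the paper's exact design.

```python
# Illustrative contrastive objective for word alignment: aligned pairs are
# pulled together, in-batch negatives are pushed apart, in both directions.
import torch
import torch.nn.functional as F

def alignment_contrastive_loss(src_emb, tgt_emb, temperature=0.07):
    """src_emb, tgt_emb: (batch, dim) embeddings of aligned word pairs,
    row i of src_emb is aligned with row i of tgt_emb."""
    src = F.normalize(src_emb, dim=-1)
    tgt = F.normalize(tgt_emb, dim=-1)
    logits = src @ tgt.t() / temperature          # (batch, batch) similarities
    targets = torch.arange(src.size(0), device=src.device)
    # Symmetric InfoNCE: source-to-target and target-to-source directions.
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))
```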
Abstract:Protecting Personally Identifiable Information (PII) in text data is crucial for privacy, but current PII generalization methods face challenges such as uneven data distributions and limited context awareness. To address these issues, we propose two approaches: a feature-based method using machine learning to improve performance on structured inputs, and a novel context-aware framework that considers the broader context and semantic relationships between the original text and generalized candidates. The context-aware approach employs Multilingual-BERT for text representation, functional transformations, and mean squared error scoring to evaluate candidates. Experiments on the WikiReplace dataset demonstrate the effectiveness of both methods, with the context-aware approach outperforming the feature-based one across different scales. This work contributes to advancing PII generalization techniques by highlighting the importance of feature selection, ensemble learning, and incorporating contextual information for better privacy protection in text anonymization.
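The candidate-scoring step might look roughly like the sketch below: encode the original text and each generalized candidate with multilingual BERT, mean-pool, and rank candidates by MSE distance. The pooling choice is an assumption, and the functional-transformation layer mentioned in the abstract is omitted for brevity.

```python
# Rough sketch of MSE-based candidate ranking with multilingual BERT embeddings.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
encoder = AutoModel.from_pretrained("bert-base-multilingual-cased")

@torch.no_grad()
def embed(texts):
    batch = tokenizer(texts, return_tensors="pt", padding=True, truncation=True)
    hidden = encoder(**batch).last_hidden_state           # (n, seq, dim)
    mask = batch["attention_mask"].unsqueeze(-1).float()
    return (hidden * mask).sum(1) / mask.sum(1)            # mean pooling

def rank_candidates(original_text, candidates):
    ref = embed([original_text])                           # (1, dim)
    cand = embed(candidates)                               # (k, dim)
    scores = ((cand - ref) ** 2).mean(dim=-1)              # MSE per candidate
    return sorted(zip(candidates, scores.tolist()), key=lambda x: x[1])
```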
Abstract:Chinese sentence simplification faces challenges due to the lack of large-scale labeled parallel corpora and the prevalence of idioms. To address these challenges, we propose Readability-guided Idiom-aware Sentence Simplification (RISS), a novel framework that combines data augmentation techniques with lexical simplification. RISS introduces two key components: (1) Readability-guided Paraphrase Selection (RPS), a method for mining high-quality sentence pairs, and (2) Idiom-aware Simplification (IAS), a model that enhances the comprehension and simplification of idiomatic expressions. By integrating RPS and IAS using multi-stage and multi-task learning strategies, RISS outperforms previous state-of-the-art methods on two Chinese sentence simplification datasets. Furthermore, RISS achieves additional improvements when fine-tuned on a small labeled dataset. Our approach demonstrates the potential for more effective and accessible Chinese text simplification.
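The RPS component can be pictured as mining paraphrase pairs whose target side is measurably easier to read than the source; the readability proxy below (length plus rare-character count) is purely illustrative, and any Chinese readability scorer could be substituted.

```python
# Simplified illustration of readability-guided paraphrase-pair mining.
def readability(sentence, common_chars):
    rare = sum(1 for ch in sentence if ch not in common_chars)
    return len(sentence) + 2.0 * rare          # longer / rarer => harder

def mine_simplification_pairs(paraphrase_pairs, common_chars, margin=3.0):
    """paraphrase_pairs: iterable of (source, paraphrase) strings."""
    selected = []
    for src, para in paraphrase_pairs:
        if readability(src, common_chars) - readability(para, common_chars) >= margin:
            selected.append((src, para))       # paraphrase is clearly simpler
    return selected
```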
Abstract:Item difficulty plays a crucial role in adaptive testing. However, few works have focused on generating questions of varying difficulty levels, especially for multiple-choice (MC) cloze tests. We propose training pre-trained language models (PLMs) as surrogate models to enable item response theory (IRT) assessment, avoiding the need for human test subjects. We also propose two strategies to control the difficulty levels of both the gaps and the distractors using ranking rules to reduce invalid distractors. Experimentation on a benchmark dataset demonstrates that our proposed framework and methods can effectively control and evaluate the difficulty levels of MC cloze tests.
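The IRT side of the proposal can be sketched as fitting a standard 2PL item response curve to the answers of PLM "surrogate test takers"; the 2PL form below is standard, while treating model abilities as known fixed values is a simplifying assumption for illustration.

```python
# 2PL IRT sketch: estimate item difficulty from surrogate-model responses.
import math

def irt_2pl(ability, difficulty, discrimination=1.0):
    """Probability that a test taker with this ability answers the item correctly."""
    return 1.0 / (1.0 + math.exp(-discrimination * (ability - difficulty)))

def estimate_difficulty(responses, abilities, grid=None):
    """responses: list of 0/1 outcomes, one per surrogate model;
    abilities: matching list of ability values. Grid-search MLE for difficulty."""
    grid = grid or [b / 10 for b in range(-40, 41)]    # candidates from -4.0 to 4.0
    def log_lik(b):
        total = 0.0
        for r, a in zip(responses, abilities):
            p = min(max(irt_2pl(a, b), 1e-6), 1 - 1e-6)   # avoid log(0)
            total += r * math.log(p) + (1 - r) * math.log(1 - p)
        return total
    return max(grid, key=log_lik)
```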
Abstract:In the prompt-specific holistic score prediction task for Automatic Essay Scoring (AES), the general approaches include pre-trained neural models, coherence models, and hybrid models that incorporate syntactic features into neural models. In this paper, we propose a novel approach to extract and represent essay coherence features with prompt-learning NSP that is shown to match the state-of-the-art AES coherence model and achieves the best performance for long essays. We apply dense embeddings of syntactic features to augment a BERT-based model and achieve the best performance among hybrid AES methods. In addition, we explore various ways to combine coherence, syntactic information, and semantic embeddings, which no previous study has done. Our combined model also performs better than the best previously reported combined models, even though it does not outperform our syntactically enhanced neural model. We further offer analyses that can be useful for future studies.
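A minimal sketch of the feature-fusion idea, assuming the semantic essay embedding, NSP-derived coherence features, and dense syntactic features are simply concatenated and passed through a small regression head (the dimensions and single-layer head are illustrative assumptions, not the paper's architecture):

```python
# Illustrative fusion head combining semantic, coherence, and syntactic features.
import torch
import torch.nn as nn

class CombinedAESHead(nn.Module):
    def __init__(self, semantic_dim=768, coherence_dim=32, syntax_dim=64, hidden=256):
        super().__init__()
        self.fuse = nn.Sequential(
            nn.Linear(semantic_dim + coherence_dim + syntax_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 1),              # holistic essay score
        )

    def forward(self, semantic_emb, coherence_feats, syntax_feats):
        x = torch.cat([semantic_emb, coherence_feats, syntax_feats], dim=-1)
        return self.fuse(x).squeeze(-1)
```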