Ziheng Qiao

Mixture of Small and Large Models for Chinese Spelling Check

Jun 07, 2025

Ziheng Qiao, Houquan Zhou, Zhenghua Li

Abstract: In the era of large language models (LLMs), the Chinese Spelling Check (CSC) task has seen various LLM methods developed, yet their performance remains unsatisfactory. In contrast, fine-tuned BERT-based models, relying on high-quality in-domain data, show excellent performance but suffer from edit pattern overfitting. This paper proposes a novel dynamic mixture approach that effectively combines the probability distributions of small models and LLMs during the beam search decoding phase, achieving a balanced enhancement of precise corrections from small models and the fluency of LLMs. This approach also eliminates the need for fine-tuning LLMs, saving significant time and resources, and facilitating domain adaptation. Comprehensive experiments demonstrate that our mixture approach significantly boosts error correction capabilities, achieving state-of-the-art results across multiple datasets. Our code is available at https://github.com/zhqiao-nlp/MSLLM.
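
The abstract describes combining the probability distributions of a small model and an LLM during beam search. The following is only a minimal sketch of that general idea, not the authors' implementation: it assumes a fixed log-linear interpolation weight (the paper's dynamic weighting is not reproduced here), and the names `mix_scores`, `beam_step`, `toy_small`, and `toy_llm`, as well as all probability values, are hypothetical.

```python
import math

def mix_scores(small_probs, llm_probs, weight=0.5, floor=1e-9):
    """Log-linear interpolation of two next-character distributions.

    The fixed `weight` is an assumption made for illustration only.
    """
    candidates = set(small_probs) | set(llm_probs)
    return {
        ch: weight * math.log(max(small_probs.get(ch, 0.0), floor))
        + (1.0 - weight) * math.log(max(llm_probs.get(ch, 0.0), floor))
        for ch in candidates
    }

def beam_step(beams, small_fn, llm_fn, beam_size=2, weight=0.5):
    """Extend each (prefix, score) beam by one character; keep the best beam_size."""
    expanded = []
    for prefix, score in beams:
        step = mix_scores(small_fn(prefix), llm_fn(prefix), weight)
        for ch, s in step.items():
            expanded.append((prefix + ch, score + s))
    expanded.sort(key=lambda item: item[1], reverse=True)
    return expanded[:beam_size]

# Toy stand-ins for the two models (hypothetical numbers): the small model
# leans towards copying the misspelled character, the LLM towards the fluent one.
def toy_small(prefix):
    return {"平": 0.6, "苹": 0.4}

def toy_llm(prefix):
    return {"苹": 0.9, "平": 0.1}

beams = beam_step([("我喜欢吃", 0.0)], toy_small, toy_llm)
print(beams[0][0])
```

In this toy step the mixed score favours the candidate that both models find plausible, even though the small model alone ranks the misspelled character first. A real system would query both models with the current prefix at every position.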

DISC: Plug-and-Play Decoding Intervention with Similarity of Characters for Chinese Spelling Check

Dec 17, 2024

Ziheng Qiao, Houquan Zhou, Yumeng Liu, Zhenghua Li, Min Zhang, Bo Zhang, Chen Li, Ji Zhang, Fei Huang

Abstract: One key characteristic of the Chinese spelling check (CSC) task is that incorrect characters are usually similar to the correct ones in either phonetics or glyph. To accommodate this, previous works usually leverage confusion sets, which suffer from two problems, i.e., difficulty in determining which character pairs to include and lack of probabilities to distinguish items in the set. In this paper, we propose a light-weight plug-and-play DISC (i.e., decoding intervention with similarity of characters) module for CSC models. DISC measures phonetic and glyph similarities between characters and incorporates this similarity information only during the inference phase. This method can be easily integrated into various existing CSC models, such as ReaLiSe, SCOPE, and ReLM, without additional training costs. Experiments on three CSC benchmarks demonstrate that our proposed method significantly improves model performance, approaching and even surpassing the current state-of-the-art models.
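
The abstract says that similarity information is incorporated only at inference time. The sketch below illustrates one way such an intervention could look; it is an assumption-laden toy, not the DISC module itself. The additive form, the weight `alpha`, the treatment of the unchanged character (similarity 1.0), the function name `intervene`, and every number in the tables are hypothetical, since the abstract does not specify how the similarities are computed or combined.

```python
def intervene(input_char, model_probs, similarity, alpha=0.5):
    """Re-rank candidate characters by adding a similarity bonus.

    `similarity` maps (input_char, candidate) pairs to a value in [0, 1].
    The additive combination is an assumption made for illustration.
    """
    scores = {}
    for cand, prob in model_probs.items():
        if cand == input_char:
            sim = 1.0  # assumed: a character is fully similar to itself
        else:
            sim = similarity.get((input_char, cand), 0.0)
        scores[cand] = prob + alpha * sim
    best = max(scores, key=scores.get)
    return best, scores

# Hypothetical model output for the misspelled character "平" in "平果".
model_probs = {"水": 0.45, "苹": 0.40, "平": 0.05}
# Hypothetical similarity table: "平" and "苹" share a pronunciation and a component.
similarity = {("平", "苹"): 0.9}

best, scores = intervene("平", model_probs, similarity)
print(best)
```

Without the bonus, the toy model would pick the dissimilar character with the highest raw probability. With it, the phonetically and visually similar candidate is ranked first, and no retraining of the underlying model is involved.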
