Speculating LLMs' Chinese Training Data Pollution from Their Tokens

Add code
Aug 25, 2025
Figure 1 for Speculating LLMs' Chinese Training Data Pollution from Their Tokens
Figure 2 for Speculating LLMs' Chinese Training Data Pollution from Their Tokens
Figure 3 for Speculating LLMs' Chinese Training Data Pollution from Their Tokens
Figure 4 for Speculating LLMs' Chinese Training Data Pollution from Their Tokens

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: