Xunjian Yin

Themis: Towards Flexible and Interpretable NLG Evaluation

Jun 26, 2024
Via arXiv

MC-MKE: A Fine-Grained Multimodal Knowledge Editing Benchmark Emphasizing Modality Consistency

Jun 19, 2024
Via arXiv

ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions

Jun 13, 2024
Via arXiv

Benchmarking Knowledge Boundary for Large Language Model: A Different Perspective on Model Evaluation

Feb 18, 2024
Via arXiv

History Matters: Temporal Knowledge Editing in Large Language Model

Dec 14, 2023
Via arXiv

ALCUNA: Large Language Models Meet New Knowledge

Oct 23, 2023
Via arXiv

A Comprehensive Evaluation and Analysis Study for Chinese Spelling Check

Jul 25, 2023
Via arXiv

Human-like Summarization Evaluation with ChatGPT

Apr 05, 2023
Via arXiv

Chinese Spelling Check with Nearest Neighbors

Nov 15, 2022
Via arXiv