Picture for Liangyou Li

Liangyou Li

Chain-of-Probe: Examing the Necessity and Accuracy of CoT Step-by-Step

Add code
Jun 23, 2024
Viaarxiv icon

Mitigating Large Language Model Hallucination with Faithful Finetuning

Add code
Jun 17, 2024
Figure 1 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Figure 2 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Figure 3 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Figure 4 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Viaarxiv icon

Learning to Edit: Aligning LLMs with Knowledge Editing

Add code
Feb 19, 2024
Viaarxiv icon

MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models

Add code
Jan 30, 2024
Viaarxiv icon

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models

Add code
Nov 14, 2023
Figure 1 for FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
Figure 2 for FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
Figure 3 for FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
Figure 4 for FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
Viaarxiv icon

M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models

Add code
Oct 30, 2023
Figure 1 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Figure 2 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Figure 3 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Figure 4 for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Viaarxiv icon

Aligning Large Language Models with Human: A Survey

Add code
Jul 24, 2023
Viaarxiv icon

Evaluating the Efficacy of Length-Controllable Machine Translation

Add code
May 03, 2023
Figure 1 for Evaluating the Efficacy of Length-Controllable Machine Translation
Figure 2 for Evaluating the Efficacy of Length-Controllable Machine Translation
Figure 3 for Evaluating the Efficacy of Length-Controllable Machine Translation
Figure 4 for Evaluating the Efficacy of Length-Controllable Machine Translation
Viaarxiv icon

End-to-end Training and Decoding for Pivot-based Cascaded Translation Model

Add code
May 03, 2023
Figure 1 for End-to-end Training and Decoding for Pivot-based Cascaded Translation Model
Figure 2 for End-to-end Training and Decoding for Pivot-based Cascaded Translation Model
Figure 3 for End-to-end Training and Decoding for Pivot-based Cascaded Translation Model
Figure 4 for End-to-end Training and Decoding for Pivot-based Cascaded Translation Model
Viaarxiv icon

AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation

Add code
Dec 17, 2022
Figure 1 for AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation
Figure 2 for AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation
Figure 3 for AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation
Figure 4 for AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation
Viaarxiv icon