Picture for Shujian Huang

Shujian Huang

Multilingual Contrastive Decoding via Language-Agnostic Layers Skipping

Add code
Jul 15, 2024
Viaarxiv icon

Large Language Models Are Cross-Lingual Knowledge-Free Reasoners

Add code
Jun 24, 2024
Figure 1 for Large Language Models Are Cross-Lingual Knowledge-Free Reasoners
Figure 2 for Large Language Models Are Cross-Lingual Knowledge-Free Reasoners
Figure 3 for Large Language Models Are Cross-Lingual Knowledge-Free Reasoners
Figure 4 for Large Language Models Are Cross-Lingual Knowledge-Free Reasoners
Viaarxiv icon

Limited Out-of-Context Knowledge Reasoning in Large Language Models

Add code
Jun 11, 2024
Viaarxiv icon

Extroversion or Introversion? Controlling The Personality of Your Large Language Models

Add code
Jun 07, 2024
Viaarxiv icon

Large Language Models are Good Spontaneous Multilingual Learners: Is the Multilingual Annotated Data Necessary?

Add code
May 22, 2024
Viaarxiv icon

Why Not Transform Chat Large Language Models to Non-English?

Add code
May 22, 2024
Figure 1 for Why Not Transform Chat Large Language Models to Non-English?
Figure 2 for Why Not Transform Chat Large Language Models to Non-English?
Figure 3 for Why Not Transform Chat Large Language Models to Non-English?
Figure 4 for Why Not Transform Chat Large Language Models to Non-English?
Viaarxiv icon

The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights

Add code
May 02, 2024
Figure 1 for The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights
Figure 2 for The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights
Figure 3 for The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights
Figure 4 for The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights
Viaarxiv icon

Enforcing Paraphrase Generation via Controllable Latent Diffusion

Add code
Apr 13, 2024
Figure 1 for Enforcing Paraphrase Generation via Controllable Latent Diffusion
Figure 2 for Enforcing Paraphrase Generation via Controllable Latent Diffusion
Figure 3 for Enforcing Paraphrase Generation via Controllable Latent Diffusion
Figure 4 for Enforcing Paraphrase Generation via Controllable Latent Diffusion
Viaarxiv icon

Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly

Add code
Apr 06, 2024
Figure 1 for Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly
Figure 2 for Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly
Figure 3 for Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly
Figure 4 for Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly
Viaarxiv icon

MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation

Add code
Apr 01, 2024
Figure 1 for MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation
Figure 2 for MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation
Figure 3 for MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation
Figure 4 for MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation
Viaarxiv icon