Picture for Shaoxiong Ji

Shaoxiong Ji

Lucky 52: How Many Languages Are Needed to Instruction Fine-Tune Large Language Models?

Add code
Apr 07, 2024
Viaarxiv icon

Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning?

Add code
Mar 25, 2024
Viaarxiv icon

A New Massive Multilingual Dataset for High-Performance Language Technologies

Add code
Mar 20, 2024
Figure 1 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Figure 2 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Figure 3 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Figure 4 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Viaarxiv icon

MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki

Add code
Mar 12, 2024
Figure 1 for MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki
Figure 2 for MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki
Figure 3 for MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki
Figure 4 for MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki
Viaarxiv icon

MaLA-500: Massive Language Adaptation of Large Language Models

Add code
Jan 24, 2024
Viaarxiv icon

Rethinking Large Language Models in Mental Health Applications

Add code
Nov 19, 2023
Viaarxiv icon

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca

Add code
Sep 16, 2023
Figure 1 for Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca
Figure 2 for Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca
Figure 3 for Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca
Figure 4 for Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca
Viaarxiv icon

Content Reduction, Surprisal and Information Density Estimation for Long Documents

Add code
Sep 12, 2023
Figure 1 for Content Reduction, Surprisal and Information Density Estimation for Long Documents
Figure 2 for Content Reduction, Surprisal and Information Density Estimation for Long Documents
Figure 3 for Content Reduction, Surprisal and Information Density Estimation for Long Documents
Figure 4 for Content Reduction, Surprisal and Information Density Estimation for Long Documents
Viaarxiv icon

A Bipartite Graph is All We Need for Enhancing Emotional Reasoning with Commonsense Knowledge

Add code
Aug 09, 2023
Figure 1 for A Bipartite Graph is All We Need for Enhancing Emotional Reasoning with Commonsense Knowledge
Figure 2 for A Bipartite Graph is All We Need for Enhancing Emotional Reasoning with Commonsense Knowledge
Figure 3 for A Bipartite Graph is All We Need for Enhancing Emotional Reasoning with Commonsense Knowledge
Figure 4 for A Bipartite Graph is All We Need for Enhancing Emotional Reasoning with Commonsense Knowledge
Viaarxiv icon

Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health

Add code
Apr 20, 2023
Figure 1 for Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health
Figure 2 for Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health
Figure 3 for Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health
Figure 4 for Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health
Viaarxiv icon