Picture for Baosong Yang

Baosong Yang

Qwen2 Technical Report

Add code
Jul 16, 2024
Figure 1 for Qwen2 Technical Report
Figure 2 for Qwen2 Technical Report
Figure 3 for Qwen2 Technical Report
Figure 4 for Qwen2 Technical Report
Viaarxiv icon

MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting

Add code
Jun 25, 2024
Figure 1 for MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting
Figure 2 for MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting
Figure 3 for MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting
Figure 4 for MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting
Viaarxiv icon

AnyTrans: Translate AnyText in the Image with Large Scale Models

Add code
Jun 17, 2024
Figure 1 for AnyTrans: Translate AnyText in the Image with Large Scale Models
Figure 2 for AnyTrans: Translate AnyText in the Image with Large Scale Models
Figure 3 for AnyTrans: Translate AnyText in the Image with Large Scale Models
Figure 4 for AnyTrans: Translate AnyText in the Image with Large Scale Models
Viaarxiv icon

Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval

Add code
Jun 10, 2024
Figure 1 for Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval
Figure 2 for Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval
Figure 3 for Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval
Figure 4 for Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval
Viaarxiv icon

Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning

Add code
May 22, 2024
Figure 1 for Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning
Figure 2 for Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning
Figure 3 for Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning
Figure 4 for Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning
Viaarxiv icon

EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning

Add code
Oct 26, 2023
Figure 1 for EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning
Figure 2 for EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning
Figure 3 for EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning
Figure 4 for EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning
Viaarxiv icon

PolyLM: An Open Source Polyglot Large Language Model

Add code
Jul 12, 2023
Figure 1 for PolyLM: An Open Source Polyglot Large Language Model
Figure 2 for PolyLM: An Open Source Polyglot Large Language Model
Figure 3 for PolyLM: An Open Source Polyglot Large Language Model
Figure 4 for PolyLM: An Open Source Polyglot Large Language Model
Viaarxiv icon

Bridging the Domain Gaps in Context Representations for k-Nearest Neighbor Neural Machine Translation

Add code
May 26, 2023
Figure 1 for Bridging the Domain Gaps in Context Representations for k-Nearest Neighbor Neural Machine Translation
Figure 2 for Bridging the Domain Gaps in Context Representations for k-Nearest Neighbor Neural Machine Translation
Figure 3 for Bridging the Domain Gaps in Context Representations for k-Nearest Neighbor Neural Machine Translation
Figure 4 for Bridging the Domain Gaps in Context Representations for k-Nearest Neighbor Neural Machine Translation
Viaarxiv icon

From Statistical Methods to Deep Learning, Automatic Keyphrase Prediction: A Survey

Add code
May 04, 2023
Figure 1 for From Statistical Methods to Deep Learning, Automatic Keyphrase Prediction: A Survey
Figure 2 for From Statistical Methods to Deep Learning, Automatic Keyphrase Prediction: A Survey
Figure 3 for From Statistical Methods to Deep Learning, Automatic Keyphrase Prediction: A Survey
Figure 4 for From Statistical Methods to Deep Learning, Automatic Keyphrase Prediction: A Survey
Viaarxiv icon

Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors

Add code
Feb 17, 2023
Figure 1 for Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors
Figure 2 for Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors
Figure 3 for Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors
Figure 4 for Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors
Viaarxiv icon