Picture for Baosong Yang

Baosong Yang

additional authors not shown

PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts

Add code
Apr 30, 2025
Viaarxiv icon

Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training

Add code
Apr 29, 2025
Viaarxiv icon

Sampling-Efficient Test-Time Scaling: Self-Estimating the Best-of-N Sampling in Early Decoding

Add code
Mar 03, 2025
Viaarxiv icon

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Add code
Jan 10, 2025
Figure 1 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Figure 2 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Figure 3 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Figure 4 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Viaarxiv icon

Qwen2.5 Technical Report

Add code
Dec 19, 2024
Viaarxiv icon

P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs

Add code
Nov 14, 2024
Figure 1 for P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
Figure 2 for P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
Figure 3 for P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
Figure 4 for P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
Viaarxiv icon

ZhoBLiMP: a Systematic Assessment of Language Models with Linguistic Minimal Pairs in Chinese

Add code
Nov 09, 2024
Viaarxiv icon

Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation

Add code
Oct 29, 2024
Figure 1 for Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation
Figure 2 for Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation
Figure 3 for Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation
Figure 4 for Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation
Viaarxiv icon

Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation

Add code
Oct 17, 2024
Figure 1 for Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation
Figure 2 for Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation
Figure 3 for Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation
Figure 4 for Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation
Viaarxiv icon

Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning

Add code
Oct 03, 2024
Figure 1 for Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning
Figure 2 for Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning
Figure 3 for Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning
Figure 4 for Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning
Viaarxiv icon