Kexin Yang

Qwen3 Technical Report

May 14, 2025
Via arXiv

DataMan: Data Manager for Pre-training Large Language Models

Feb 26, 2025
Via arXiv

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Feb 20, 2025
Via arXiv

Qwen2.5-1M Technical Report

Jan 26, 2025
Via arXiv

Qwen2.5 Technical Report

Dec 19, 2024
Via arXiv

Robust Simultaneous Multislice MRI Reconstruction Using Deep Generative Priors

Jul 31, 2024
Via arXiv

Qwen2 Technical Report

Jul 16, 2024
Via arXiv

InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting

Mar 05, 2024
Via arXiv

OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models

Oct 25, 2023
Via arXiv

MPPN: Multi-Resolution Periodic Pattern Network For Long-Term Time Series Forecasting

Jun 12, 2023
Via arXiv