Picture for Kunlong Chen

Kunlong Chen

Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models

Add code
Jul 24, 2025
Viaarxiv icon

WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-training

Add code
Jul 23, 2025
Viaarxiv icon

LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free Approach

Add code
Sep 17, 2024
Figure 1 for LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free Approach
Figure 2 for LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free Approach
Figure 3 for LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free Approach
Viaarxiv icon

GP-NAS-ensemble: a model for NAS Performance Prediction

Add code
Jan 23, 2023
Figure 1 for GP-NAS-ensemble: a model for NAS Performance Prediction
Figure 2 for GP-NAS-ensemble: a model for NAS Performance Prediction
Figure 3 for GP-NAS-ensemble: a model for NAS Performance Prediction
Figure 4 for GP-NAS-ensemble: a model for NAS Performance Prediction
Viaarxiv icon

DQN Control Solution for KDD Cup 2021 City Brain Challenge

Add code
Aug 14, 2021
Figure 1 for DQN Control Solution for KDD Cup 2021 City Brain Challenge
Figure 2 for DQN Control Solution for KDD Cup 2021 City Brain Challenge
Figure 3 for DQN Control Solution for KDD Cup 2021 City Brain Challenge
Figure 4 for DQN Control Solution for KDD Cup 2021 City Brain Challenge
Viaarxiv icon

Question Directed Graph Attention Network for Numerical Reasoning over Text

Add code
Sep 16, 2020
Figure 1 for Question Directed Graph Attention Network for Numerical Reasoning over Text
Figure 2 for Question Directed Graph Attention Network for Numerical Reasoning over Text
Figure 3 for Question Directed Graph Attention Network for Numerical Reasoning over Text
Figure 4 for Question Directed Graph Attention Network for Numerical Reasoning over Text
Viaarxiv icon

SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check

Add code
May 13, 2020
Figure 1 for SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check
Figure 2 for SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check
Figure 3 for SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check
Figure 4 for SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check
Viaarxiv icon

Symmetric Regularization based BERT for Pair-wise Semantic Reasoning

Add code
Sep 08, 2019
Figure 1 for Symmetric Regularization based BERT for Pair-wise Semantic Reasoning
Figure 2 for Symmetric Regularization based BERT for Pair-wise Semantic Reasoning
Figure 3 for Symmetric Regularization based BERT for Pair-wise Semantic Reasoning
Figure 4 for Symmetric Regularization based BERT for Pair-wise Semantic Reasoning
Viaarxiv icon

Toward Fast and Accurate Neural Chinese Word Segmentation with Multi-Criteria Learning

Add code
Mar 11, 2019
Figure 1 for Toward Fast and Accurate Neural Chinese Word Segmentation with Multi-Criteria Learning
Figure 2 for Toward Fast and Accurate Neural Chinese Word Segmentation with Multi-Criteria Learning
Figure 3 for Toward Fast and Accurate Neural Chinese Word Segmentation with Multi-Criteria Learning
Figure 4 for Toward Fast and Accurate Neural Chinese Word Segmentation with Multi-Criteria Learning
Viaarxiv icon

Convolutional Sequence to Sequence Non-intrusive Load Monitoring

Add code
Jun 06, 2018
Figure 1 for Convolutional Sequence to Sequence Non-intrusive Load Monitoring
Figure 2 for Convolutional Sequence to Sequence Non-intrusive Load Monitoring
Figure 3 for Convolutional Sequence to Sequence Non-intrusive Load Monitoring
Figure 4 for Convolutional Sequence to Sequence Non-intrusive Load Monitoring
Viaarxiv icon