Changxin Tian

Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models

Jul 24, 2025
Via arXiv

WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-training

Jul 23, 2025
Via arXiv

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs

Mar 07, 2025
Via arXiv

Can Small Language Models be Good Reasoners for Sequential Recommendation?

Mar 07, 2024
Via arXiv

RecBole 2.0: Towards a More Up-to-Date Recommendation Library

Jun 16, 2022
Via arXiv

Improving Graph Collaborative Filtering with Neighborhood-enriched Contrastive Learning

Feb 15, 2022
Via arXiv