Picture for Lin Ju

Lin Ju

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs

Add code
Mar 07, 2025
Figure 1 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 2 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 3 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 4 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Viaarxiv icon

Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts

Add code
May 30, 2024
Figure 1 for Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts
Figure 2 for Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts
Figure 3 for Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts
Figure 4 for Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts
Viaarxiv icon

AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes

Add code
Apr 15, 2024
Figure 1 for AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes
Figure 2 for AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes
Figure 3 for AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes
Figure 4 for AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes
Viaarxiv icon

AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster

Add code
Apr 15, 2024
Figure 1 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Figure 2 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Figure 3 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Figure 4 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Viaarxiv icon

M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining

Add code
Feb 04, 2024
Figure 1 for M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining
Figure 2 for M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining
Figure 3 for M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining
Figure 4 for M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining
Viaarxiv icon

G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems

Add code
Jan 09, 2024
Figure 1 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 2 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 3 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 4 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Viaarxiv icon

An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training

Add code
Dec 19, 2023
Figure 1 for An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training
Figure 2 for An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training
Figure 3 for An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training
Figure 4 for An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training
Viaarxiv icon

Rethinking Memory and Communication Cost for Efficient Large Language Model Training

Add code
Oct 09, 2023
Figure 1 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 2 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 3 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 4 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Viaarxiv icon

Trust in AutoML: Exploring Information Needs for Establishing Trust in Automated Machine Learning Systems

Add code
Jan 17, 2020
Figure 1 for Trust in AutoML: Exploring Information Needs for Establishing Trust in Automated Machine Learning Systems
Figure 2 for Trust in AutoML: Exploring Information Needs for Establishing Trust in Automated Machine Learning Systems
Figure 3 for Trust in AutoML: Exploring Information Needs for Establishing Trust in Automated Machine Learning Systems
Figure 4 for Trust in AutoML: Exploring Information Needs for Establishing Trust in Automated Machine Learning Systems
Viaarxiv icon