Yucheng Ding

Sigma-MoE-Tiny Technical Report

Dec 19, 2025
Via arXiv

SIGMA: An AI-Empowered Training Stack on Early-Life Hardware

Dec 15, 2025
Via arXiv

Recycling Pretrained Checkpoints: Orthogonal Growth of Mixture-of-Experts for Efficient Large Language Model Pre-Training

Oct 09, 2025
Via arXiv

Training Matryoshka Mixture-of-Experts for Elastic Inference-Time Expert Utilization

Sep 30, 2025
Via arXiv

Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models

May 12, 2025
Via arXiv

Collaborative Learning of On-Device Small Model and Cloud-Based Large Model: Advances and Future Directions

Apr 17, 2025
Via arXiv

Personalized Language Model Learning on Text Data Without User Identifiers

Jan 10, 2025
Via arXiv

DC-CCL: Device-Cloud Collaborative Controlled Learning for Large Vision Models

Mar 18, 2023
Via arXiv

Federated Submodel Averaging

Sep 28, 2021
Via arXiv

Distributed Optimization over Block-Cyclic Data

Feb 18, 2020
Via arXiv