Picture for Ziyue Yang

Ziyue Yang

Sigma-MoE-Tiny Technical Report

Add code
Dec 19, 2025
Viaarxiv icon

SIGMA: An AI-Empowered Training Stack on Early-Life Hardware

Add code
Dec 15, 2025
Viaarxiv icon

ProgRM: Build Better GUI Agents with Progress Rewards

Add code
May 23, 2025
Viaarxiv icon

MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications

Add code
Apr 11, 2025
Figure 1 for MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications
Figure 2 for MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications
Figure 3 for MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications
Figure 4 for MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications
Viaarxiv icon

Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection

Add code
Feb 12, 2025
Figure 1 for Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection
Figure 2 for Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection
Figure 3 for Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection
Figure 4 for Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection
Viaarxiv icon

Optimizing Large Language Model Training Using FP4 Quantization

Add code
Jan 28, 2025
Figure 1 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 2 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 3 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 4 for Optimizing Large Language Model Training Using FP4 Quantization
Viaarxiv icon

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Add code
Jan 23, 2025
Figure 1 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 2 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 3 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 4 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Viaarxiv icon

SPFresh: Incremental In-Place Update for Billion-Scale Vector Search

Add code
Oct 18, 2024
Figure 1 for SPFresh: Incremental In-Place Update for Billion-Scale Vector Search
Figure 2 for SPFresh: Incremental In-Place Update for Billion-Scale Vector Search
Figure 3 for SPFresh: Incremental In-Place Update for Billion-Scale Vector Search
Figure 4 for SPFresh: Incremental In-Place Update for Billion-Scale Vector Search
Viaarxiv icon

ForestColl: Efficient Collective Communications on Heterogeneous Network Fabrics

Add code
Feb 09, 2024
Figure 1 for ForestColl: Efficient Collective Communications on Heterogeneous Network Fabrics
Figure 2 for ForestColl: Efficient Collective Communications on Heterogeneous Network Fabrics
Figure 3 for ForestColl: Efficient Collective Communications on Heterogeneous Network Fabrics
Figure 4 for ForestColl: Efficient Collective Communications on Heterogeneous Network Fabrics
Viaarxiv icon

FP8-LM: Training FP8 Large Language Models

Add code
Oct 27, 2023
Figure 1 for FP8-LM: Training FP8 Large Language Models
Figure 2 for FP8-LM: Training FP8 Large Language Models
Figure 3 for FP8-LM: Training FP8 Large Language Models
Figure 4 for FP8-LM: Training FP8 Large Language Models
Viaarxiv icon