Picture for Zhiyuan Fang

Zhiyuan Fang

DyMoE: Dynamic Expert Orchestration with Mixed-Precision Quantization for Efficient MoE Inference on Edge

Add code
Mar 19, 2026
Viaarxiv icon

FSVideo: Fast Speed Video Diffusion Model in a Highly-Compressed Latent Space

Add code
Feb 02, 2026
Viaarxiv icon

Accurate Expert Predictions in MoE Inference via Cross-Layer Gate

Add code
Feb 17, 2025
Figure 1 for Accurate Expert Predictions in MoE Inference via Cross-Layer Gate
Figure 2 for Accurate Expert Predictions in MoE Inference via Cross-Layer Gate
Figure 3 for Accurate Expert Predictions in MoE Inference via Cross-Layer Gate
Figure 4 for Accurate Expert Predictions in MoE Inference via Cross-Layer Gate
Viaarxiv icon

Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline

Add code
Feb 09, 2025
Figure 1 for Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline
Figure 2 for Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline
Figure 3 for Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline
Figure 4 for Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline
Viaarxiv icon

Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation

Add code
Mar 25, 2024
Figure 1 for Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation
Figure 2 for Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation
Figure 3 for Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation
Figure 4 for Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation
Viaarxiv icon

End-to-end Knowledge Retrieval with Multi-modal Queries

Add code
Jun 01, 2023
Figure 1 for End-to-end Knowledge Retrieval with Multi-modal Queries
Figure 2 for End-to-end Knowledge Retrieval with Multi-modal Queries
Figure 3 for End-to-end Knowledge Retrieval with Multi-modal Queries
Figure 4 for End-to-end Knowledge Retrieval with Multi-modal Queries
Viaarxiv icon

Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation

Add code
Nov 15, 2022
Figure 1 for Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation
Figure 2 for Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation
Figure 3 for Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation
Figure 4 for Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation
Viaarxiv icon

Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos

Add code
Apr 28, 2022
Figure 1 for Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Figure 2 for Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Figure 3 for Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Figure 4 for Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Viaarxiv icon

Injecting Semantic Concepts into End-to-End Image Captioning

Add code
Dec 09, 2021
Figure 1 for Injecting Semantic Concepts into End-to-End Image Captioning
Figure 2 for Injecting Semantic Concepts into End-to-End Image Captioning
Figure 3 for Injecting Semantic Concepts into End-to-End Image Captioning
Figure 4 for Injecting Semantic Concepts into End-to-End Image Captioning
Viaarxiv icon

Compressing Visual-linguistic Model via Knowledge Distillation

Add code
Apr 05, 2021
Figure 1 for Compressing Visual-linguistic Model via Knowledge Distillation
Figure 2 for Compressing Visual-linguistic Model via Knowledge Distillation
Figure 3 for Compressing Visual-linguistic Model via Knowledge Distillation
Figure 4 for Compressing Visual-linguistic Model via Knowledge Distillation
Viaarxiv icon