Picture for Zhao Zhong

Zhao Zhong

HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation

Add code
Aug 23, 2025
Viaarxiv icon

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

Add code
Jul 29, 2025
Viaarxiv icon

MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Add code
Jul 29, 2025
Viaarxiv icon

PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion

Add code
Dec 29, 2023
Figure 1 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Figure 2 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Figure 3 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Figure 4 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Viaarxiv icon

Learning Low-Rank Representations for Model Compression

Add code
Nov 21, 2022
Viaarxiv icon

EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones

Add code
Nov 17, 2022
Figure 1 for EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones
Figure 2 for EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones
Figure 3 for EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones
Figure 4 for EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones
Viaarxiv icon

Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs

Add code
Jul 08, 2021
Figure 1 for Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs
Figure 2 for Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs
Figure 3 for Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs
Figure 4 for Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs
Viaarxiv icon

Learning specialized activation functions with the Piecewise Linear Unit

Add code
Apr 08, 2021
Figure 1 for Learning specialized activation functions with the Piecewise Linear Unit
Figure 2 for Learning specialized activation functions with the Piecewise Linear Unit
Figure 3 for Learning specialized activation functions with the Piecewise Linear Unit
Figure 4 for Learning specialized activation functions with the Piecewise Linear Unit
Viaarxiv icon

FixNorm: Dissecting Weight Decay for Training Deep Neural Networks

Add code
Mar 29, 2021
Figure 1 for FixNorm: Dissecting Weight Decay for Training Deep Neural Networks
Figure 2 for FixNorm: Dissecting Weight Decay for Training Deep Neural Networks
Figure 3 for FixNorm: Dissecting Weight Decay for Training Deep Neural Networks
Figure 4 for FixNorm: Dissecting Weight Decay for Training Deep Neural Networks
Viaarxiv icon

AutoBSS: An Efficient Algorithm for Block Stacking Style Search

Add code
Oct 20, 2020
Figure 1 for AutoBSS: An Efficient Algorithm for Block Stacking Style Search
Figure 2 for AutoBSS: An Efficient Algorithm for Block Stacking Style Search
Figure 3 for AutoBSS: An Efficient Algorithm for Block Stacking Style Search
Figure 4 for AutoBSS: An Efficient Algorithm for Block Stacking Style Search
Viaarxiv icon