Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhizhan Zheng

Multi-Gate Residuals

May 22, 2026

Zhizhan Zheng, Feiyun Zhang, Shuchun Liu, Tian Xia, Xi Liu, Dasheng Hu, Hongquan Zhou

Abstract:While Attention Residuals has shown some effectiveness in addressing the widespread issue of unbounded activation growth across deep residual layers, it inevitably incurs significant communication overhead. To circumvent this bottleneck, we propose Multi-Gate Residuals (MGR), which stabilizes activation scales without additional communication burden. It utilizes a straightforward scoring and gating mechanism to maintain multi-stream context, coupled with Attention Pooling to extract hidden states from the stream states. Empirical experiments demonstrate that MGR is practical for large-scale training and deployment, offering tangible performance improvements over existing architectures.

Via

Access Paper or Ask Questions

Coconditional Autoencoding Adversarial Networks for Chinese Font Feature Learning

Dec 12, 2018

Zhizhan Zheng, Feiyun Zhang

Figure 1 for Coconditional Autoencoding Adversarial Networks for Chinese Font Feature Learning

Figure 2 for Coconditional Autoencoding Adversarial Networks for Chinese Font Feature Learning

Figure 3 for Coconditional Autoencoding Adversarial Networks for Chinese Font Feature Learning

Figure 4 for Coconditional Autoencoding Adversarial Networks for Chinese Font Feature Learning

Abstract:In this work, we propose a novel framework named Coconditional Autoencoding Adversarial Networks (CocoAAN) for Chinese font learning, which jointly learns a generation network and two encoding networks of different feature domains using an adversarial process. The encoding networks map the glyph images into style and content features respectively via the pairwise substitution optimization strategy, and the generation network maps these two kinds of features to glyph samples. Together with a discriminative network conditioned on the extracted features, our framework succeeds in producing realistic-looking Chinese glyph images flexibly. Unlike previous models relying on the complex segmentation of Chinese components or strokes, our model can "parse" structures in an unsupervised way, through which the content feature representation of each character is captured. Experiments demonstrate our framework has a powerful generalization capacity to other unseen fonts and characters.

Via

Access Paper or Ask Questions