Alert button
Picture for Yijia Zhang

Yijia Zhang

Alert button

BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation

Add code
Bookmark button
Alert button
Feb 16, 2024
Dayou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu

Viaarxiv icon

CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

Add code
Bookmark button
Alert button
Nov 28, 2023
Jinfeng Zhou, Zhuang Chen, Dazhen Wan, Bosi Wen, Yi Song, Jifan Yu, Yongkang Huang, Libiao Peng, Jiaming Yang, Xiyao Xiao, Sahand Sabour, Xiaohan Zhang, Wenjing Hou, Yijia Zhang, Yuxiao Dong, Jie Tang, Minlie Huang

Figure 1 for CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
Figure 2 for CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
Figure 3 for CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
Figure 4 for CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
Viaarxiv icon

AFPQ: Asymmetric Floating Point Quantization for LLMs

Add code
Bookmark button
Alert button
Nov 03, 2023
Yijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu

Figure 1 for AFPQ: Asymmetric Floating Point Quantization for LLMs
Figure 2 for AFPQ: Asymmetric Floating Point Quantization for LLMs
Figure 3 for AFPQ: Asymmetric Floating Point Quantization for LLMs
Figure 4 for AFPQ: Asymmetric Floating Point Quantization for LLMs
Viaarxiv icon

A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations

Add code
Bookmark button
Alert button
Oct 31, 2023
Hui Ma, Jian Wang, Hongfei Lin, Bo Zhang, Yijia Zhang, Bo Xu

Viaarxiv icon

TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching

Add code
Bookmark button
Alert button
Aug 29, 2023
Yun Liao, Yide Di, Hao Zhou, Kaijun Zhu, Mingyu Lu, Yijia Zhang, Qing Duan, Junhui Liu

Figure 1 for TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching
Figure 2 for TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching
Figure 3 for TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching
Figure 4 for TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching
Viaarxiv icon

Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training

Add code
Bookmark button
Alert button
May 31, 2023
Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu

Figure 1 for Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training
Figure 2 for Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training
Figure 3 for Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training
Figure 4 for Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training
Viaarxiv icon

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models

Add code
Bookmark button
Alert button
May 21, 2023
Yijia Zhang, Lingran Zhao, Shijie Cao, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu

Figure 1 for Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
Figure 2 for Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
Figure 3 for Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
Figure 4 for Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
Viaarxiv icon

TC-GAT: Graph Attention Network for Temporal Causality Discovery

Add code
Bookmark button
Alert button
Apr 21, 2023
Xiaosong Yuan, Ke Chen, Wanli Zuo, Yijia Zhang

Figure 1 for TC-GAT: Graph Attention Network for Temporal Causality Discovery
Figure 2 for TC-GAT: Graph Attention Network for Temporal Causality Discovery
Figure 3 for TC-GAT: Graph Attention Network for Temporal Causality Discovery
Figure 4 for TC-GAT: Graph Attention Network for Temporal Causality Discovery
Viaarxiv icon

Exploring Semi-supervised Variational Autoencoders for Biomedical Relation Extraction

Add code
Bookmark button
Alert button
Jan 18, 2019
Yijia Zhang, Zhiyong Lu

Figure 1 for Exploring Semi-supervised Variational Autoencoders for Biomedical Relation Extraction
Figure 2 for Exploring Semi-supervised Variational Autoencoders for Biomedical Relation Extraction
Figure 3 for Exploring Semi-supervised Variational Autoencoders for Biomedical Relation Extraction
Figure 4 for Exploring Semi-supervised Variational Autoencoders for Biomedical Relation Extraction
Viaarxiv icon