Alert button
Picture for Yuexin Wu

Yuexin Wu

Alert button

Distilling Text Style Transfer With Self-Explanation From LLMs

Add code
Bookmark button
Alert button
Mar 02, 2024
Chiyu Zhang, Honglong Cai, Yuezhang, Li, Yuexin Wu, Le Hou, Muhammad Abdul-Mageed

Figure 1 for Distilling Text Style Transfer With Self-Explanation From LLMs
Figure 2 for Distilling Text Style Transfer With Self-Explanation From LLMs
Figure 3 for Distilling Text Style Transfer With Self-Explanation From LLMs
Figure 4 for Distilling Text Style Transfer With Self-Explanation From LLMs
Viaarxiv icon

Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision

Add code
Bookmark button
Alert button
Feb 05, 2024
Zihan Wang, Yunxuan Li, Yuexin Wu, Liangchen Luo, Le Hou, Hongkun Yu, Jingbo Shang

Viaarxiv icon

Enable Language Models to Implicitly Learn Self-Improvement From Data

Add code
Bookmark button
Alert button
Oct 05, 2023
Ziqi Wang, Le Hou, Tianjian Lu, Yuexin Wu, Yunxuan Li, Hongkun Yu, Heng Ji

Figure 1 for Enable Language Models to Implicitly Learn Self-Improvement From Data
Figure 2 for Enable Language Models to Implicitly Learn Self-Improvement From Data
Figure 3 for Enable Language Models to Implicitly Learn Self-Improvement From Data
Figure 4 for Enable Language Models to Implicitly Learn Self-Improvement From Data
Viaarxiv icon

Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts

Add code
Bookmark button
Alert button
May 24, 2023
Sheng Shen, Le Hou, Yanqi Zhou, Nan Du, Shayne Longpre, Jason Wei, Hyung Won Chung, Barret Zoph, William Fedus, Xinyun Chen, Tu Vu, Yuexin Wu, Wuyang Chen, Albert Webson, Yunxuan Li, Vincent Zhao, Hongkun Yu, Kurt Keutzer, Trevor Darrell, Denny Zhou

Figure 1 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Figure 2 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Figure 3 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Figure 4 for Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts
Viaarxiv icon

Token Imbalance Adaptation for Radiology Report Generation

Add code
Bookmark button
Alert button
Apr 18, 2023
Yuexin Wu, I-Chan Huang, Xiaolei Huang

Figure 1 for Token Imbalance Adaptation for Radiology Report Generation
Figure 2 for Token Imbalance Adaptation for Radiology Report Generation
Figure 3 for Token Imbalance Adaptation for Radiology Report Generation
Figure 4 for Token Imbalance Adaptation for Radiology Report Generation
Viaarxiv icon

Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference

Add code
Bookmark button
Alert button
Apr 11, 2023
Tao Lei, Junwen Bai, Siddhartha Brahma, Joshua Ainslie, Kenton Lee, Yanqi Zhou, Nan Du, Vincent Y. Zhao, Yuexin Wu, Bo Li, Yu Zhang, Ming-Wei Chang

Figure 1 for Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Figure 2 for Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Figure 3 for Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Figure 4 for Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Viaarxiv icon

Unsupervised Reinforcement Adaptation for Class-Imbalanced Text Classification

Add code
Bookmark button
Alert button
May 26, 2022
Yuexin Wu, Xiaolei Huang

Figure 1 for Unsupervised Reinforcement Adaptation for Class-Imbalanced Text Classification
Figure 2 for Unsupervised Reinforcement Adaptation for Class-Imbalanced Text Classification
Figure 3 for Unsupervised Reinforcement Adaptation for Class-Imbalanced Text Classification
Figure 4 for Unsupervised Reinforcement Adaptation for Class-Imbalanced Text Classification
Viaarxiv icon

Token Dropping for Efficient BERT Pretraining

Add code
Bookmark button
Alert button
Mar 24, 2022
Le Hou, Richard Yuanzhe Pang, Tianyi Zhou, Yuexin Wu, Xinying Song, Xiaodan Song, Denny Zhou

Figure 1 for Token Dropping for Efficient BERT Pretraining
Figure 2 for Token Dropping for Efficient BERT Pretraining
Figure 3 for Token Dropping for Efficient BERT Pretraining
Figure 4 for Token Dropping for Efficient BERT Pretraining
Viaarxiv icon

Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance

Add code
Bookmark button
Alert button
Feb 24, 2022
Zhuoning Yuan, Yuexin Wu, Zihao Qiu, Xianzhi Du, Lijun Zhang, Denny Zhou, Tianbao Yang

Figure 1 for Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance
Figure 2 for Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance
Figure 3 for Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance
Figure 4 for Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance
Viaarxiv icon

Generalized Multi-Relational Graph Convolution Network

Add code
Bookmark button
Alert button
Jun 12, 2020
Donghan Yu, Yiming Yang, Ruohong Zhang, Yuexin Wu

Figure 1 for Generalized Multi-Relational Graph Convolution Network
Figure 2 for Generalized Multi-Relational Graph Convolution Network
Figure 3 for Generalized Multi-Relational Graph Convolution Network
Figure 4 for Generalized Multi-Relational Graph Convolution Network
Viaarxiv icon