Alert button
Picture for Yikang Shen

Yikang Shen

Alert button

JetMoE: Reaching Llama2 Performance with 0.1M Dollars

Add code
Bookmark button
Alert button
Apr 11, 2024
Yikang Shen, Zhen Guo, Tianle Cai, Zengyi Qin

Viaarxiv icon

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models

Add code
Bookmark button
Alert button
Apr 08, 2024
Bowen Pan, Yikang Shen, Haokun Liu, Mayank Mishra, Gaoyuan Zhang, Aude Oliva, Colin Raffel, Rameswar Panda

Viaarxiv icon

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Add code
Bookmark button
Alert button
Mar 14, 2024
Zhiqing Sun, Longhui Yu, Yikang Shen, Weiyang Liu, Yiming Yang, Sean Welleck, Chuang Gan

Figure 1 for Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Figure 2 for Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Figure 3 for Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Figure 4 for Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Viaarxiv icon

Scattered Mixture-of-Experts Implementation

Add code
Bookmark button
Alert button
Mar 13, 2024
Shawn Tan, Yikang Shen, Rameswar Panda, Aaron Courville

Figure 1 for Scattered Mixture-of-Experts Implementation
Figure 2 for Scattered Mixture-of-Experts Implementation
Figure 3 for Scattered Mixture-of-Experts Implementation
Figure 4 for Scattered Mixture-of-Experts Implementation
Viaarxiv icon

API Pack: A Massive Multilingual Dataset for API Call Generation

Add code
Bookmark button
Alert button
Feb 16, 2024
Zhen Guo, Adriana Meza Soria, Wei Sun, Yikang Shen, Rameswar Panda

Viaarxiv icon

Diversity Measurement and Subset Selection for Instruction Tuning Datasets

Add code
Bookmark button
Alert button
Feb 04, 2024
Peiqi Wang, Yikang Shen, Zhen Guo, Matthew Stallone, Yoon Kim, Polina Golland, Rameswar Panda

Viaarxiv icon

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble

Add code
Bookmark button
Alert button
Jan 30, 2024
Shun Zhang, Zhenfang Chen, Sunli Chen, Yikang Shen, Zhiqing Sun, Chuang Gan

Viaarxiv icon

Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models

Add code
Bookmark button
Alert button
Jan 19, 2024
Mayank Agarwal, Yikang Shen, Bailin Wang, Yoon Kim, Jie Chen

Viaarxiv icon

Gated Linear Attention Transformers with Hardware-Efficient Training

Add code
Bookmark button
Alert button
Dec 24, 2023
Songlin Yang, Bailin Wang, Yikang Shen, Rameswar Panda, Yoon Kim

Viaarxiv icon

CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding

Add code
Bookmark button
Alert button
Nov 06, 2023
Junyan Li, Delin Chen, Yining Hong, Zhenfang Chen, Peihao Chen, Yikang Shen, Chuang Gan

Viaarxiv icon