
Yiqun Yao


CatCode: A Comprehensive Evaluation Framework for LLMs On the Mixture of Code and Text

Mar 04, 2024
Zhenru Lin, Yiqun Yao, Yang Yuan

Via arXiv

FLM-101B: An Open LLM and How to Train It with $100K Budget

Sep 17, 2023
Xiang Li, Yiqun Yao, Xin Jiang, Xuezhi Fang, Xuying Meng, Siqi Fan, Peng Han, Jing Li, Li Du, Bowen Qin, Zheng Zhang, Aixin Sun, Yequan Wang

Via arXiv

2x Faster Language Model Pre-training via Masked Structural Growth

May 04, 2023
Yiqun Yao, Zheng Zhang, Jing Li, Yequan Wang

Via arXiv

Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales

Apr 29, 2023
Yiqun Yao, Yequan Wang

Via arXiv

MUSER: MUltimodal Stress Detection using Emotion Recognition as an Auxiliary Task

May 17, 2021
Yiqun Yao, Michalis Papakostas, Mihai Burzo, Mohamed Abouelenien, Rada Mihalcea

Via arXiv

Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks

Nov 15, 2018
Jing Shi, Jiaming Xu, Yiqun Yao, Bo Xu

Via arXiv

Cascaded Mutual Modulation for Visual Reasoning

Sep 06, 2018
Yiqun Yao, Jiaming Xu, Feng Wang, Bo Xu

Via arXiv

Hierarchical Memory Networks for Answer Selection on Unknown Words

Sep 28, 2016
Jiaming Xu, Jing Shi, Yiqun Yao, Suncong Zheng, Bo Xu, Bo Xu

Via arXiv