Alert button
Picture for Yufeng Zhang

Yufeng Zhang

Alert button

School of Artificial Intelligence, Sun Yat-sen University, Zhuhai 519082, Guangdong Key Laboratory of Big Data Analysis and Processing, 510006, China

$\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model

Add code
Bookmark button
Alert button
Mar 11, 2024
Yufeng Zhang, Liyu Chen, Boyi Liu, Yingxiang Yang, Qiwen Cui, Yunzhe Tao, Hongxia Yang

Figure 1 for $\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 2 for $\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 3 for $\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 4 for $\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Viaarxiv icon

Can Large Language Models Play Games? A Case Study of A Self-Play Approach

Add code
Bookmark button
Alert button
Mar 08, 2024
Hongyi Guo, Zhihan Liu, Yufeng Zhang, Zhaoran Wang

Figure 1 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Figure 2 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Figure 3 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Figure 4 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Viaarxiv icon

CPT: Competence-progressive Training Strategy for Few-shot Node Classification

Add code
Bookmark button
Alert button
Feb 01, 2024
Qilong Yan, Yufeng Zhang, Jinghao Zhang, Jingpu Duan, Jian Yin

Viaarxiv icon

Answering Subjective Induction Questions on Products by Summarizing Multi-sources Multi-viewpoints Knowledge

Add code
Bookmark button
Alert button
Sep 12, 2023
Yufeng Zhang, Meng-xiang Wang, Jianxing Yu

Viaarxiv icon

Lifelike Agility and Play on Quadrupedal Robots using Reinforcement Learning and Generative Pre-trained Models

Add code
Bookmark button
Alert button
Aug 29, 2023
Lei Han, Qingxu Zhu, Jiapeng Sheng, Chong Zhang, Tingguang Li, Yizheng Zhang, He Zhang, Yuzhen Liu, Cheng Zhou, Rui Zhao, Jie Li, Yufeng Zhang, Rui Wang, Wanchao Chi, Xiong Li, Yonghui Zhu, Lingzhu Xiang, Xiao Teng, Zhengyou Zhang

Figure 1 for Lifelike Agility and Play on Quadrupedal Robots using Reinforcement Learning and Generative Pre-trained Models
Figure 2 for Lifelike Agility and Play on Quadrupedal Robots using Reinforcement Learning and Generative Pre-trained Models
Figure 3 for Lifelike Agility and Play on Quadrupedal Robots using Reinforcement Learning and Generative Pre-trained Models
Figure 4 for Lifelike Agility and Play on Quadrupedal Robots using Reinforcement Learning and Generative Pre-trained Models
Viaarxiv icon

Robust and Efficient Fault Diagnosis of mm-Wave Active Phased Arrays using Baseband Signal

Add code
Bookmark button
Alert button
Jun 07, 2023
Martin H. Nielsen, Yufeng Zhang, Changbin Xue, Jian Ren, Yingzeng Yin, Ming Shen, Gert F. Pedersen

Figure 1 for Robust and Efficient Fault Diagnosis of mm-Wave Active Phased Arrays using Baseband Signal
Figure 2 for Robust and Efficient Fault Diagnosis of mm-Wave Active Phased Arrays using Baseband Signal
Figure 3 for Robust and Efficient Fault Diagnosis of mm-Wave Active Phased Arrays using Baseband Signal
Figure 4 for Robust and Efficient Fault Diagnosis of mm-Wave Active Phased Arrays using Baseband Signal
Viaarxiv icon

What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization

Add code
Bookmark button
Alert button
May 30, 2023
Yufeng Zhang, Fengzhuo Zhang, Zhuoran Yang, Zhaoran Wang

Figure 1 for What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization
Viaarxiv icon

An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models

Add code
Bookmark button
Alert button
Dec 30, 2022
Yufeng Zhang, Boyi Liu, Qi Cai, Lingxiao Wang, Zhaoran Wang

Figure 1 for An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models
Figure 2 for An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models
Figure 3 for An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models
Figure 4 for An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models
Viaarxiv icon

Image-Text Retrieval with Binary and Continuous Label Supervision

Add code
Bookmark button
Alert button
Oct 20, 2022
Zheng Li, Caili Guo, Zerun Feng, Jenq-Neng Hwang, Ying Jin, Yufeng Zhang

Figure 1 for Image-Text Retrieval with Binary and Continuous Label Supervision
Figure 2 for Image-Text Retrieval with Binary and Continuous Label Supervision
Figure 3 for Image-Text Retrieval with Binary and Continuous Label Supervision
Figure 4 for Image-Text Retrieval with Binary and Continuous Label Supervision
Viaarxiv icon

Disconnected Emerging Knowledge Graph Oriented Inductive Link Prediction

Add code
Bookmark button
Alert button
Sep 03, 2022
Yufeng Zhang, Weiqing Wang, Hongzhi Yin, Pengpeng Zhao, Wei Chen, Lei Zhao

Figure 1 for Disconnected Emerging Knowledge Graph Oriented Inductive Link Prediction
Figure 2 for Disconnected Emerging Knowledge Graph Oriented Inductive Link Prediction
Figure 3 for Disconnected Emerging Knowledge Graph Oriented Inductive Link Prediction
Figure 4 for Disconnected Emerging Knowledge Graph Oriented Inductive Link Prediction
Viaarxiv icon