Alert button
Picture for Yang Liu

Yang Liu

Alert button

Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards

Add code
Bookmark button
Alert button
Mar 14, 2024
Wei Shen, Xiaoying Zhang, Yuanshun Yao, Rui Zheng, Hongyi Guo, Yang Liu

Figure 1 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Figure 2 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Figure 3 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Figure 4 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Viaarxiv icon

Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework

Add code
Bookmark button
Alert button
Mar 13, 2024
Jingling Li, Zeyu Tang, Xiaoyu Liu, Peter Spirtes, Kun Zhang, Liu Leqi, Yang Liu

Figure 1 for Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework
Figure 2 for Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework
Figure 3 for Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework
Figure 4 for Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework
Viaarxiv icon

StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models

Add code
Bookmark button
Alert button
Mar 13, 2024
Zhicheng Guo, Sijie Cheng, Hao Wang, Shihao Liang, Yujia Qin, Peng Li, Zhiyuan Liu, Maosong Sun, Yang Liu

Figure 1 for StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Figure 2 for StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Figure 3 for StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Figure 4 for StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Viaarxiv icon

Learning to Watermark LLM-generated Text via Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 13, 2024
Xiaojun Xu, Yuanshun Yao, Yang Liu

Figure 1 for Learning to Watermark LLM-generated Text via Reinforcement Learning
Figure 2 for Learning to Watermark LLM-generated Text via Reinforcement Learning
Figure 3 for Learning to Watermark LLM-generated Text via Reinforcement Learning
Figure 4 for Learning to Watermark LLM-generated Text via Reinforcement Learning
Viaarxiv icon

ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval

Add code
Bookmark button
Alert button
Mar 11, 2024
Yuanhang Zheng, Peng Li, Wei Liu, Yang Liu, Jian Luan, Bin Wang

Figure 1 for ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval
Figure 2 for ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval
Figure 3 for ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval
Figure 4 for ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval
Viaarxiv icon

SuPRA: Surgical Phase Recognition and Anticipation for Intra-Operative Planning

Add code
Bookmark button
Alert button
Mar 10, 2024
Maxence Boels, Yang Liu, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin

Figure 1 for SuPRA: Surgical Phase Recognition and Anticipation for Intra-Operative Planning
Figure 2 for SuPRA: Surgical Phase Recognition and Anticipation for Intra-Operative Planning
Figure 3 for SuPRA: Surgical Phase Recognition and Anticipation for Intra-Operative Planning
Figure 4 for SuPRA: Surgical Phase Recognition and Anticipation for Intra-Operative Planning
Viaarxiv icon

Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

Add code
Bookmark button
Alert button
Mar 08, 2024
Xiaoying Zhang, Jean-Francois Ton, Wei Shen, Hongning Wang, Yang Liu

Figure 1 for Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Figure 2 for Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Figure 3 for Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Viaarxiv icon

A Concept-based Interpretable Model for the Diagnosis of Choroid Neoplasias using Multimodal Data

Add code
Bookmark button
Alert button
Mar 08, 2024
Yifan Wu, Yang Liu, Yue Yang, Michael S. Yao, Wenli Yang, Xuehui Shi, Lihong Yang, Dongjun Li, Yueming Liu, James C. Gee, Xuan Yang, Wenbin Wei, Shi Gu

Figure 1 for A Concept-based Interpretable Model for the Diagnosis of Choroid Neoplasias using Multimodal Data
Figure 2 for A Concept-based Interpretable Model for the Diagnosis of Choroid Neoplasias using Multimodal Data
Figure 3 for A Concept-based Interpretable Model for the Diagnosis of Choroid Neoplasias using Multimodal Data
Figure 4 for A Concept-based Interpretable Model for the Diagnosis of Choroid Neoplasias using Multimodal Data
Viaarxiv icon

Towards Multimodal Sentiment Analysis Debiasing via Bias Purification

Add code
Bookmark button
Alert button
Mar 08, 2024
Dingkang Yang, Mingcheng Li, Dongling Xiao, Yang Liu, Kun Yang, Zhaoyu Chen, Yuzheng Wang, Peng Zhai, Ke Li, Lihua Zhang

Figure 1 for Towards Multimodal Sentiment Analysis Debiasing via Bias Purification
Figure 2 for Towards Multimodal Sentiment Analysis Debiasing via Bias Purification
Figure 3 for Towards Multimodal Sentiment Analysis Debiasing via Bias Purification
Figure 4 for Towards Multimodal Sentiment Analysis Debiasing via Bias Purification
Viaarxiv icon

On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder

Add code
Bookmark button
Alert button
Mar 06, 2024
Tingxu Han, Shenghan Huang, Ziqi Ding, Weisong Sun, Yebo Feng, Chunrong Fang, Jun Li, Hanwei Qian, Cong Wu, Quanjun Zhang, Yang Liu, Zhenyu Chen

Figure 1 for On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder
Figure 2 for On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder
Figure 3 for On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder
Figure 4 for On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder
Viaarxiv icon