Alert button
Picture for Xuanjing Huang

Xuanjing Huang

Alert button

Making Harmful Behaviors Unlearnable for Large Language Models

Add code
Bookmark button
Alert button
Nov 02, 2023
Xin Zhou, Yi Lu, Ruotian Ma, Tao Gui, Qi Zhang, Xuanjing Huang

Figure 1 for Making Harmful Behaviors Unlearnable for Large Language Models
Figure 2 for Making Harmful Behaviors Unlearnable for Large Language Models
Figure 3 for Making Harmful Behaviors Unlearnable for Large Language Models
Figure 4 for Making Harmful Behaviors Unlearnable for Large Language Models
Viaarxiv icon

Tailoring Personality Traits in Large Language Models via Unsupervisedly-Built Personalized Lexicons

Add code
Bookmark button
Alert button
Oct 25, 2023
Tianlong Li, Xiaoqing Zheng, Xuanjing Huang

Viaarxiv icon

DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning

Add code
Bookmark button
Alert button
Oct 25, 2023
Wei Chen, Qiushi Wang, Zefei Long, Xianyin Zhang, Zhongtian Lu, Bingxuan Li, Siyuan Wang, Jiarong Xu, Xiang Bai, Xuanjing Huang, Zhongyu Wei

Figure 1 for DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning
Figure 2 for DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning
Figure 3 for DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning
Figure 4 for DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning
Viaarxiv icon

Unveiling A Core Linguistic Region in Large Language Models

Add code
Bookmark button
Alert button
Oct 23, 2023
Jun Zhao, Zhihao Zhang, Yide Ma, Qi Zhang, Tao Gui, Luhui Gao, Xuanjing Huang

Figure 1 for Unveiling A Core Linguistic Region in Large Language Models
Figure 2 for Unveiling A Core Linguistic Region in Large Language Models
Figure 3 for Unveiling A Core Linguistic Region in Large Language Models
Figure 4 for Unveiling A Core Linguistic Region in Large Language Models
Viaarxiv icon

Orthogonal Subspace Learning for Language Model Continual Learning

Add code
Bookmark button
Alert button
Oct 22, 2023
Xiao Wang, Tianze Chen, Qiming Ge, Han Xia, Rong Bao, Rui Zheng, Qi Zhang, Tao Gui, Xuanjing Huang

Figure 1 for Orthogonal Subspace Learning for Language Model Continual Learning
Figure 2 for Orthogonal Subspace Learning for Language Model Continual Learning
Figure 3 for Orthogonal Subspace Learning for Language Model Continual Learning
Figure 4 for Orthogonal Subspace Learning for Language Model Continual Learning
Viaarxiv icon

Are Structural Concepts Universal in Transformer Language Models? Towards Interpretable Cross-Lingual Generalization

Add code
Bookmark button
Alert button
Oct 19, 2023
Ningyu Xu, Qi Zhang, Jingting Ye, Menghan Zhang, Xuanjing Huang

Figure 1 for Are Structural Concepts Universal in Transformer Language Models? Towards Interpretable Cross-Lingual Generalization
Figure 2 for Are Structural Concepts Universal in Transformer Language Models? Towards Interpretable Cross-Lingual Generalization
Figure 3 for Are Structural Concepts Universal in Transformer Language Models? Towards Interpretable Cross-Lingual Generalization
Figure 4 for Are Structural Concepts Universal in Transformer Language Models? Towards Interpretable Cross-Lingual Generalization
Viaarxiv icon

Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback

Add code
Bookmark button
Alert button
Oct 19, 2023
Wei Shen, Rui Zheng, Wenyu Zhan, Jun Zhao, Shihan Dou, Tao Gui, Qi Zhang, Xuanjing Huang

Figure 1 for Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
Figure 2 for Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
Figure 3 for Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
Figure 4 for Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
Viaarxiv icon

Improving Generalization of Alignment with Human Preferences through Group Invariant Learning

Add code
Bookmark button
Alert button
Oct 19, 2023
Rui Zheng, Wei Shen, Yuan Hua, Wenbin Lai, Shihan Dou, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Haoran Huang, Tao Gui, Qi Zhang, Xuanjing Huang

Figure 1 for Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
Figure 2 for Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
Figure 3 for Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
Figure 4 for Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
Viaarxiv icon

RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms

Add code
Bookmark button
Alert button
Oct 17, 2023
Enyu Zhou, Rui Zheng, Zhiheng Xi, Songyang Gao, Xiaoran Fan, Zichu Fei, Jingting Ye, Tao Gui, Qi Zhang, Xuanjing Huang

Figure 1 for RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms
Figure 2 for RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms
Figure 3 for RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms
Figure 4 for RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms
Viaarxiv icon

ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks

Add code
Bookmark button
Alert button
Oct 17, 2023
Zejun Li, Ye Wang, Mengfei Du, Qingwen Liu, Binhao Wu, Jiwen Zhang, Chengxing Zhou, Zhihao Fan, Jie Fu, Jingjing Chen, Xuanjing Huang, Zhongyu Wei

Figure 1 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Figure 2 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Figure 3 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Figure 4 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Viaarxiv icon