Picture for Han Guo

Han Guo

Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning

Add code
Jan 04, 2024
Figure 1 for Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning
Figure 2 for Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning
Figure 3 for Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning
Figure 4 for Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning
Viaarxiv icon

LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning

Add code
Nov 20, 2023
Figure 1 for LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
Figure 2 for LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
Figure 3 for LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
Figure 4 for LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
Viaarxiv icon

Recouple Event Field via Probabilistic Bias for Event Extraction

Add code
May 19, 2023
Figure 1 for Recouple Event Field via Probabilistic Bias for Event Extraction
Figure 2 for Recouple Event Field via Probabilistic Bias for Event Extraction
Figure 3 for Recouple Event Field via Probabilistic Bias for Event Extraction
Figure 4 for Recouple Event Field via Probabilistic Bias for Event Extraction
Viaarxiv icon

Federated Learning as Variational Inference: A Scalable Expectation Propagation Approach

Add code
Feb 08, 2023
Figure 1 for Federated Learning as Variational Inference: A Scalable Expectation Propagation Approach
Figure 2 for Federated Learning as Variational Inference: A Scalable Expectation Propagation Approach
Figure 3 for Federated Learning as Variational Inference: A Scalable Expectation Propagation Approach
Figure 4 for Federated Learning as Variational Inference: A Scalable Expectation Propagation Approach
Viaarxiv icon

TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities

Add code
Dec 13, 2022
Figure 1 for TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities
Figure 2 for TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities
Figure 3 for TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities
Figure 4 for TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities
Viaarxiv icon

MPCFormer: fast, performant and private Transformer inference with MPC

Add code
Nov 02, 2022
Viaarxiv icon

RLPrompt: Optimizing Discrete Text Prompts With Reinforcement Learning

Add code
May 25, 2022
Figure 1 for RLPrompt: Optimizing Discrete Text Prompts With Reinforcement Learning
Figure 2 for RLPrompt: Optimizing Discrete Text Prompts With Reinforcement Learning
Figure 3 for RLPrompt: Optimizing Discrete Text Prompts With Reinforcement Learning
Figure 4 for RLPrompt: Optimizing Discrete Text Prompts With Reinforcement Learning
Viaarxiv icon

Ligandformer: A Graph Neural Network for Predicting Compound Property with Robust Interpretation

Add code
Feb 24, 2022
Figure 1 for Ligandformer: A Graph Neural Network for Predicting Compound Property with Robust Interpretation
Figure 2 for Ligandformer: A Graph Neural Network for Predicting Compound Property with Robust Interpretation
Figure 3 for Ligandformer: A Graph Neural Network for Predicting Compound Property with Robust Interpretation
Figure 4 for Ligandformer: A Graph Neural Network for Predicting Compound Property with Robust Interpretation
Viaarxiv icon

Uncertainty Toolbox: an Open-Source Library for Assessing, Visualizing, and Improving Uncertainty Quantification

Add code
Sep 21, 2021
Figure 1 for Uncertainty Toolbox: an Open-Source Library for Assessing, Visualizing, and Improving Uncertainty Quantification
Figure 2 for Uncertainty Toolbox: an Open-Source Library for Assessing, Visualizing, and Improving Uncertainty Quantification
Figure 3 for Uncertainty Toolbox: an Open-Source Library for Assessing, Visualizing, and Improving Uncertainty Quantification
Figure 4 for Uncertainty Toolbox: an Open-Source Library for Assessing, Visualizing, and Improving Uncertainty Quantification
Viaarxiv icon

Text Generation with Efficient (Soft) Q-Learning

Add code
Jun 17, 2021
Figure 1 for Text Generation with Efficient (Soft) Q-Learning
Figure 2 for Text Generation with Efficient (Soft) Q-Learning
Figure 3 for Text Generation with Efficient (Soft) Q-Learning
Figure 4 for Text Generation with Efficient (Soft) Q-Learning
Viaarxiv icon