Shuohang Wang

Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning

Jul 02, 2024

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

May 29, 2024

Multi-LoRA Composition for Image Generation

Feb 26, 2024

SciAgent: Tool-augmented Language Models for Scientific Reasoning

Feb 21, 2024

Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models

Oct 19, 2023

The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions

Oct 19, 2023

Sparse Modular Activation for Efficient Sequence Modeling

Jul 11, 2023

In-Context Demonstration Selection with Cross Entropy Difference

May 24, 2023

PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents

May 23, 2023

InheritSumm: A General, Versatile and Compact Summarizer by Distilling from GPT

May 22, 2023