Picture for Yixiao Li

Yixiao Li

Adaptive Preference Scaling for Reinforcement Learning with Human Feedback

Add code
Jun 04, 2024
Viaarxiv icon

DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking

Add code
Mar 25, 2024
Figure 1 for DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking
Figure 2 for DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking
Figure 3 for DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking
Figure 4 for DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking
Viaarxiv icon

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Add code
Oct 23, 2023
Figure 1 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 2 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 3 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 4 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Viaarxiv icon

Deep Reinforcement Learning from Hierarchical Weak Preference Feedback

Add code
Sep 06, 2023
Figure 1 for Deep Reinforcement Learning from Hierarchical Weak Preference Feedback
Figure 2 for Deep Reinforcement Learning from Hierarchical Weak Preference Feedback
Figure 3 for Deep Reinforcement Learning from Hierarchical Weak Preference Feedback
Figure 4 for Deep Reinforcement Learning from Hierarchical Weak Preference Feedback
Viaarxiv icon

LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation

Add code
Jun 26, 2023
Figure 1 for LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation
Figure 2 for LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation
Figure 3 for LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation
Figure 4 for LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation
Viaarxiv icon

A Review of Changepoint Detection Models

Add code
Aug 20, 2019
Viaarxiv icon