Picture for Shuqi Ke

Shuqi Ke

On the Convergence of Differentially-Private Fine-tuning: To Linearly Probe or to Fully Fine-tune?

Add code
Feb 29, 2024
Figure 1 for On the Convergence of Differentially-Private Fine-tuning: To Linearly Probe or to Fully Fine-tune?
Figure 2 for On the Convergence of Differentially-Private Fine-tuning: To Linearly Probe or to Fully Fine-tune?
Figure 3 for On the Convergence of Differentially-Private Fine-tuning: To Linearly Probe or to Fully Fine-tune?
Viaarxiv icon

How Can LLM Guide RL? A Value-Based Approach

Add code
Feb 25, 2024
Figure 1 for How Can LLM Guide RL? A Value-Based Approach
Figure 2 for How Can LLM Guide RL? A Value-Based Approach
Figure 3 for How Can LLM Guide RL? A Value-Based Approach
Figure 4 for How Can LLM Guide RL? A Value-Based Approach
Viaarxiv icon

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Add code
Oct 11, 2023
Figure 1 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Figure 2 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Figure 3 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Figure 4 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Viaarxiv icon

Quantifying the Impact of Label Noise on Federated Learning

Add code
Nov 15, 2022
Figure 1 for Quantifying the Impact of Label Noise on Federated Learning
Figure 2 for Quantifying the Impact of Label Noise on Federated Learning
Figure 3 for Quantifying the Impact of Label Noise on Federated Learning
Figure 4 for Quantifying the Impact of Label Noise on Federated Learning
Viaarxiv icon