Picture for Zeyu Zheng

Zeyu Zheng

Normalization and effective learning rates in reinforcement learning

Add code
Jul 01, 2024
Figure 1 for Normalization and effective learning rates in reinforcement learning
Figure 2 for Normalization and effective learning rates in reinforcement learning
Figure 3 for Normalization and effective learning rates in reinforcement learning
Figure 4 for Normalization and effective learning rates in reinforcement learning
Viaarxiv icon

Daily Physical Activity Monitoring -- Adaptive Learning from Multi-source Motion Sensor Data

Add code
May 26, 2024
Viaarxiv icon

Understanding the performance gap between online and offline alignment algorithms

Add code
May 14, 2024
Viaarxiv icon

Large Language Model Enhanced Machine Learning Estimators for Classification

Add code
May 08, 2024
Figure 1 for Large Language Model Enhanced Machine Learning Estimators for Classification
Figure 2 for Large Language Model Enhanced Machine Learning Estimators for Classification
Viaarxiv icon

Collaborative Intelligence in Sequential Experiments: A Human-in-the-Loop Framework for Drug Discovery

Add code
May 07, 2024
Figure 1 for Collaborative Intelligence in Sequential Experiments: A Human-in-the-Loop Framework for Drug Discovery
Figure 2 for Collaborative Intelligence in Sequential Experiments: A Human-in-the-Loop Framework for Drug Discovery
Figure 3 for Collaborative Intelligence in Sequential Experiments: A Human-in-the-Loop Framework for Drug Discovery
Figure 4 for Collaborative Intelligence in Sequential Experiments: A Human-in-the-Loop Framework for Drug Discovery
Viaarxiv icon

Language Model Prompt Selection via Simulation Optimization

Add code
Apr 12, 2024
Figure 1 for Language Model Prompt Selection via Simulation Optimization
Figure 2 for Language Model Prompt Selection via Simulation Optimization
Figure 3 for Language Model Prompt Selection via Simulation Optimization
Figure 4 for Language Model Prompt Selection via Simulation Optimization
Viaarxiv icon

Human Alignment of Large Language Models through Online Preference Optimisation

Add code
Mar 13, 2024
Figure 1 for Human Alignment of Large Language Models through Online Preference Optimisation
Figure 2 for Human Alignment of Large Language Models through Online Preference Optimisation
Figure 3 for Human Alignment of Large Language Models through Online Preference Optimisation
Figure 4 for Human Alignment of Large Language Models through Online Preference Optimisation
Viaarxiv icon

Disentangling the Causes of Plasticity Loss in Neural Networks

Add code
Feb 29, 2024
Figure 1 for Disentangling the Causes of Plasticity Loss in Neural Networks
Figure 2 for Disentangling the Causes of Plasticity Loss in Neural Networks
Figure 3 for Disentangling the Causes of Plasticity Loss in Neural Networks
Figure 4 for Disentangling the Causes of Plasticity Loss in Neural Networks
Viaarxiv icon

Generalized Preference Optimization: A Unified Approach to Offline Alignment

Add code
Feb 08, 2024
Figure 1 for Generalized Preference Optimization: A Unified Approach to Offline Alignment
Figure 2 for Generalized Preference Optimization: A Unified Approach to Offline Alignment
Figure 3 for Generalized Preference Optimization: A Unified Approach to Offline Alignment
Figure 4 for Generalized Preference Optimization: A Unified Approach to Offline Alignment
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon