Huizhuo Yuan

Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression

May 23, 2025

On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

May 23, 2025

Tensor Product Attention Is All You Need

Jan 11, 2025

Towards Simple and Provable Parameter-Free Adaptive Gradient Methods

Dec 27, 2024

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Nov 15, 2024

Accelerated Preference Optimization for Large Language Model Alignment

Oct 08, 2024

Self-Play Preference Optimization for Language Model Alignment

May 01, 2024

Protein Conformation Generation via Force-Guided SE(3) Diffusion Models

Mar 21, 2024

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Feb 15, 2024

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Jan 02, 2024