Picture for Yuhao Zhou

Yuhao Zhou

What's Wrong with Your Code Generated by Large Language Models? An Extensive Study

Add code
Jul 08, 2024
Viaarxiv icon

Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations

Add code
Apr 24, 2024
Figure 1 for Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations
Figure 2 for Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations
Figure 3 for Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations
Figure 4 for Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations
Viaarxiv icon

Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals

Add code
Mar 24, 2024
Viaarxiv icon

Temporal Knowledge Graph Completion with Time-sensitive Relations in Hypercomplex Space

Add code
Mar 02, 2024
Figure 1 for Temporal Knowledge Graph Completion with Time-sensitive Relations in Hypercomplex Space
Figure 2 for Temporal Knowledge Graph Completion with Time-sensitive Relations in Hypercomplex Space
Figure 3 for Temporal Knowledge Graph Completion with Time-sensitive Relations in Hypercomplex Space
Figure 4 for Temporal Knowledge Graph Completion with Time-sensitive Relations in Hypercomplex Space
Viaarxiv icon

A Survey on Temporal Knowledge Graph: Representation Learning and Applications

Add code
Mar 02, 2024
Figure 1 for A Survey on Temporal Knowledge Graph: Representation Learning and Applications
Figure 2 for A Survey on Temporal Knowledge Graph: Representation Learning and Applications
Figure 3 for A Survey on Temporal Knowledge Graph: Representation Learning and Applications
Figure 4 for A Survey on Temporal Knowledge Graph: Representation Learning and Applications
Viaarxiv icon

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Add code
Feb 08, 2024
Viaarxiv icon

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

Add code
Feb 05, 2024
Viaarxiv icon

MouSi: Poly-Visual-Expert Vision-Language Models

Add code
Jan 30, 2024
Viaarxiv icon

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Add code
Jan 12, 2024
Figure 1 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 2 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 3 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 4 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Viaarxiv icon

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

Add code
Dec 18, 2023
Figure 1 for LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
Figure 2 for LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
Figure 3 for LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
Figure 4 for LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
Viaarxiv icon