Picture for Rui Zheng

Rui Zheng

What's Wrong with Your Code Generated by Large Language Models? An Extensive Study

Add code
Jul 08, 2024
Viaarxiv icon

SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance

Add code
Jun 26, 2024
Viaarxiv icon

Aligning Large Language Models from Self-Reference AI Feedback with one General Principle

Add code
Jun 17, 2024
Figure 1 for Aligning Large Language Models from Self-Reference AI Feedback with one General Principle
Figure 2 for Aligning Large Language Models from Self-Reference AI Feedback with one General Principle
Figure 3 for Aligning Large Language Models from Self-Reference AI Feedback with one General Principle
Figure 4 for Aligning Large Language Models from Self-Reference AI Feedback with one General Principle
Viaarxiv icon

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model

Add code
Jun 17, 2024
Figure 1 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Figure 2 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Figure 3 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Figure 4 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Viaarxiv icon

Toward Optimal LLM Alignments Using Two-Player Games

Add code
Jun 16, 2024
Figure 1 for Toward Optimal LLM Alignments Using Two-Player Games
Figure 2 for Toward Optimal LLM Alignments Using Two-Player Games
Figure 3 for Toward Optimal LLM Alignments Using Two-Player Games
Figure 4 for Toward Optimal LLM Alignments Using Two-Player Games
Viaarxiv icon

Uncertainty Aware Learning for Language Model Alignment

Add code
Jun 07, 2024
Viaarxiv icon

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Add code
Jun 06, 2024
Figure 1 for AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Figure 2 for AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Figure 3 for AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Figure 4 for AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Viaarxiv icon

Reliable Source Approximation: Source-Free Unsupervised Domain Adaptation for Vestibular Schwannoma MRI Segmentation

Add code
May 25, 2024
Figure 1 for Reliable Source Approximation: Source-Free Unsupervised Domain Adaptation for Vestibular Schwannoma MRI Segmentation
Figure 2 for Reliable Source Approximation: Source-Free Unsupervised Domain Adaptation for Vestibular Schwannoma MRI Segmentation
Figure 3 for Reliable Source Approximation: Source-Free Unsupervised Domain Adaptation for Vestibular Schwannoma MRI Segmentation
Figure 4 for Reliable Source Approximation: Source-Free Unsupervised Domain Adaptation for Vestibular Schwannoma MRI Segmentation
Viaarxiv icon

MetaRM: Shifted Distributions Alignment via Meta-Learning

Add code
May 01, 2024
Viaarxiv icon

Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals

Add code
Mar 24, 2024
Viaarxiv icon