Picture for Andrew Zhao

Andrew Zhao

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Add code
Apr 18, 2025
Viaarxiv icon

Optimizing Social Media Annotation of HPV Vaccine Skepticism and Misinformation Using Large Language Models: An Experimental Evaluation of In-Context Learning and Fine-Tuning Stance Detection Across Multiple Models

Add code
Nov 22, 2024
Figure 1 for Optimizing Social Media Annotation of HPV Vaccine Skepticism and Misinformation Using Large Language Models: An Experimental Evaluation of In-Context Learning and Fine-Tuning Stance Detection Across Multiple Models
Figure 2 for Optimizing Social Media Annotation of HPV Vaccine Skepticism and Misinformation Using Large Language Models: An Experimental Evaluation of In-Context Learning and Fine-Tuning Stance Detection Across Multiple Models
Figure 3 for Optimizing Social Media Annotation of HPV Vaccine Skepticism and Misinformation Using Large Language Models: An Experimental Evaluation of In-Context Learning and Fine-Tuning Stance Detection Across Multiple Models
Viaarxiv icon

Learning the structure of any Hamiltonian from minimal assumptions

Add code
Oct 29, 2024
Viaarxiv icon

LLM-based Optimization of Compound AI Systems: A Survey

Add code
Oct 21, 2024
Figure 1 for LLM-based Optimization of Compound AI Systems: A Survey
Figure 2 for LLM-based Optimization of Compound AI Systems: A Survey
Viaarxiv icon

Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing

Add code
Jul 11, 2024
Figure 1 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Figure 2 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Figure 3 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Figure 4 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Viaarxiv icon

Empowering Interdisciplinary Insights with Dynamic Graph Embedding Trajectories

Add code
Jun 25, 2024
Figure 1 for Empowering Interdisciplinary Insights with Dynamic Graph Embedding Trajectories
Figure 2 for Empowering Interdisciplinary Insights with Dynamic Graph Embedding Trajectories
Figure 3 for Empowering Interdisciplinary Insights with Dynamic Graph Embedding Trajectories
Figure 4 for Empowering Interdisciplinary Insights with Dynamic Graph Embedding Trajectories
Viaarxiv icon

DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints

Add code
May 29, 2024
Figure 1 for DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints
Figure 2 for DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints
Figure 3 for DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints
Figure 4 for DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints
Viaarxiv icon

Exploring Text-to-Motion Generation with Human Preference

Add code
Apr 15, 2024
Figure 1 for Exploring Text-to-Motion Generation with Human Preference
Figure 2 for Exploring Text-to-Motion Generation with Human Preference
Figure 3 for Exploring Text-to-Motion Generation with Human Preference
Figure 4 for Exploring Text-to-Motion Generation with Human Preference
Viaarxiv icon

Augmenting Unsupervised Reinforcement Learning with Self-Reference

Add code
Nov 16, 2023
Figure 1 for Augmenting Unsupervised Reinforcement Learning with Self-Reference
Figure 2 for Augmenting Unsupervised Reinforcement Learning with Self-Reference
Figure 3 for Augmenting Unsupervised Reinforcement Learning with Self-Reference
Figure 4 for Augmenting Unsupervised Reinforcement Learning with Self-Reference
Viaarxiv icon

Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation

Add code
Oct 06, 2023
Figure 1 for Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation
Figure 2 for Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation
Figure 3 for Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation
Figure 4 for Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation
Viaarxiv icon