Picture for Qianyue Hao

Qianyue Hao

Multiple Weaks Win Single Strong: Large Language Models Ensemble Weak Reinforcement Learning Agents into a Supreme One

Add code
May 21, 2025
Viaarxiv icon

LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models

Add code
May 21, 2025
Viaarxiv icon

RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning

Add code
May 20, 2025
Viaarxiv icon

Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities

Add code
Jan 17, 2025
Figure 1 for Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities
Figure 2 for Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities
Figure 3 for Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities
Figure 4 for Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities
Viaarxiv icon

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Add code
Jan 16, 2025
Figure 1 for Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
Figure 2 for Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
Figure 3 for Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
Figure 4 for Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
Viaarxiv icon

A Survey on Human-Centric LLMs

Add code
Nov 26, 2024
Figure 1 for A Survey on Human-Centric LLMs
Figure 2 for A Survey on Human-Centric LLMs
Figure 3 for A Survey on Human-Centric LLMs
Figure 4 for A Survey on Human-Centric LLMs
Viaarxiv icon

HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction

Add code
Oct 10, 2024
Figure 1 for HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction
Figure 2 for HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction
Figure 3 for HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction
Figure 4 for HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction
Viaarxiv icon