Yuhang Jiang

Closing Reasoning Gaps in Clinical Agents with Differential Reasoning Learning

Feb 10, 2026
Via arXiv

Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models

Feb 02, 2026
Via arXiv

Unsupervised Data Generation for Offline Reinforcement Learning: A Perspective from Model

Jun 24, 2025
Via arXiv

A Benchmark for End-to-End Zero-Shot Biomedical Relation Extraction with LLMs: Experiments with OpenAI Models

Apr 05, 2025
Via arXiv

Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning

Dec 15, 2024
Via arXiv

Can ChatGPT Overcome Behavioral Biases in the Financial Sector? Classify-and-Rethink: Multi-Step Zero-Shot Reasoning in the Gold Investment

Nov 19, 2024
Via arXiv

Doubly Mild Generalization for Offline Reinforcement Learning

Nov 13, 2024
Via arXiv

Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration

Oct 03, 2024
Via arXiv

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks

Aug 20, 2024
Via arXiv

LLM-Empowered State Representation for Reinforcement Learning

Jul 18, 2024
Via arXiv