Wataru Kawakami

Stabilizing Reasoning in Medical LLMs with Continued Pretraining and Reasoning Preference Optimization

Apr 25, 2025
Via arXiv