Picture for Jiasi Shen

Jiasi Shen

TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models

Add code
Feb 17, 2026
Viaarxiv icon

OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification

Add code
Apr 29, 2025
Figure 1 for OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification
Figure 2 for OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification
Figure 3 for OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification
Figure 4 for OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification
Viaarxiv icon