Zesheng Shi

Team-Based Self-Play With Dual Adaptive Weighting for Fine-Tuning LLMs

May 11, 2026

Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward

Apr 10, 2026

E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning

Apr 10, 2026

Knowledge Grafting of Large Language Models

May 24, 2025

Safety Alignment via Constrained Knowledge Unlearning

May 24, 2025

ReaderLM-v2: Small Language Model for HTML to Markdown and JSON

Mar 03, 2025