Picture for Jindong Li

Jindong Li

VESTA: A Fully Automated Scenario Generation and Safety Evaluation Framework for LLM Agents

Add code
Jun 07, 2026
Viaarxiv icon

CoSPlay: Cooperative Self-Play at Test-Time with Self-Generated Code and Unit Test

Add code
May 25, 2026
Viaarxiv icon

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Add code
May 12, 2026
Viaarxiv icon

From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation

Add code
May 12, 2026
Viaarxiv icon

Learning Less Is More: Premature Upper-Layer Attention Specialization Hurts Language Model Pretraining

Add code
May 11, 2026
Viaarxiv icon

Where Does Long-Context Supervision Actually Go? Effective-Context Exposure Balancing

Add code
May 11, 2026
Viaarxiv icon

ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI

Add code
Feb 15, 2026
Viaarxiv icon

Enhancing IMU-Based Online Handwriting Recognition via Contrastive Learning with Zero Inference Overhead

Add code
Feb 04, 2026
Viaarxiv icon

AL-GNN: Privacy-Preserving and Replay-Free Continual Graph Learning via Analytic Learning

Add code
Dec 20, 2025
Viaarxiv icon

PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks

Add code
May 22, 2025
Figure 1 for PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks
Figure 2 for PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks
Figure 3 for PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks
Figure 4 for PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks
Viaarxiv icon