Picture for Jing Li

Jing Li

LaF-GRPO: In-Situ Navigation Instruction Generation for the Visually Impaired via GRPO with LLM-as-Follower Reward

Add code
Jun 04, 2025
Viaarxiv icon

A Closer Look on Memorization in Tabular Diffusion Model: A Data-Centric Perspective

Add code
May 28, 2025
Viaarxiv icon

Self-Error-Instruct: Generalizing from Errors for LLMs Mathematical Reasoning

Add code
May 28, 2025
Viaarxiv icon

Adaptive Detoxification: Safeguarding General Capabilities of LLMs through Toxicity-Aware Knowledge Editing

Add code
May 28, 2025
Viaarxiv icon

Neural Parameter Search for Slimmer Fine-Tuned Models and Better Transfer

Add code
May 24, 2025
Viaarxiv icon

Knowledge Grafting of Large Language Models

Add code
May 24, 2025
Viaarxiv icon

Safety Alignment via Constrained Knowledge Unlearning

Add code
May 24, 2025
Viaarxiv icon

MTSA: Multi-turn Safety Alignment for LLMs through Multi-round Red-teaming

Add code
May 22, 2025
Viaarxiv icon

Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-Tuning

Add code
May 22, 2025
Viaarxiv icon

Multi-Modality Expansion and Retention for LLMs through Parameter Merging and Decoupling

Add code
May 21, 2025
Viaarxiv icon