Jing Li

Vocabulary Hijacking in LVLMs: Unveiling Critical Attention Heads by Excluding Inert Tokens to Mitigate Hallucination

May 11, 2026
Via arXiv

Personalizing LLMs with Binary Feedback: A Preference-Corrected Optimization Framework

May 11, 2026
Via arXiv

Team-Based Self-Play With Dual Adaptive Weighting for Fine-Tuning LLMs

May 11, 2026
Via arXiv

ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning

Apr 21, 2026
Via arXiv

Dataset-Level Metrics Attenuate Non-Determinism: A Fine-Grained Non-Determinism Evaluation in Diffusion Language Models

Apr 15, 2026
Via arXiv

E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning

Apr 10, 2026
Via arXiv

Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward

Apr 10, 2026
Via arXiv

Smart Commander: A Hierarchical Reinforcement Learning Framework for Fleet-Level PHM Decision Optimization

Apr 08, 2026
Via arXiv

PRCCF: A Persona-guided Retrieval and Causal-aware Cognitive Filtering Framework for Emotional Support Conversation

Apr 02, 2026
Via arXiv

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Mar 29, 2026
Via arXiv