Picture for Lili Mou

Lili Mou

RLFTSim: Realistic and Controllable Multi-Agent Traffic Simulation via Reinforcement Learning Fine-Tuning

Add code
May 18, 2026
Viaarxiv icon

Cactus: Accelerating Auto-Regressive Decoding with Constrained Acceptance Speculative Sampling

Add code
Apr 05, 2026
Viaarxiv icon

Do Neural Networks Lose Plasticity in a Gradually Changing World?

Add code
Feb 09, 2026
Viaarxiv icon

Multi-Persona Thinking for Bias Mitigation in Large Language Models

Add code
Jan 21, 2026
Viaarxiv icon

LoRAQuant: Mixed-Precision Quantization of LoRA to Ultra-Low Bits

Add code
Oct 30, 2025
Viaarxiv icon

RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization

Add code
Jul 31, 2025
Viaarxiv icon

KETCHUP: K-Step Return Estimation for Sequential Knowledge Distillation

Add code
Apr 26, 2025
Viaarxiv icon

A Decoding Algorithm for Length-Control Summarization Based on Directed Acyclic Transformers

Add code
Feb 06, 2025
Viaarxiv icon

ULPT: Prompt Tuning with Ultra-Low-Dimensional Optimization

Add code
Feb 06, 2025
Figure 1 for ULPT: Prompt Tuning with Ultra-Low-Dimensional Optimization
Figure 2 for ULPT: Prompt Tuning with Ultra-Low-Dimensional Optimization
Figure 3 for ULPT: Prompt Tuning with Ultra-Low-Dimensional Optimization
Figure 4 for ULPT: Prompt Tuning with Ultra-Low-Dimensional Optimization
Viaarxiv icon

Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much)

Add code
Feb 06, 2025
Figure 1 for Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much)
Figure 2 for Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much)
Figure 3 for Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much)
Viaarxiv icon