Picture for Weilun Zhao

Weilun Zhao

MiniCPM4: Ultra-Efficient LLMs on End Devices

Add code
Jun 09, 2025
Viaarxiv icon

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Add code
Feb 20, 2025
Viaarxiv icon