Picture for Weilin Zhao

Weilin Zhao

MiniCPM4: Ultra-Efficient LLMs on End Devices

Add code
Jun 09, 2025
Viaarxiv icon

Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design

Add code
May 29, 2025
Viaarxiv icon

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Add code
Feb 20, 2025
Viaarxiv icon

APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

Add code
Feb 17, 2025
Viaarxiv icon

Densing Law of LLMs

Add code
Dec 05, 2024
Figure 1 for Densing Law of LLMs
Figure 2 for Densing Law of LLMs
Figure 3 for Densing Law of LLMs
Figure 4 for Densing Law of LLMs
Viaarxiv icon

Enabling Real-Time Conversations with Minimal Training Costs

Add code
Sep 18, 2024
Viaarxiv icon

Configurable Foundation Models: Building LLMs from a Modular Perspective

Add code
Sep 04, 2024
Figure 1 for Configurable Foundation Models: Building LLMs from a Modular Perspective
Figure 2 for Configurable Foundation Models: Building LLMs from a Modular Perspective
Figure 3 for Configurable Foundation Models: Building LLMs from a Modular Perspective
Figure 4 for Configurable Foundation Models: Building LLMs from a Modular Perspective
Viaarxiv icon

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Add code
Aug 03, 2024
Figure 1 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Figure 2 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Figure 3 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Figure 4 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Viaarxiv icon

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models

Add code
Jun 22, 2024
Figure 1 for Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models
Figure 2 for Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models
Figure 3 for Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models
Figure 4 for Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models
Viaarxiv icon

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Add code
Apr 09, 2024
Figure 1 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Figure 2 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Figure 3 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Figure 4 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Viaarxiv icon