Picture for Boyi Zeng

Boyi Zeng

AdaPonderLM: Gated Pondering Language Models with Token-Wise Adaptive Depth

Add code
Mar 02, 2026
Viaarxiv icon

PonderLM-3: Adaptive Token-Wise Pondering with Differentiable Masking

Add code
Mar 02, 2026
Viaarxiv icon

Pretraining with Token-Level Adaptive Latent Chain-of-Thought

Add code
Feb 09, 2026
Viaarxiv icon

Pretraining Language Models to Ponder in Continuous Space

Add code
May 27, 2025
Figure 1 for Pretraining Language Models to Ponder in Continuous Space
Figure 2 for Pretraining Language Models to Ponder in Continuous Space
Figure 3 for Pretraining Language Models to Ponder in Continuous Space
Figure 4 for Pretraining Language Models to Ponder in Continuous Space
Viaarxiv icon

FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension

Add code
May 01, 2025
Viaarxiv icon

GeoGalactica: A Scientific Large Language Model in Geoscience

Add code
Dec 31, 2023
Figure 1 for GeoGalactica: A Scientific Large Language Model in Geoscience
Figure 2 for GeoGalactica: A Scientific Large Language Model in Geoscience
Figure 3 for GeoGalactica: A Scientific Large Language Model in Geoscience
Figure 4 for GeoGalactica: A Scientific Large Language Model in Geoscience
Viaarxiv icon

HuRef: HUman-REadable Fingerprint for Large Language Models

Add code
Dec 08, 2023
Figure 1 for HuRef: HUman-REadable Fingerprint for Large Language Models
Figure 2 for HuRef: HUman-REadable Fingerprint for Large Language Models
Figure 3 for HuRef: HUman-REadable Fingerprint for Large Language Models
Figure 4 for HuRef: HUman-REadable Fingerprint for Large Language Models
Viaarxiv icon