Picture for Shixiang Song

Shixiang Song

PonderLM-3: Adaptive Token-Wise Pondering with Differentiable Masking

Add code
Mar 02, 2026
Viaarxiv icon

AdaPonderLM: Gated Pondering Language Models with Token-Wise Adaptive Depth

Add code
Mar 02, 2026
Viaarxiv icon

Pretraining with Token-Level Adaptive Latent Chain-of-Thought

Add code
Feb 09, 2026
Viaarxiv icon

Pretraining Language Models to Ponder in Continuous Space

Add code
May 27, 2025
Figure 1 for Pretraining Language Models to Ponder in Continuous Space
Figure 2 for Pretraining Language Models to Ponder in Continuous Space
Figure 3 for Pretraining Language Models to Ponder in Continuous Space
Figure 4 for Pretraining Language Models to Ponder in Continuous Space
Viaarxiv icon