Ruifeng Ren

Revisiting Transformers through the Lens of Low Entropy and Dynamic Sparsity

Apr 26, 2025
Via arXiv

Unveiling the Mechanisms of Explicit CoT Training: How Chain-of-Thought Enhances Reasoning Generalization

Feb 07, 2025
Via arXiv

Can Mamba Always Enjoy the "Free Lunch"?

Oct 04, 2024
Via arXiv

In-context Learning with Transformer Is Really Equivalent to a Contrastive Learning Pattern

Oct 20, 2023
Via arXiv