Christos Thrampoulidis

Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge

Mar 21, 2026
Via arXiv

Why Loss Re-weighting Works If You Stop Early: Training Dynamics of Unconstrained Features

Jan 17, 2026
Via arXiv

Short-Context Dominance: How Much Local Context Natural Language Actually Needs?

Dec 08, 2025
Via arXiv

Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection

Nov 17, 2025
Via arXiv

How Muon's Spectral Design Benefits Generalization: A Study on Imbalanced Data

Oct 27, 2025
Via arXiv

Advantage Shaping as Surrogate Reward Maximization: Unifying Pass@K Policy Gradients

Oct 27, 2025
Via arXiv

Efficient Forward-Only Data Valuation for Pretrained LLMs and VLMs

Aug 13, 2025
Via arXiv

In-Context Occam's Razor: How Transformers Prefer Simpler Hypotheses on the Fly

Jun 24, 2025
Via arXiv

On the Effect of Negative Gradient in Group Relative Deep Reinforcement Optimization

May 24, 2025
Via arXiv

On the Geometry of Semantics in Next-token Prediction

May 13, 2025
Via arXiv