Christos Thrampoulidis

Directional Alignment Mitigates Reward Hacking in Reinforcement Learning for Language Models

May 24, 2026

The Implicit Bias of Depth: From Neural Collapse to Softmax Codes

May 21, 2026

High-Dimensional Statistics: Reflections on Progress and Open Problems

May 06, 2026

Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge

Mar 21, 2026

Why Loss Re-weighting Works If You Stop Early: Training Dynamics of Unconstrained Features

Jan 17, 2026

Short-Context Dominance: How Much Local Context Natural Language Actually Needs?

Dec 08, 2025

Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection

Nov 17, 2025

Advantage Shaping as Surrogate Reward Maximization: Unifying Pass@K Policy Gradients

Oct 27, 2025

How Muon's Spectral Design Benefits Generalization: A Study on Imbalanced Data

Oct 27, 2025

Efficient Forward-Only Data Valuation for Pretrained LLMs and VLMs

Aug 13, 2025