Picture for Halil Alperen Gozeten

Halil Alperen Gozeten

Evolutionary Multi-Task Optimization for LLM-Guided Program Discovery

Add code
May 21, 2026
Viaarxiv icon

Learning to Correct: Calibrated Reinforcement Learning for Multi-Attempt Chain-of-Thought

Add code
Apr 20, 2026
Viaarxiv icon

Continuous Chain of Thought Enables Parallel Exploration and Reasoning

Add code
May 29, 2025
Viaarxiv icon

Test-Time Training Provably Improves Transformers as In-context Learners

Add code
Mar 14, 2025
Viaarxiv icon

High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws

Add code
Oct 24, 2024
Viaarxiv icon