Igor Kiselev

Accenture

ThinkBooster: A Unified Framework for Seamless Test-Time Scaling of LLM Reasoning

Jun 05, 2026
Via arXiv

A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs

May 13, 2025
Via arXiv

Yes, Q-learning Helps Offline In-Context RL

Feb 24, 2025
Via arXiv