Picture for Alessandro Sordoni

Alessandro Sordoni

Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models

Add code
Dec 16, 2025
Figure 1 for Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
Figure 2 for Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
Figure 3 for Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
Figure 4 for Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
Viaarxiv icon

Learning to Extract Context for Context-Aware LLM Inference

Add code
Dec 12, 2025
Viaarxiv icon

Gistify! Codebase-Level Understanding via Runtime Execution

Add code
Oct 30, 2025
Figure 1 for Gistify! Codebase-Level Understanding via Runtime Execution
Figure 2 for Gistify! Codebase-Level Understanding via Runtime Execution
Figure 3 for Gistify! Codebase-Level Understanding via Runtime Execution
Figure 4 for Gistify! Codebase-Level Understanding via Runtime Execution
Viaarxiv icon

Medical Red Teaming Protocol of Language Models: On the Importance of User Perspectives in Healthcare Settings

Add code
Jul 09, 2025
Viaarxiv icon

A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment

Add code
May 15, 2025
Viaarxiv icon

Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers

Add code
May 07, 2025
Figure 1 for Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers
Figure 2 for Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers
Figure 3 for Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers
Figure 4 for Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers
Viaarxiv icon

debug-gym: A Text-Based Environment for Interactive Debugging

Add code
Mar 27, 2025
Figure 1 for debug-gym: A Text-Based Environment for Interactive Debugging
Figure 2 for debug-gym: A Text-Based Environment for Interactive Debugging
Figure 3 for debug-gym: A Text-Based Environment for Interactive Debugging
Figure 4 for debug-gym: A Text-Based Environment for Interactive Debugging
Viaarxiv icon

Training Plug-n-Play Knowledge Modules with Deep Context Distillation

Add code
Mar 11, 2025
Viaarxiv icon

Not All LLM Reasoners Are Created Equal

Add code
Oct 02, 2024
Figure 1 for Not All LLM Reasoners Are Created Equal
Figure 2 for Not All LLM Reasoners Are Created Equal
Figure 3 for Not All LLM Reasoners Are Created Equal
Figure 4 for Not All LLM Reasoners Are Created Equal
Viaarxiv icon

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Add code
Oct 02, 2024
Figure 1 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Figure 2 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Figure 3 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Figure 4 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Viaarxiv icon