Picture for Gabriel Synnaeve

Gabriel Synnaeve

Jack

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

Add code
May 23, 2025
Viaarxiv icon

Optimizing Language Models for Inference Time Objectives using Reinforcement Learning

Add code
Mar 25, 2025
Viaarxiv icon

BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?

Add code
Mar 20, 2025
Viaarxiv icon

The KoLMogorov Test: Compression by Code Generation

Add code
Mar 18, 2025
Viaarxiv icon

Soft Policy Optimization: Online Off-Policy RL for Sequence Models

Add code
Mar 07, 2025
Figure 1 for Soft Policy Optimization: Online Off-Policy RL for Sequence Models
Figure 2 for Soft Policy Optimization: Online Off-Policy RL for Sequence Models
Viaarxiv icon

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Add code
Feb 25, 2025
Viaarxiv icon

Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs

Add code
Oct 11, 2024
Figure 1 for Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs
Figure 2 for Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs
Figure 3 for Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs
Figure 4 for Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs
Viaarxiv icon

What Makes Large Language Models Reason in (Multi-Turn) Code Generation?

Add code
Oct 10, 2024
Figure 1 for What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
Figure 2 for What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
Figure 3 for What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
Figure 4 for What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
Viaarxiv icon

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

Add code
Oct 04, 2024
Figure 1 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 2 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 3 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 4 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Viaarxiv icon

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Add code
Oct 02, 2024
Figure 1 for RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Figure 2 for RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Figure 3 for RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Figure 4 for RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Viaarxiv icon