Picture for Yoshua Bengio

Yoshua Bengio

DIRO

A Comedy of Estimators: On KL Regularization in RL Training of LLMs

Add code
Dec 26, 2025
Viaarxiv icon

Sliding Window Recurrences for Sequence Models

Add code
Dec 15, 2025
Viaarxiv icon

FALCON: Few-step Accurate Likelihoods for Continuous Flows

Add code
Dec 10, 2025
Viaarxiv icon

Scaling Latent Reasoning via Looped Language Models

Add code
Oct 29, 2025
Figure 1 for Scaling Latent Reasoning via Looped Language Models
Figure 2 for Scaling Latent Reasoning via Looped Language Models
Figure 3 for Scaling Latent Reasoning via Looped Language Models
Figure 4 for Scaling Latent Reasoning via Looped Language Models
Viaarxiv icon

Surrogate-based quantification of policy uncertainty in generative flow networks

Add code
Oct 24, 2025
Viaarxiv icon

Catalyst GFlowNet for electrocatalyst design: A hydrogen evolution reaction case study

Add code
Oct 02, 2025
Viaarxiv icon

Recursive Self-Aggregation Unlocks Deep Thinking in Large Language Models

Add code
Sep 30, 2025
Viaarxiv icon

Active Attacks: Red-teaming LLMs via Adaptive Environments

Add code
Sep 26, 2025
Figure 1 for Active Attacks: Red-teaming LLMs via Adaptive Environments
Figure 2 for Active Attacks: Red-teaming LLMs via Adaptive Environments
Figure 3 for Active Attacks: Red-teaming LLMs via Adaptive Environments
Figure 4 for Active Attacks: Red-teaming LLMs via Adaptive Environments
Viaarxiv icon

Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety

Add code
Jul 15, 2025
Figure 1 for Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety
Viaarxiv icon

Torsional-GFN: a conditional conformation generator for small molecules

Add code
Jul 15, 2025
Viaarxiv icon