Picture for Yoshua Bengio

Yoshua Bengio

DIRO

Discrete Feynman-Kac Correctors

Add code
Jan 15, 2026
Viaarxiv icon

In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior

Add code
Jan 06, 2026
Viaarxiv icon

A Comedy of Estimators: On KL Regularization in RL Training of LLMs

Add code
Dec 26, 2025
Viaarxiv icon

Sliding Window Recurrences for Sequence Models

Add code
Dec 15, 2025
Figure 1 for Sliding Window Recurrences for Sequence Models
Figure 2 for Sliding Window Recurrences for Sequence Models
Figure 3 for Sliding Window Recurrences for Sequence Models
Figure 4 for Sliding Window Recurrences for Sequence Models
Viaarxiv icon

FALCON: Few-step Accurate Likelihoods for Continuous Flows

Add code
Dec 10, 2025
Figure 1 for FALCON: Few-step Accurate Likelihoods for Continuous Flows
Figure 2 for FALCON: Few-step Accurate Likelihoods for Continuous Flows
Figure 3 for FALCON: Few-step Accurate Likelihoods for Continuous Flows
Figure 4 for FALCON: Few-step Accurate Likelihoods for Continuous Flows
Viaarxiv icon

Scaling Latent Reasoning via Looped Language Models

Add code
Oct 29, 2025
Figure 1 for Scaling Latent Reasoning via Looped Language Models
Figure 2 for Scaling Latent Reasoning via Looped Language Models
Figure 3 for Scaling Latent Reasoning via Looped Language Models
Figure 4 for Scaling Latent Reasoning via Looped Language Models
Viaarxiv icon

Surrogate-based quantification of policy uncertainty in generative flow networks

Add code
Oct 24, 2025
Viaarxiv icon

Catalyst GFlowNet for electrocatalyst design: A hydrogen evolution reaction case study

Add code
Oct 02, 2025
Viaarxiv icon

Recursive Self-Aggregation Unlocks Deep Thinking in Large Language Models

Add code
Sep 30, 2025
Viaarxiv icon

Active Attacks: Red-teaming LLMs via Adaptive Environments

Add code
Sep 26, 2025
Figure 1 for Active Attacks: Red-teaming LLMs via Adaptive Environments
Figure 2 for Active Attacks: Red-teaming LLMs via Adaptive Environments
Figure 3 for Active Attacks: Red-teaming LLMs via Adaptive Environments
Figure 4 for Active Attacks: Red-teaming LLMs via Adaptive Environments
Viaarxiv icon