Picture for Jack Lanchantin

Jack Lanchantin

Bridging Offline and Online Reinforcement Learning for LLMs

Add code
Jun 26, 2025
Viaarxiv icon

LLM Pretraining with Continuous Concepts

Add code
Feb 12, 2025
Viaarxiv icon

Diverse Preference Optimization

Add code
Jan 31, 2025
Figure 1 for Diverse Preference Optimization
Figure 2 for Diverse Preference Optimization
Figure 3 for Diverse Preference Optimization
Figure 4 for Diverse Preference Optimization
Viaarxiv icon

Adaptive Decoding via Latent Preference Optimization

Add code
Nov 14, 2024
Figure 1 for Adaptive Decoding via Latent Preference Optimization
Figure 2 for Adaptive Decoding via Latent Preference Optimization
Figure 3 for Adaptive Decoding via Latent Preference Optimization
Figure 4 for Adaptive Decoding via Latent Preference Optimization
Viaarxiv icon

TOOLVERIFIER: Generalization to New Tools via Self-Verification

Add code
Feb 21, 2024
Figure 1 for TOOLVERIFIER: Generalization to New Tools via Self-Verification
Figure 2 for TOOLVERIFIER: Generalization to New Tools via Self-Verification
Figure 3 for TOOLVERIFIER: Generalization to New Tools via Self-Verification
Figure 4 for TOOLVERIFIER: Generalization to New Tools via Self-Verification
Viaarxiv icon

A Data Source for Reasoning Embodied Agents

Add code
Sep 14, 2023
Viaarxiv icon

Learning to Reason and Memorize with Self-Notes

Add code
May 01, 2023
Figure 1 for Learning to Reason and Memorize with Self-Notes
Figure 2 for Learning to Reason and Memorize with Self-Notes
Figure 3 for Learning to Reason and Memorize with Self-Notes
Figure 4 for Learning to Reason and Memorize with Self-Notes
Viaarxiv icon

General Multi-label Image Classification with Transformers

Add code
Nov 27, 2020
Figure 1 for General Multi-label Image Classification with Transformers
Figure 2 for General Multi-label Image Classification with Transformers
Figure 3 for General Multi-label Image Classification with Transformers
Figure 4 for General Multi-label Image Classification with Transformers
Viaarxiv icon

Reevaluating Adversarial Examples in Natural Language

Add code
Apr 25, 2020
Figure 1 for Reevaluating Adversarial Examples in Natural Language
Figure 2 for Reevaluating Adversarial Examples in Natural Language
Figure 3 for Reevaluating Adversarial Examples in Natural Language
Figure 4 for Reevaluating Adversarial Examples in Natural Language
Viaarxiv icon

Neural Message Passing for Multi-Label Classification

Add code
Apr 17, 2019
Figure 1 for Neural Message Passing for Multi-Label Classification
Figure 2 for Neural Message Passing for Multi-Label Classification
Figure 3 for Neural Message Passing for Multi-Label Classification
Figure 4 for Neural Message Passing for Multi-Label Classification
Viaarxiv icon