Picture for Jean-Francois Ton

Jean-Francois Ton

Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

Add code
Mar 08, 2024
Figure 1 for Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Figure 2 for Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Figure 3 for Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Viaarxiv icon

Dataset Fairness: Achievable Fairness on Your Data With Utility Guarantees

Add code
Feb 27, 2024
Viaarxiv icon

Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting

Add code
Feb 16, 2024
Figure 1 for Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting
Figure 2 for Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting
Figure 3 for Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting
Figure 4 for Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting
Viaarxiv icon

Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits

Add code
Dec 03, 2023
Viaarxiv icon

Deep Concept Removal

Add code
Oct 09, 2023
Viaarxiv icon

Invariant Learning via Probability of Sufficient and Necessary Causes

Add code
Sep 22, 2023
Viaarxiv icon

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Add code
Aug 10, 2023
Figure 1 for Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment
Figure 2 for Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment
Figure 3 for Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment
Figure 4 for Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment
Viaarxiv icon

Conformal Off-Policy Prediction in Contextual Bandits

Add code
Jun 09, 2022
Figure 1 for Conformal Off-Policy Prediction in Contextual Bandits
Figure 2 for Conformal Off-Policy Prediction in Contextual Bandits
Figure 3 for Conformal Off-Policy Prediction in Contextual Bandits
Figure 4 for Conformal Off-Policy Prediction in Contextual Bandits
Viaarxiv icon

Regularized Training of Nearest Neighbor Language Models

Add code
Sep 16, 2021
Figure 1 for Regularized Training of Nearest Neighbor Language Models
Figure 2 for Regularized Training of Nearest Neighbor Language Models
Figure 3 for Regularized Training of Nearest Neighbor Language Models
Figure 4 for Regularized Training of Nearest Neighbor Language Models
Viaarxiv icon

Meta Learning for Causal Direction

Add code
Jul 06, 2020
Figure 1 for Meta Learning for Causal Direction
Figure 2 for Meta Learning for Causal Direction
Figure 3 for Meta Learning for Causal Direction
Figure 4 for Meta Learning for Causal Direction
Viaarxiv icon