Jean-Francois Ton

Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

Mar 08, 2024
Xiaoying Zhang, Jean-Francois Ton, Wei Shen, Hongning Wang, Yang Liu

Dataset Fairness: Achievable Fairness on Your Data With Utility Guarantees

Feb 27, 2024
Muhammad Faaiz Taufiq, Jean-Francois Ton, Yang Liu

Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting

Feb 16, 2024
Jiaheng Wei, Yuanshun Yao, Jean-Francois Ton, Hongyi Guo, Andrew Estornell, Yang Liu

Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits

Dec 03, 2023
Muhammad Faaiz Taufiq, Arnaud Doucet, Rob Cornish, Jean-Francois Ton

Deep Concept Removal

Oct 09, 2023
Yegor Klochkov, Jean-Francois Ton, Ruocheng Guo, Yang Liu, Hang Li

Invariant Learning via Probability of Sufficient and Necessary Causes

Sep 22, 2023
Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jun Wang

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Aug 10, 2023
Yang Liu, Yuanshun Yao, Jean-Francois Ton, Xiaoying Zhang, Ruocheng Guo, Hao Cheng, Yegor Klochkov, Muhammad Faaiz Taufiq, Hang Li

Conformal Off-Policy Prediction in Contextual Bandits

Jun 09, 2022
Muhammad Faaiz Taufiq, Jean-Francois Ton, Rob Cornish, Yee Whye Teh, Arnaud Doucet

Regularized Training of Nearest Neighbor Language Models

Sep 16, 2021
Jean-Francois Ton, Walter Talbott, Shuangfei Zhai, Josh Susskind

Meta Learning for Causal Direction

Jul 06, 2020
Jean-Francois Ton, Dino Sejdinovic, Kenji Fukumizu
