Picture for Jean Feng

Jean Feng

When the Domain Expert Has No Time and the LLM Developer Has No Clinical Expertise: Real-World Lessons from LLM Co-Design in a Safety-Net Hospital

Add code
Aug 11, 2025
Viaarxiv icon

Judging LLMs on a Simplex

Add code
May 28, 2025
Viaarxiv icon

Bayesian Concept Bottleneck Models with LLM Priors

Add code
Oct 21, 2024
Figure 1 for Bayesian Concept Bottleneck Models with LLM Priors
Figure 2 for Bayesian Concept Bottleneck Models with LLM Priors
Figure 3 for Bayesian Concept Bottleneck Models with LLM Priors
Figure 4 for Bayesian Concept Bottleneck Models with LLM Priors
Viaarxiv icon

A hierarchical decomposition for explaining ML performance discrepancies

Add code
Feb 22, 2024
Viaarxiv icon

A Brief Tutorial on Sample Size Calculations for Fairness Audits

Add code
Dec 07, 2023
Figure 1 for A Brief Tutorial on Sample Size Calculations for Fairness Audits
Figure 2 for A Brief Tutorial on Sample Size Calculations for Fairness Audits
Viaarxiv icon

Towards a Post-Market Monitoring Framework for Machine Learning-based Medical Devices: A case study

Add code
Nov 20, 2023
Figure 1 for Towards a Post-Market Monitoring Framework for Machine Learning-based Medical Devices: A case study
Figure 2 for Towards a Post-Market Monitoring Framework for Machine Learning-based Medical Devices: A case study
Figure 3 for Towards a Post-Market Monitoring Framework for Machine Learning-based Medical Devices: A case study
Figure 4 for Towards a Post-Market Monitoring Framework for Machine Learning-based Medical Devices: A case study
Viaarxiv icon

Is this model reliable for everyone? Testing for strong calibration

Add code
Jul 28, 2023
Figure 1 for Is this model reliable for everyone? Testing for strong calibration
Figure 2 for Is this model reliable for everyone? Testing for strong calibration
Figure 3 for Is this model reliable for everyone? Testing for strong calibration
Figure 4 for Is this model reliable for everyone? Testing for strong calibration
Viaarxiv icon

Monitoring machine learning (ML)-based risk prediction algorithms in the presence of confounding medical interventions

Add code
Nov 17, 2022
Figure 1 for Monitoring machine learning (ML)-based risk prediction algorithms in the presence of confounding medical interventions
Figure 2 for Monitoring machine learning (ML)-based risk prediction algorithms in the presence of confounding medical interventions
Figure 3 for Monitoring machine learning (ML)-based risk prediction algorithms in the presence of confounding medical interventions
Figure 4 for Monitoring machine learning (ML)-based risk prediction algorithms in the presence of confounding medical interventions
Viaarxiv icon

Sequential algorithmic modification with test data reuse

Add code
Mar 21, 2022
Figure 1 for Sequential algorithmic modification with test data reuse
Figure 2 for Sequential algorithmic modification with test data reuse
Figure 3 for Sequential algorithmic modification with test data reuse
Figure 4 for Sequential algorithmic modification with test data reuse
Viaarxiv icon

Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees

Add code
Oct 13, 2021
Figure 1 for Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees
Figure 2 for Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees
Figure 3 for Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees
Figure 4 for Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees
Viaarxiv icon