Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Changkun Ou

Progressive Autonomy as Preference Learning: A Formalization of Trust Calibration for Agentic Tool Use

May 18, 2026

Changkun Ou

Abstract:We formalize trust calibration for agentic tool use (deciding when an automated agent's proposed action may execute autonomously versus require human approval) as a preference-learning problem. A policy gateway maintains a Gaussian-process posterior over a latent human risk-tolerance function, observed through a probit likelihood on binary approve/deny feedback, and escalates to the human exactly where the approval outcome is most uncertain. We show this is structurally an instance of Preferential Bayesian Optimization, inheriting its inference machinery (approximate Gaussian-process classification) and its sample-efficiency argument (uncertainty-targeted querying), while differing in objective: classifying an action space into allow/block/ask regions rather than optimizing a design.

Via

Access Paper or Ask Questions

The Impact of Expertise in the Loop for Exploring Machine Rationality

Feb 11, 2023

Changkun Ou, Sven Mayer, Andreas Butz

Abstract:Human-in-the-loop optimization utilizes human expertise to guide machine optimizers iteratively and search for an optimal solution in a solution space. While prior empirical studies mainly investigated novices, we analyzed the impact of the levels of expertise on the outcome quality and corresponding subjective satisfaction. We conducted a study (N=60) in text, photo, and 3D mesh optimization contexts. We found that novices can achieve an expert level of quality performance, but participants with higher expertise led to more optimization iteration with more explicit preference while keeping satisfaction low. In contrast, novices were more easily satisfied and terminated faster. Therefore, we identified that experts seek more diverse outcomes while the machine reaches optimal results, and the observed behavior can be used as a performance indicator for human-in-the-loop system designers to improve underlying models. We inform future research to be cautious about the impact of user expertise when designing human-in-the-loop systems.

* In 28th International Conference on Intelligent User Interfaces (IUI '23), March 27-31, 2023, Sydney, NSW, Australia. ACM, New York, NY, USA, 15 pages

Via

Access Paper or Ask Questions

Identifying Malicious Players in GWAP-based Disaster Monitoring Crowdsourcing System

Sep 14, 2019

Changkun Ou, Yifei Zhan, Yaxi Chen

Figure 1 for Identifying Malicious Players in GWAP-based Disaster Monitoring Crowdsourcing System

Figure 2 for Identifying Malicious Players in GWAP-based Disaster Monitoring Crowdsourcing System

Abstract:Disaster monitoring is challenging due to the lake of infrastructures in monitoring areas. Based on the theory of Game-With-A-Purpose (GWAP), this paper contributes to a novel large-scale crowdsourcing disaster monitoring system. The system analyzes tagged satellite pictures from anonymous players, and then reports aggregated and evaluated monitoring results to its stakeholders. An algorithm based on directed graph centralities is presented to address the core issues of malicious user detection and disaster level calculation. Our method can be easily applied in other human computation systems. In the end, some issues with possible solutions are discussed for our future work.

* In IEEE ICAIBD' 19: Proceedings of the 2nd International Conference on Artificial Intelligence and Big Data. Chengdu, Sichuan, China, May 25-28, 2019

Via

Access Paper or Ask Questions