John Henning

Post-processing Private Synthetic Data for Improving Utility on Selected Measures

May 24, 2023
Hao Wang, Shivchander Sudalairaj, John Henning, Kristjan Greenewald, Akash Srivastava

Existing private synthetic data generation algorithms are agnostic to downstream tasks. However, end users may have specific requirements that the synthetic data must satisfy. Failure to meet these requirements could significantly reduce the utility of the data for downstream use. We introduce a post-processing technique that improves the utility of the synthetic data with respect to measures selected by the end user, while preserving strong privacy guarantees and dataset quality. Our technique involves resampling from the synthetic data to filter out samples that do not meet the selected utility measures, using an efficient stochastic first-order algorithm to find optimal resampling weights. Through comprehensive numerical experiments, we demonstrate that our approach consistently improves the utility of synthetic data across multiple benchmark datasets and state-of-the-art synthetic data generation algorithms.
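
The resampling idea in this abstract can be illustrated with a small sketch. This is not the authors' algorithm: it assumes the target averages for the selected measures have already been released privately, it uses plain full-batch gradient descent on a log-sum-exp dual in place of the stochastic first-order method the abstract mentions, and the names `fit_resampling_weights`, `_tilted_weights`, and `resample` are hypothetical.

```python
import numpy as np


def _tilted_weights(centered, lam):
    # Exponentially tilted weights, normalized to sum to 1.
    logits = -(centered @ lam)
    logits -= logits.max()  # subtract the max for numerical stability
    w = np.exp(logits)
    return w / w.sum()


def fit_resampling_weights(features, targets, lr=1.0, steps=2000):
    """Find one weight per synthetic row so weighted feature means match targets.

    features: (n, k) array, value of each selected measure on each synthetic row.
    targets:  (k,) array of desired averages (assumed already privately released).
    """
    centered = features - targets        # c_i = q(x_i) - target
    lam = np.zeros(features.shape[1])    # one dual variable per measure
    for _ in range(steps):
        weights = _tilted_weights(centered, lam)
        # The dual gradient is -(weighted mean of c), so descent adds it to lam.
        lam += lr * (weights @ centered)
    return _tilted_weights(centered, lam)


def resample(rows, weights, seed=0):
    # Draw a new dataset of the same size, with replacement, using the weights.
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(rows), size=len(rows), replace=True, p=weights)
    return rows[idx]


# Toy data (made up for illustration): one binary measure that holds for
# 30% of the synthetic rows, with an assumed target average of 50%.
features = np.array([[1.0]] * 30 + [[0.0]] * 70)
targets = np.array([0.5])
weights = fit_resampling_weights(features, targets)
resampled = resample(features, weights)
```

In this toy setup the fitted weights shift mass toward the under-represented rows until the weighted average of the measure matches the target; resampling with those weights then yields a dataset whose average is close to the target, up to sampling noise.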

Rapid Development of Compositional AI

Feb 12, 2023
Lee Martie, Jessie Rosenberg, Veronique Demers, Gaoyuan Zhang, Onkar Bhardwaj, John Henning, Aditya Prasad, Matt Stallone, Ja Young Lee, Lucy Yip, Damilola Adesina, Elahe Paikari, Oscar Resendiz, Sarah Shaw, David Cox

Compositional AI systems, which combine multiple artificial intelligence components with other application components to solve a larger problem, have no known pattern of development and are often approached in a bespoke and ad hoc style. This makes development slower and the resulting work harder to reuse in future applications. To support the full rapid development cycle of compositional AI applications, we have developed a novel framework called (Bee)* (written as a regular expression and pronounced as "beestar"). We illustrate how (Bee)* supports building integrated, scalable, and interactive compositional AI applications with a simplified developer experience.

* 2023 IEEE/ACM 45th International Conference on Software Engineering: New Ideas and Emerging Technologies Results Track (ICSE-NIER), Melbourne, Australia, 2023, pp. (forthcoming)  
* Accepted to ICSE 2023, NIER track 