Zhenning Yang

Cloud Infrastructure Management in the Age of AI Agents

Jun 13, 2025

Zhenning Yang, Archit Bhatnagar, Yiming Qiu, Tongyuan Miao, Patrick Tser Jern Kon, Yunming Xiao, Yibo Huang, Martin Casado, Ang Chen

Abstract: Cloud infrastructure is the cornerstone of the modern IT industry. However, managing this infrastructure effectively requires considerable manual effort from the DevOps engineering team. We make a case for developing AI agents powered by large language models (LLMs) to automate cloud infrastructure management tasks. In a preliminary study, we investigate the potential for AI agents to use different cloud/user interfaces such as software development kits (SDKs), command line interfaces (CLIs), Infrastructure-as-Code (IaC) platforms, and web portals. We report takeaways on their effectiveness on different management tasks, and identify research challenges and potential solutions.

SQUiD: Synthesizing Relational Databases from Unstructured Text

May 25, 2025

Mushtari Sadia, Zhenning Yang, Yunming Xiao, Ang Chen, Amrita Roy Chowdhury

Abstract: Relational databases are central to modern data management, yet most data exists in unstructured forms like text documents. To bridge this gap, we leverage large language models (LLMs) to automatically synthesize a relational database by generating its schema and populating its tables from raw text. We introduce SQUiD, a novel neurosymbolic framework that decomposes this task into four stages, each with specialized techniques. Our experiments show that SQUiD consistently outperforms baselines across diverse datasets.

Curie: Toward Rigorous and Automated Scientific Experimentation with AI Agents

Feb 26, 2025

Patrick Tser Jern Kon, Jiachen Liu, Qiuyi Ding, Yiming Qiu, Zhenning Yang, Yibo Huang, Jayanth Srinivasa, Myungjin Lee, Mosharaf Chowdhury, Ang Chen

Abstract: Scientific experimentation, a cornerstone of human progress, demands rigor in reliability, methodical control, and interpretability to yield meaningful results. Despite the growing capabilities of large language models (LLMs) in automating different aspects of the scientific process, automating rigorous experimentation remains a significant challenge. To address this gap, we propose Curie, an AI agent framework designed to embed rigor into the experimentation process through three key components: an intra-agent rigor module to enhance reliability, an inter-agent rigor module to maintain methodical control, and an experiment knowledge module to enhance interpretability. To evaluate Curie, we design a novel experimental benchmark composed of 46 questions across four computer science domains, derived from influential research papers and widely adopted open-source projects. Compared to the strongest baseline tested, we achieve a $3.4\times$ improvement in correctly answering experimental questions. Curie is open-sourced at https://github.com/Just-Curieous/Curie.

* 21 pages

Adversarial Attacks and Defense for Conversation Entailment Task

May 02, 2024

Zhenning Yang, Ryan Krawec, Liang-Yuan Wu

Abstract: As the deployment of NLP systems in critical applications grows, ensuring the robustness of large language models (LLMs) against adversarial attacks becomes increasingly important. Large language models excel in various NLP tasks but remain vulnerable to low-cost adversarial attacks. Focusing on the domain of conversation entailment, where multi-turn dialogues serve as premises to verify hypotheses, we fine-tune a transformer model to accurately discern the truthfulness of these hypotheses. Adversaries manipulate hypotheses through synonym swapping, aiming to deceive the model into making incorrect predictions. To counteract these attacks, we implemented innovative fine-tuning techniques and introduced an embedding perturbation loss method to significantly bolster the model's robustness. Our findings not only emphasize the importance of defending against adversarial attacks in NLP but also highlight the real-world implications, suggesting that enhancing model robustness is critical for reliable NLP applications.
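The synonym-swapping attack described in the abstract can be illustrated with a minimal sketch. The function name, the whitespace tokenization, and the tiny synonym table below are all assumptions made for illustration; the abstract does not say how the authors select synonyms or candidate positions.

```python
from typing import Dict, List


def synonym_swaps(hypothesis: str, synonyms: Dict[str, List[str]]) -> List[str]:
    """Generate adversarial candidates by replacing one word at a time.

    `synonyms` is a hypothetical lookup table (lowercase word -> alternatives).
    Each returned string differs from the input in exactly one word.
    """
    words = hypothesis.split()
    variants = []
    for i, word in enumerate(words):
        for alt in synonyms.get(word.lower(), []):
            variants.append(" ".join(words[:i] + [alt] + words[i + 1:]))
    return variants


# Hypothetical example hypothesis and synonym table.
table = {"happy": ["glad", "cheerful"]}
for candidate in synonym_swaps("the speaker is happy", table):
    print(candidate)
```

An attacker would feed each candidate to the entailment model and keep those that flip its prediction; that model query is omitted here.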

Oobleck: Resilient Distributed Training of Large Models Using Pipeline Templates

Sep 15, 2023

Insu Jang, Zhenning Yang, Zhen Zhang, Xin Jin, Mosharaf Chowdhury

Abstract: Oobleck enables resilient distributed training of large DNN models with guaranteed fault tolerance. It takes a planning-execution co-design approach, where it first generates a set of heterogeneous pipeline templates and instantiates at least $f+1$ logically equivalent pipeline replicas to tolerate any $f$ simultaneous failures. During execution, it relies on already-replicated model states across the replicas to provide fast recovery. Oobleck provably guarantees that some combination of the initially created pipeline templates can be used to cover all available resources after $f$ or fewer simultaneous failures, thereby avoiding resource idling at all times. Evaluation on large DNN models with billions of parameters shows that Oobleck provides consistently high throughput, and it outperforms state-of-the-art fault tolerance solutions like Bamboo and Varuna by up to $13.9\times$.

* SOSP'23 | Camera-ready
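The coverage property stated in the abstract (some combination of templates uses every available node while keeping at least $f+1$ replicas) can be illustrated with a toy check. This is not Oobleck's planner: the function `cover_nodes`, the idea of describing a template only by its node count, and the sizes used below are assumptions made for this sketch.

```python
from typing import List, Optional


def cover_nodes(template_sizes: List[int], nodes: int,
                min_pipelines: int) -> Optional[List[int]]:
    """Pick template sizes (with repetition) that sum exactly to `nodes`.

    Returns a plan with at least `min_pipelines` pipelines, or None if no
    such plan exists. Sizes are node counts and must be positive.
    """
    if any(size <= 0 for size in template_sizes):
        raise ValueError("template sizes must be positive")
    # best[n] = maximum number of pipelines that cover exactly n nodes
    # (-1 means n nodes cannot be covered exactly).
    best = [-1] * (nodes + 1)
    choice = [0] * (nodes + 1)
    best[0] = 0
    for n in range(1, nodes + 1):
        for size in template_sizes:
            if size <= n and best[n - size] >= 0 and best[n - size] + 1 > best[n]:
                best[n] = best[n - size] + 1
                choice[n] = size
    if best[nodes] < min_pipelines:
        return None
    # Walk the recorded choices back to recover one valid plan.
    plan = []
    n = nodes
    while n > 0:
        plan.append(choice[n])
        n -= choice[n]
    return plan


# Hypothetical setup: templates of 2, 3, or 4 nodes, tolerate f = 1 failure.
f = 1
for surviving in (13, 12):
    print(surviving, cover_nodes([2, 3, 4], surviving, f + 1))
```

Maximizing the pipeline count in the table is a convenience: if any exact cover has at least `min_pipelines` pipelines, the maximum does too, so one pass answers the feasibility question.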

Chasing Low-Carbon Electricity for Practical and Sustainable DNN Training

Mar 04, 2023

Zhenning Yang, Luoxi Meng, Jae-Won Chung, Mosharaf Chowdhury

Abstract: Deep learning has experienced significant growth in recent years, resulting in increased energy consumption and carbon emissions from the use of GPUs for training deep neural networks (DNNs). Answering the call for sustainability, conventional solutions have attempted to move training jobs to locations or time frames with lower carbon intensity. However, moving jobs to other locations may not always be feasible due to large dataset sizes or data regulations. Moreover, postponing training can negatively impact application service quality because the DNNs backing the service are not updated in a timely fashion. In this work, we present a practical solution that reduces the carbon footprint of DNN training without migrating or postponing jobs. Specifically, our solution observes real-time carbon intensity shifts during training and controls the energy consumption of GPUs, thereby reducing carbon footprint while maintaining training performance. Furthermore, in order to proactively adapt to shifting carbon intensity, we propose a lightweight machine learning algorithm that predicts the carbon intensity of the upcoming time frame. Our solution, Chase, reduces the total carbon footprint of training ResNet-50 on ImageNet by 13.6% while only increasing training time by 2.5%.

* ICLR 23 Workshop | https://ml.energy/zeus
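The arithmetic behind the approach in the abstract can be sketched as follows: carbon footprint is energy (kWh) multiplied by carbon intensity (gCO2/kWh), summed over time slots, so drawing less GPU power while intensity is high lowers the total. The function names, the single-threshold policy, and all numbers below are assumptions for illustration, not Chase's controller, and the sketch ignores the training slowdown that a lower power limit causes.

```python
from typing import List


def carbon_footprint(power_watts: List[float],
                     intensity_g_per_kwh: List[float],
                     slot_hours: float = 1.0) -> float:
    """Sum of energy (kWh) times carbon intensity (gCO2/kWh) per time slot."""
    return sum(p / 1000.0 * slot_hours * ci
               for p, ci in zip(power_watts, intensity_g_per_kwh))


def pick_power_limit(intensity: float, low: float = 200.0,
                     high: float = 300.0, threshold: float = 400.0) -> float:
    """Hypothetical policy: use the lower GPU power limit when the grid is dirty."""
    return low if intensity > threshold else high


# Hypothetical hourly carbon intensity readings (gCO2/kWh).
intensity = [300.0, 500.0, 450.0, 350.0]
baseline = carbon_footprint([300.0] * len(intensity), intensity)
adaptive = carbon_footprint([pick_power_limit(ci) for ci in intensity], intensity)
print(f"baseline: {baseline:.1f} gCO2, adaptive: {adaptive:.1f} gCO2")
```

A fuller model would also extend the run when the power limit drops, which is the trade-off the abstract quantifies as a 2.5% increase in training time.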

SLRNet: Semi-Supervised Semantic Segmentation Via Label Reuse for Human Decomposition Images

Feb 24, 2022

Sara Mousavi, Zhenning Yang, Kelley Cross, Dawnie Steadman, Audris Mockus

Abstract: Semantic segmentation is a challenging computer vision task demanding a significant amount of pixel-level annotated data. Producing such data is a time-consuming and costly process, especially for domains with a scarcity of experts, such as medicine or forensic anthropology. While numerous semi-supervised approaches have been developed to make the most of the limited labeled data and ample unlabeled data, domain-specific real-world datasets often have characteristics that both reduce the effectiveness of off-the-shelf state-of-the-art methods and provide opportunities to create new methods that exploit these characteristics. We propose and evaluate a semi-supervised method that reuses available labels for unlabeled images of a dataset by exploiting existing similarities, while dynamically weighting the impact of these reused labels in the training process. We evaluate our method on a large dataset of human decomposition images and find that our method, while conceptually simple, outperforms state-of-the-art consistency and pseudo-labeling-based methods for the segmentation of this dataset. This paper includes graphic content of human decomposition.

Pseudo Pixel-level Labeling for Images with Evolving Content

May 20, 2021

Sara Mousavi, Zhenning Yang, Kelley Cross, Dawnie Steadman, Audris Mockus

Abstract: Annotating images for semantic segmentation requires intense manual labor and is a time-consuming and expensive task, especially for domains with a scarcity of experts, such as forensic anthropology. We leverage the evolving nature of images depicting the decay process in human decomposition data to design a simple yet effective pseudo-pixel-level label generation technique to reduce the amount of effort for manual annotation of such images. We first identify sequences of images with minimal variation that are most suitable to share the same or similar annotation using an unsupervised approach. Given one user-annotated image in each sequence, we propagate the annotation to the remaining images in the sequence by merging it with annotations produced by a state-of-the-art CAM-based pseudo label generation technique. To evaluate the quality of our pseudo-pixel-level labels, we train two semantic segmentation models with VGG and ResNet backbones on images labeled using our pseudo labeling method and those of a state-of-the-art method. The results indicate that using our pseudo-labels instead of those generated using the state-of-the-art method in the training process improves the mean-IoU and the frequency-weighted-IoU of the VGG and ResNet-based semantic segmentation models by 3.36%, 2.58%, 10.39%, and 12.91%, respectively.
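For readers unfamiliar with the two metrics reported above, the following sketch computes mean IoU and frequency-weighted IoU from a confusion matrix using their standard definitions. The function name and the example matrix are assumptions for illustration; the abstract does not describe the authors' evaluation code.

```python
from typing import List, Tuple


def iou_metrics(confusion: List[List[int]]) -> Tuple[float, float]:
    """Return (mean IoU, frequency-weighted IoU).

    confusion[i][j] counts pixels whose true class is i and predicted class
    is j. Classes absent from both truth and prediction are skipped. Assumes
    at least one class has pixels.
    """
    num_classes = len(confusion)
    total = sum(sum(row) for row in confusion)
    ious = []
    weighted = 0.0
    for c in range(num_classes):
        tp = confusion[c][c]
        true_c = sum(confusion[c])                       # pixels labeled c
        pred_c = sum(confusion[r][c] for r in range(num_classes))
        union = true_c + pred_c - tp
        if union == 0:
            continue
        iou = tp / union
        ious.append(iou)
        weighted += (true_c / total) * iou               # weight by class frequency
    return sum(ious) / len(ious), weighted


# Hypothetical two-class confusion matrix (pixel counts).
mean_iou, fw_iou = iou_metrics([[3, 1], [1, 5]])
print(f"mean IoU = {mean_iou:.4f}, frequency-weighted IoU = {fw_iou:.4f}")
```

Mean IoU treats every class equally, while the frequency-weighted variant lets classes that cover more pixels count for more, which is why the two can move by different amounts.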

Conditional Gaussian Distribution Learning for Open Set Recognition

Apr 17, 2020

Xin Sun, Zhenning Yang, Chi Zhang, Guohao Peng, Keck-Voon Ling

Abstract: Deep neural networks have achieved state-of-the-art performance in a wide range of recognition/classification tasks. However, when applying deep learning to real-world applications, there are still multiple challenges. A typical challenge is that unknown samples may be fed into the system during the testing phase and traditional deep neural networks will wrongly recognize the unknown sample as one of the known classes. Open set recognition is a potential solution to overcome this problem, where the open set classifier should have the ability to reject unknown samples as well as maintain high classification accuracy on known classes. The variational auto-encoder (VAE) is a popular model to detect unknowns, but it cannot provide discriminative representations for known classification. In this paper, we propose a novel method, Conditional Gaussian Distribution Learning (CGDL), for open set recognition. In addition to detecting unknown samples, this method can also classify known samples by forcing different latent features to approximate different Gaussian models. Meanwhile, to avoid information hidden in the input vanishing in the middle layers, we also adopt the probabilistic ladder architecture to extract high-level abstract features. Experiments on several standard image datasets reveal that the proposed method significantly outperforms the baseline method and achieves new state-of-the-art results.

* Accepted to CVPR2020. Source code link is provided in the latest version
