Rikuto Tsuchida

How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses

Feb 19, 2026

Kan Watanabe, Rikuto Tsuchida, Takahiro Monno, Bin Huang, Kazuma Yamasaki, Youmei Fan, Kazumasa Shimari, Kenichi Matsumoto

Abstract: The rapid adoption of large language models has led to the emergence of AI coding agents that autonomously create pull requests on GitHub. However, how these agents differ in their pull request description characteristics, and how human reviewers respond to them, remains underexplored. In this study, we conduct an empirical analysis of pull requests created by five AI coding agents using the AIDev dataset. We analyze agent differences in pull request description characteristics, including structural features, and examine human reviewer response in terms of review activity, response timing, sentiment, and merge outcomes. We find that AI coding agents exhibit distinct PR description styles, which are associated with differences in reviewer engagement, response time, and merge outcomes. We observe notable variation across agents in both reviewer interaction metrics and merge rates. These findings highlight the role of pull request presentation and reviewer interaction dynamics in human-AI collaborative software development.

Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation

Mar 18, 2025

Rikuto Tsuchida, Hibiki Yokoyama, Takehito Utsuro

Abstract: The purpose of this paper is to examine whether large language models (LLMs) can understand what is good and evil with respect to judging the good/evil reputation of celebrities. Specifically, we first apply a large language model (namely, ChatGPT) to the task of collecting sentences that mention the target celebrity from articles about celebrities on Web pages. Next, ChatGPT categorizes the collected sentences based on their contents and assigns a category name to each category. These assigned category names are referred to as "aspects" of each celebrity. Then, by applying the framework of retrieval augmented generation (RAG), we show that the large language model is quite effective in the task of judging the good/evil reputation of aspects and descriptions of each celebrity. Finally, to demonstrate the advantages of the proposed method over existing services incorporating RAG functions, we show that the proposed method of judging the good/evil of aspects and descriptions of each celebrity significantly outperforms an existing service incorporating RAG functions.
