Jiaao Chen

WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning

May 28, 2025
Via arXiv

MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems

May 22, 2025
Via arXiv

Position: Standard Benchmarks Fail -- LLM Agents Present Overlooked Risks for Financial Applications

Feb 21, 2025
Via arXiv

Dynamic Skill Adaptation for Large Language Models

Dec 26, 2024
Via arXiv

Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review

Dec 02, 2024
Via arXiv

DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph

Jun 25, 2024
Via arXiv

From Scroll to Misbelief: Modeling the Unobservable Susceptibility to Misinformation on Social Media

Nov 16, 2023
Via arXiv

Unlearn What You Want to Forget: Efficient Unlearning for LLMs

Oct 31, 2023
Via arXiv

DyVal: Graph-informed Dynamic Evaluation of Large Language Models

Oct 05, 2023
Via arXiv

Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models

Aug 14, 2023
Via arXiv