
Valerie Chen

The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents

Nov 05, 2025
Via arXiv

Completion $\neq$ Collaboration: Scaling Collaborative Effort with Agents

Oct 30, 2025
Via arXiv

Beyond Memorization: Mapping the Originality-Quality Frontier of Language Models

Apr 13, 2025
Via arXiv

The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers

Apr 03, 2024
Via arXiv

Do LLMs exhibit human-like response biases? A case study in survey design

Nov 07, 2023
Via arXiv

AdvisingNets: Learning to Distinguish Correct and Wrong Classifications via Nearest-Neighbor Explanations

Aug 25, 2023
Via arXiv

FeedbackLogs: Recording and Incorporating Stakeholder Feedback into Machine Learning Pipelines

Jul 28, 2023
Via arXiv

Learning Personalized Decision Support Policies

Apr 13, 2023
Via arXiv

Assisting Human Decisions in Document Matching

Feb 16, 2023
Via arXiv

A Case Study on Designing Evaluations of ML Explanations with Simulated User Studies

Feb 15, 2023
Via arXiv