Yuqi Zhu

Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis

Apr 27, 2026

StructMem: Structured Memory for Long-Horizon Behavior in LLMs

Apr 23, 2026

LightThinker++: From Reasoning Compression to Memory Management

Apr 04, 2026

OceanGym: A Benchmark Environment for Underwater Embodied Agents

Sep 30, 2025

Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study

Jun 24, 2025

Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey

May 06, 2025

LightThinker: Thinking Step-by-Step Compression

Feb 21, 2025

Bias in Decision-Making for AI's Ethical Dilemmas: A Comparative Study of ChatGPT and Claude

Jan 17, 2025

Consistency of Responses and Continuations Generated by Large Language Models on Social Media

Jan 15, 2025

Showing LLM-Generated Code Selectively Based on Confidence of LLMs

Oct 04, 2024