Yuchen Shi

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Sep 26, 2025

FlowAgent: Achieving Compliance and Flexibility for Workflow Agents

Feb 20, 2025

LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

Jan 27, 2025

Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach

Dec 09, 2024

Towards Fault Tolerance in Multi-Agent Reinforcement Learning

Nov 30, 2024

AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction

Sep 03, 2024

CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns

Aug 27, 2024

Subgoal-based Hierarchical Reinforcement Learning for Multi-Agent Collaboration

Aug 21, 2024

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

Aug 07, 2024

Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning

May 09, 2024