Yisi Sang

SENTINEL: Failure-Driven Reinforcement Learning for Training Tool-Using Language Model Agents

Jun 11, 2026

Adaptive Auto-Harness: Sustained Self-Improvement for Agentic System Deployment on Open-Ended Task Streams

Jun 01, 2026

Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents

May 28, 2026

Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents

Jan 28, 2026

ODKE+: Ontology-Guided Open-Domain Knowledge Extraction with LLMs

Sep 04, 2025

ConvKGYarn: Spinning Configurable and Scalable Conversational Knowledge Graph QA datasets with Large Language Models

Aug 12, 2024

APE: Active Learning-based Tooling for Finding Informative Few-shot Examples for LLM-based Entity Matching

Jul 29, 2024

FLEEK: Factual Error Detection and Correction with Evidence Retrieved from External Knowledge

Oct 26, 2023

Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind

Nov 09, 2022

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

Jun 24, 2022