Chien-Sheng Wu

SiReRAG: Indexing Similar and Related Information for Multihop Reasoning

Dec 09, 2024
Via arXiv

CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments

Nov 04, 2024
Via arXiv

Evaluating Cultural and Social Awareness of LLM Web Agents

Oct 30, 2024
Via arXiv

Distill-SynthKG: Distilling Knowledge Graph Synthesis Workflow for Improved Coverage and Efficiency

Oct 22, 2024
Via arXiv

Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage

Oct 20, 2024
Via arXiv

ReIFE: Re-evaluating Instruction-Following Evaluation

Oct 09, 2024
Via arXiv

ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement

Oct 03, 2024
Via arXiv

Can AI writing be salvaged? Mitigating Idiosyncrasies and Improving Human-AI Alignment in the Writing Process through Edits

Sep 26, 2024
Via arXiv

Shared Imagination: LLMs Hallucinate Alike

Jul 23, 2024
Via arXiv

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Jul 01, 2024
Via arXiv