Hayate Iso

Evaluating Bias in LLMs for Job-Resume Matching: Gender, Race, and Education

Mar 24, 2025
Via arXiv

From Single to Multi: How LLMs Hallucinate in Multi-Document Summarization

Oct 17, 2024
Via arXiv

Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data

Oct 15, 2024
Via arXiv

A Blueprint Architecture of Compound AI Systems for Enterprise

Jun 02, 2024
Via arXiv

AmbigNLG: Addressing Task Ambiguity in Instruction for NLG

Feb 27, 2024
Via arXiv

Retrieval Helps or Hurts? A Deeper Dive into the Efficacy of Retrieval Augmentation to Language Models

Feb 21, 2024
Via arXiv

Distilling Large Language Models using Skill-Occupation Graph Context for HR-Related Tasks

Nov 10, 2023
Via arXiv

XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates

Sep 20, 2023
Via arXiv

Less is More for Long Document Summary Evaluation by LLMs

Sep 14, 2023
Via arXiv

Zero-shot Triplet Extraction by Template Infilling

Dec 21, 2022
Via arXiv