Picture for Tetsuya Sakai

Tetsuya Sakai

ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models

Add code
Jun 28, 2024
Viaarxiv icon

CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models

Add code
May 20, 2024
Viaarxiv icon

Vector Quantization for Recommender Systems: A Review and Outlook

Add code
May 06, 2024
Viaarxiv icon

ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval

Add code
Apr 21, 2024
Viaarxiv icon

Decoy Effect In Search Interaction: Understanding User Behavior and Measuring System Vulnerability

Add code
Mar 27, 2024
Figure 1 for Decoy Effect In Search Interaction: Understanding User Behavior and Measuring System Vulnerability
Figure 2 for Decoy Effect In Search Interaction: Understanding User Behavior and Measuring System Vulnerability
Figure 3 for Decoy Effect In Search Interaction: Understanding User Behavior and Measuring System Vulnerability
Figure 4 for Decoy Effect In Search Interaction: Understanding User Behavior and Measuring System Vulnerability
Viaarxiv icon

Decoy Effect in Search Interaction: A Pilot Study

Add code
Nov 04, 2023
Figure 1 for Decoy Effect in Search Interaction: A Pilot Study
Figure 2 for Decoy Effect in Search Interaction: A Pilot Study
Figure 3 for Decoy Effect in Search Interaction: A Pilot Study
Figure 4 for Decoy Effect in Search Interaction: A Pilot Study
Viaarxiv icon

EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval

Add code
Oct 02, 2023
Figure 1 for EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval
Figure 2 for EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval
Figure 3 for EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval
Figure 4 for EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval
Viaarxiv icon

Open-Domain Dialogue Quality Evaluation: Deriving Nugget-level Scores from Turn-level Scores

Add code
Sep 30, 2023
Figure 1 for Open-Domain Dialogue Quality Evaluation: Deriving Nugget-level Scores from Turn-level Scores
Figure 2 for Open-Domain Dialogue Quality Evaluation: Deriving Nugget-level Scores from Turn-level Scores
Figure 3 for Open-Domain Dialogue Quality Evaluation: Deriving Nugget-level Scores from Turn-level Scores
Figure 4 for Open-Domain Dialogue Quality Evaluation: Deriving Nugget-level Scores from Turn-level Scores
Viaarxiv icon

Towards Consistency Filtering-Free Unsupervised Learning for Dense Retrieval

Add code
Aug 05, 2023
Figure 1 for Towards Consistency Filtering-Free Unsupervised Learning for Dense Retrieval
Figure 2 for Towards Consistency Filtering-Free Unsupervised Learning for Dense Retrieval
Figure 3 for Towards Consistency Filtering-Free Unsupervised Learning for Dense Retrieval
Figure 4 for Towards Consistency Filtering-Free Unsupervised Learning for Dense Retrieval
Viaarxiv icon

A Meta-Evaluation of C/W/L/A Metrics: System Ranking Similarity, System Ranking Consistency and Discriminative Power

Add code
Jul 06, 2023
Figure 1 for A Meta-Evaluation of C/W/L/A Metrics: System Ranking Similarity, System Ranking Consistency and Discriminative Power
Figure 2 for A Meta-Evaluation of C/W/L/A Metrics: System Ranking Similarity, System Ranking Consistency and Discriminative Power
Figure 3 for A Meta-Evaluation of C/W/L/A Metrics: System Ranking Similarity, System Ranking Consistency and Discriminative Power
Figure 4 for A Meta-Evaluation of C/W/L/A Metrics: System Ranking Similarity, System Ranking Consistency and Discriminative Power
Viaarxiv icon