Picture for Tetsuya Sakai

Tetsuya Sakai

OpenDecoder: Open Large Language Model Decoding to Incorporate Document Quality in RAG

Add code
Jan 13, 2026
Viaarxiv icon

e5-omni: Explicit Cross-modal Alignment for Omni-modal Embeddings

Add code
Jan 07, 2026
Viaarxiv icon

Judging with Personality and Confidence: A Study on Personality-Conditioned LLM Relevance Assessment

Add code
Jan 05, 2026
Viaarxiv icon

Diversification as Risk Minimization

Add code
Oct 26, 2025
Viaarxiv icon

LLM-Assisted Relevance Assessments: When Should We Ask LLMs for Help?

Add code
Nov 11, 2024
Viaarxiv icon

CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

Add code
Oct 30, 2024
Figure 1 for CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation
Figure 2 for CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation
Figure 3 for CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation
Figure 4 for CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation
Viaarxiv icon

Data-Efficient Massive Tool Retrieval: A Reinforcement Learning Approach for Query-Tool Alignment with Language Models

Add code
Oct 04, 2024
Figure 1 for Data-Efficient Massive Tool Retrieval: A Reinforcement Learning Approach for Query-Tool Alignment with Language Models
Figure 2 for Data-Efficient Massive Tool Retrieval: A Reinforcement Learning Approach for Query-Tool Alignment with Language Models
Figure 3 for Data-Efficient Massive Tool Retrieval: A Reinforcement Learning Approach for Query-Tool Alignment with Language Models
Figure 4 for Data-Efficient Massive Tool Retrieval: A Reinforcement Learning Approach for Query-Tool Alignment with Language Models
Viaarxiv icon

AI Can Be Cognitively Biased: An Exploratory Study on Threshold Priming in LLM-Based Batch Relevance Assessment

Add code
Sep 24, 2024
Figure 1 for AI Can Be Cognitively Biased: An Exploratory Study on Threshold Priming in LLM-Based Batch Relevance Assessment
Figure 2 for AI Can Be Cognitively Biased: An Exploratory Study on Threshold Priming in LLM-Based Batch Relevance Assessment
Figure 3 for AI Can Be Cognitively Biased: An Exploratory Study on Threshold Priming in LLM-Based Batch Relevance Assessment
Figure 4 for AI Can Be Cognitively Biased: An Exploratory Study on Threshold Priming in LLM-Based Batch Relevance Assessment
Viaarxiv icon

ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models

Add code
Jun 28, 2024
Figure 1 for ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models
Figure 2 for ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models
Figure 3 for ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models
Figure 4 for ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models
Viaarxiv icon

CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models

Add code
May 20, 2024
Viaarxiv icon