Picture for Shichao Sun

Shichao Sun

RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation

Add code
Aug 15, 2024
Figure 1 for RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Figure 2 for RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Figure 3 for RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Figure 4 for RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Viaarxiv icon

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Add code
Aug 13, 2024
Figure 1 for OpenResearcher: Unleashing AI for Accelerated Scientific Research
Figure 2 for OpenResearcher: Unleashing AI for Accelerated Scientific Research
Figure 3 for OpenResearcher: Unleashing AI for Accelerated Scientific Research
Figure 4 for OpenResearcher: Unleashing AI for Accelerated Scientific Research
Viaarxiv icon

FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models

Add code
Jul 01, 2024
Figure 1 for FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models
Figure 2 for FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models
Figure 3 for FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models
Figure 4 for FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models
Viaarxiv icon

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Add code
Jun 18, 2024
Figure 1 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 2 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 3 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 4 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Viaarxiv icon

Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization

Add code
Jun 01, 2024
Figure 1 for Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization
Figure 2 for Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization
Figure 3 for Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization
Figure 4 for Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization
Viaarxiv icon

Dissecting Human and LLM Preferences

Add code
Feb 17, 2024
Figure 1 for Dissecting Human and LLM Preferences
Figure 2 for Dissecting Human and LLM Preferences
Figure 3 for Dissecting Human and LLM Preferences
Figure 4 for Dissecting Human and LLM Preferences
Viaarxiv icon

The Critique of Critique

Add code
Jan 09, 2024
Figure 1 for The Critique of Critique
Figure 2 for The Critique of Critique
Figure 3 for The Critique of Critique
Figure 4 for The Critique of Critique
Viaarxiv icon

Evolving Large Language Model Assistant with Long-Term Conditional Memory

Add code
Dec 22, 2023
Figure 1 for Evolving Large Language Model Assistant with Long-Term Conditional Memory
Figure 2 for Evolving Large Language Model Assistant with Long-Term Conditional Memory
Figure 3 for Evolving Large Language Model Assistant with Long-Term Conditional Memory
Figure 4 for Evolving Large Language Model Assistant with Long-Term Conditional Memory
Viaarxiv icon

Aligning Language Models with Human Preferences via a Bayesian Approach

Add code
Oct 09, 2023
Figure 1 for Aligning Language Models with Human Preferences via a Bayesian Approach
Figure 2 for Aligning Language Models with Human Preferences via a Bayesian Approach
Figure 3 for Aligning Language Models with Human Preferences via a Bayesian Approach
Figure 4 for Aligning Language Models with Human Preferences via a Bayesian Approach
Viaarxiv icon

Generative Judge for Evaluating Alignment

Add code
Oct 09, 2023
Figure 1 for Generative Judge for Evaluating Alignment
Figure 2 for Generative Judge for Evaluating Alignment
Figure 3 for Generative Judge for Evaluating Alignment
Figure 4 for Generative Judge for Evaluating Alignment
Viaarxiv icon