Picture for Shangqing Tu

Shangqing Tu

R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models

Add code
Jun 17, 2024
Figure 1 for R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models
Figure 2 for R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models
Figure 3 for R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models
Figure 4 for R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models
Viaarxiv icon

Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack

Add code
Jun 17, 2024
Figure 1 for Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack
Figure 2 for Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack
Figure 3 for Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack
Figure 4 for Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack
Viaarxiv icon

DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning

Add code
Jun 06, 2024
Figure 1 for DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning
Figure 2 for DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning
Figure 3 for DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning
Figure 4 for DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning
Viaarxiv icon

A Solution-based LLM API-using Methodology for Academic Information Seeking

Add code
May 24, 2024
Figure 1 for A Solution-based LLM API-using Methodology for Academic Information Seeking
Figure 2 for A Solution-based LLM API-using Methodology for Academic Information Seeking
Figure 3 for A Solution-based LLM API-using Methodology for Academic Information Seeking
Figure 4 for A Solution-based LLM API-using Methodology for Academic Information Seeking
Viaarxiv icon

WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models

Add code
Nov 13, 2023
Figure 1 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Figure 2 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Figure 3 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Figure 4 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Viaarxiv icon

LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts

Add code
Aug 11, 2023
Figure 1 for LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts
Figure 2 for LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts
Figure 3 for LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts
Figure 4 for LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts
Viaarxiv icon

KoLA: Carefully Benchmarking World Knowledge of Large Language Models

Add code
Jun 15, 2023
Figure 1 for KoLA: Carefully Benchmarking World Knowledge of Large Language Models
Figure 2 for KoLA: Carefully Benchmarking World Knowledge of Large Language Models
Figure 3 for KoLA: Carefully Benchmarking World Knowledge of Large Language Models
Figure 4 for KoLA: Carefully Benchmarking World Knowledge of Large Language Models
Viaarxiv icon

ChatLog: Recording and Analyzing ChatGPT Across Time

Add code
Apr 27, 2023
Figure 1 for ChatLog: Recording and Analyzing ChatGPT Across Time
Figure 2 for ChatLog: Recording and Analyzing ChatGPT Across Time
Figure 3 for ChatLog: Recording and Analyzing ChatGPT Across Time
Figure 4 for ChatLog: Recording and Analyzing ChatGPT Across Time
Viaarxiv icon

MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs

Add code
Apr 05, 2023
Figure 1 for MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs
Figure 2 for MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs
Figure 3 for MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs
Figure 4 for MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs
Viaarxiv icon

TWAG: A Topic-Guided Wikipedia Abstract Generator

Add code
Jun 29, 2021
Figure 1 for TWAG: A Topic-Guided Wikipedia Abstract Generator
Figure 2 for TWAG: A Topic-Guided Wikipedia Abstract Generator
Figure 3 for TWAG: A Topic-Guided Wikipedia Abstract Generator
Figure 4 for TWAG: A Topic-Guided Wikipedia Abstract Generator
Viaarxiv icon