Picture for Mike Kroutikov

Mike Kroutikov

Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations

Add code
Apr 15, 2024
Viaarxiv icon