Picture for Xinming Tu

Xinming Tu

Minta

BenchGuard: Who Guards the Benchmarks? Automated Auditing of LLM Agent Benchmarks

Add code
Apr 27, 2026
Viaarxiv icon

An Evaluation of Large Language Models in Bioinformatics Research

Add code
Feb 21, 2024
Figure 1 for An Evaluation of Large Language Models in Bioinformatics Research
Figure 2 for An Evaluation of Large Language Models in Bioinformatics Research
Figure 3 for An Evaluation of Large Language Models in Bioinformatics Research
Figure 4 for An Evaluation of Large Language Models in Bioinformatics Research
Viaarxiv icon

What Should Data Science Education Do with Large Language Models?

Add code
Jul 07, 2023
Figure 1 for What Should Data Science Education Do with Large Language Models?
Figure 2 for What Should Data Science Education Do with Large Language Models?
Figure 3 for What Should Data Science Education Do with Large Language Models?
Figure 4 for What Should Data Science Education Do with Large Language Models?
Viaarxiv icon