Picture for Dadi Guo

Dadi Guo

MATP-BENCH: Can MLLM Be a Good Automated Theorem Prover for Multimodal Problems?

Add code
Jun 06, 2025
Viaarxiv icon

AIDBench: A benchmark for evaluating the authorship identification capability of large language models

Add code
Nov 20, 2024
Viaarxiv icon

Federated Domain-Specific Knowledge Transfer on Large Language Models Using Synthetic Data

Add code
May 23, 2024
Viaarxiv icon

P-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models

Add code
Nov 07, 2023
Viaarxiv icon

Multi-step Jailbreaking Privacy Attacks on ChatGPT

Add code
Apr 11, 2023
Viaarxiv icon