Picture for Baosheng Wang

Baosheng Wang

LLM-Crowdsourced: A Benchmark-Free Paradigm for Mutual Evaluation of Large Language Models

Add code
Jul 30, 2025
Viaarxiv icon

Do Large Language Models Truly Grasp Mathematics? An Empirical Exploration

Add code
Oct 19, 2024
Viaarxiv icon

Foot In The Door: Understanding Large Language Model Jailbreaking via Cognitive Psychology

Add code
Feb 24, 2024
Viaarxiv icon

Self-Deception: Reverse Penetrating the Semantic Firewall of Large Language Models

Add code
Aug 25, 2023
Viaarxiv icon