Dianbo Sui

ScEdit: Script-based Assessment of Knowledge Editing

May 29, 2025
Via arXiv

Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers

May 26, 2025
Via arXiv

Large Language Models for Planning: A Comprehensive and Systematic Survey

May 26, 2025
Via arXiv

LFTF: Locating First and Then Fine-Tuning for Mitigating Gender Bias in Large Language Models

May 21, 2025
Via arXiv

LLMSR@XLLM25: An Empirical Study of LLM for Structural Reasoning

May 18, 2025
Via arXiv

TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs

Oct 14, 2024
Via arXiv

Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled Data

Oct 10, 2024
Via arXiv

Mitigating Gender Bias in Code Large Language Models via Model Editing

Oct 10, 2024
Via arXiv

HBot: A Chatbot for Healthcare Applications in Traditional Chinese Medicine Based on Human Body 3D Visualization

Aug 01, 2024
Via arXiv

To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models

Jul 02, 2024
Via arXiv