Shijin Wang

Evaluating LLMs Across Multi-Cognitive Levels: From Medical Knowledge Mastery to Scenario-Based Problem Solving

Jun 10, 2025

CogMath: Assessing LLMs' Authentic Mathematical Ability from a Human Cognitive Perspective

Jun 04, 2025

MMATH: A Multilingual Benchmark for Mathematical Reasoning

May 25, 2025

How Does Sequence Modeling Architecture Influence Base Capabilities of Pre-trained Language Models? Exploring Key Architecture Design Principles to Avoid Base Capabilities Degradation

May 24, 2025

am-ELO: A Stable Framework for Arena-based LLM Evaluation

May 06, 2025

MiMu: Mitigating Multiple Shortcut Learning Behavior of Transformers

Apr 14, 2025

Chain of Strategy Optimization Makes Large Language Models Better Emotional Supporter

Mar 07, 2025

NLP-AKG: Few-Shot Construction of NLP Academic Knowledge Graph Based on LLM

Feb 20, 2025

Ontology-Guided Reverse Thinking Makes Large Language Models Stronger on Knowledge Graph Question Answering

Feb 17, 2025

StyleBlend: Enhancing Style-Specific Content Creation in Text-to-Image Diffusion Models

Feb 13, 2025