Picture for Ruotian Ma

Ruotian Ma

Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training

Add code
May 20, 2025
Viaarxiv icon

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

Add code
May 01, 2025
Viaarxiv icon

SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning

Add code
Apr 27, 2025
Viaarxiv icon

S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Add code
Feb 18, 2025
Viaarxiv icon

Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models

Add code
Feb 13, 2025
Viaarxiv icon

Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs

Add code
Oct 20, 2024
Viaarxiv icon

Are Large Language Models Good Prompt Optimizers?

Add code
Feb 03, 2024
Figure 1 for Are Large Language Models Good Prompt Optimizers?
Figure 2 for Are Large Language Models Good Prompt Optimizers?
Figure 3 for Are Large Language Models Good Prompt Optimizers?
Figure 4 for Are Large Language Models Good Prompt Optimizers?
Viaarxiv icon

Making Harmful Behaviors Unlearnable for Large Language Models

Add code
Nov 02, 2023
Viaarxiv icon

Cross-Linguistic Syntactic Difference in Multilingual BERT: How Good is It and How Does It Affect Transfer?

Add code
Dec 21, 2022
Viaarxiv icon

Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER

Add code
Oct 10, 2022
Figure 1 for Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER
Figure 2 for Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER
Figure 3 for Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER
Figure 4 for Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER
Viaarxiv icon