Picture for Zhifang Sui

Zhifang Sui

Reinforcement Pre-Training

Add code
Jun 09, 2025
Viaarxiv icon

HauntAttack: When Attack Follows Reasoning as a Shadow

Add code
Jun 08, 2025
Viaarxiv icon

Towards Harmonized Uncertainty Estimation for Large Language Models

Add code
May 25, 2025
Viaarxiv icon

RICo: Refined In-Context Contribution for Automatic Instruction-Tuning Data Selection

Add code
May 18, 2025
Viaarxiv icon

SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning

Add code
May 16, 2025
Viaarxiv icon

ICon: In-Context Contribution for Automatic Data Selection

Add code
May 08, 2025
Viaarxiv icon

Chain-of-Thought Tokens are Computer Program Variables

Add code
May 08, 2025
Viaarxiv icon

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

Add code
Mar 31, 2025
Viaarxiv icon

KnowLogic: A Benchmark for Commonsense Reasoning via Knowledge-Driven Data Synthesis

Add code
Mar 08, 2025
Viaarxiv icon

Be a Multitude to Itself: A Prompt Evolution Framework for Red Teaming

Add code
Feb 22, 2025
Viaarxiv icon