Shizhu He

Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns

May 29, 2025

Amplify Adjacent Token Differences: Enhancing Long Chain-of-Thought Reasoning with Shift-FFN

May 22, 2025

Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention

May 21, 2025

Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in Large Language Models

May 20, 2025

Better wit than wealth: Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement

Mar 31, 2025

Probabilistic Uncertain Reward Model: A Natural Generalization of Bradley-Terry Reward Model

Mar 28, 2025

GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks

Feb 20, 2025

DATA: Decomposed Attention-based Task Adaptation for Rehearsal-Free Continual Learning

Feb 17, 2025

VaiBot: Shuttle Between the Instructions and Parameters

Feb 04, 2025

LLaSA: Large Language and Structured Data Assistant

Nov 16, 2024