Picture for Hengqi Guo

Hengqi Guo

Prefix Probing: Lightweight Harmful Content Detection for Large Language Models

Add code
Dec 18, 2025
Viaarxiv icon

N-GLARE: An Non-Generative Latent Representation-Efficient LLM Safety Evaluator

Add code
Nov 18, 2025
Viaarxiv icon