Dandan Guo

Senior Member, IEEE

Mitigating Reward Hacking in RLHF via Bayesian Non-negative Reward Modeling

Feb 11, 2026, via arXiv

Calibrating Tabular Anomaly Detection via Optimal Transport

Feb 06, 2026, via arXiv

Safeguarding LLM Fine-tuning via Push-Pull Distributional Alignment

Jan 12, 2026, via arXiv

PDR: A Plug-and-Play Positional Decay Framework for LLM Pre-training Data Detection

Jan 11, 2026, via arXiv

A Guardrail for Safety Preservation: When Safety-Sensitive Subspace Meets Harmful-Resistant Null-Space

Oct 16, 2025, via arXiv

Deep Neural Network Calibration by Reducing Classifier Shift with Stochastic Masking

Aug 12, 2025, via arXiv

Merging Smarter, Generalizing Better: Enhancing Model Merging on OOD Data

Jun 10, 2025, via arXiv

LLM Meeting Decision Trees on Tabular Data

May 23, 2025, via arXiv

Beyond Words: Augmenting Discriminative Richness via Diffusions in Unsupervised Prompt Learning

Apr 16, 2025, via arXiv

Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration

Apr 14, 2025, via arXiv