Picture for Haiwen Hong

Haiwen Hong

Yuvion VL: A Multimodal Foundation Model for Adversarial Content and AI Safety

Add code
Jun 23, 2026
Viaarxiv icon

How LoRA Remembers? A Parametric Memory Law for LLM Finetuning

Add code
May 28, 2026
Viaarxiv icon

Seeing but Not Thinking: Routing Distraction in Multimodal Mixture-of-Experts

Add code
Apr 09, 2026
Viaarxiv icon

Diffusion Probe: Generated Image Result Prediction Using CNN Probes

Add code
Mar 05, 2026
Viaarxiv icon

TC-Padé: Trajectory-Consistent Padé Approximation for Diffusion Acceleration

Add code
Mar 03, 2026
Viaarxiv icon

How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

Add code
Mar 03, 2026
Viaarxiv icon

Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics

Add code
Feb 02, 2026
Viaarxiv icon

YuFeng-XGuard: A Reasoning-Centric, Interpretable, and Flexible Guardrail Model for Large Language Models

Add code
Jan 22, 2026
Viaarxiv icon

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

Add code
Apr 04, 2025
Figure 1 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Figure 2 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Figure 3 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Figure 4 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Viaarxiv icon

One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications

Add code
Dec 26, 2023
Viaarxiv icon