Zhenkai Liang

Cosine Misleads: Auxiliary Losses Reshape Vision Language Models, Not Their Latents

Jun 04, 2026

Self-Evaluation Is Already There: Eliciting Latent Judge Calibration in Base LLMs with Minimal Data

Jun 03, 2026

Do LLMs and VLMs Share Neurons for Inference? Evidence and Mechanisms of Cross-Modal Transfer

Feb 22, 2026

Self-Guard: Defending Large Reasoning Models via enhanced self-reflection

Jan 31, 2026

DevOps-Gym: Benchmarking AI Agents in Software DevOps Cycle

Jan 27, 2026

RSafe: Incentivizing proactive reasoning to build robust and adaptive LLM safeguards

Jun 09, 2025

AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint

Jun 08, 2025

OSS-Bench: Benchmark Generator for Coding LLMs

May 18, 2025

AttackSeqBench: Benchmarking Large Language Models' Understanding of Sequential Patterns in Cyber Attacks

Mar 05, 2025

MASKDROID: Robust Android Malware Detection with Masked Graph Representations

Sep 29, 2024