Picture for Jingnan Zheng

Jingnan Zheng

RSafe: Incentivizing proactive reasoning to build robust and adaptive LLM safeguards

Add code
Jun 09, 2025
Viaarxiv icon

Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models

Add code
Nov 19, 2024
Figure 1 for Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models
Figure 2 for Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models
Figure 3 for Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models
Figure 4 for Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models
Viaarxiv icon

MASKDROID: Robust Android Malware Detection with Masked Graph Representations

Add code
Sep 29, 2024
Figure 1 for MASKDROID: Robust Android Malware Detection with Masked Graph Representations
Figure 2 for MASKDROID: Robust Android Malware Detection with Masked Graph Representations
Figure 3 for MASKDROID: Robust Android Malware Detection with Masked Graph Representations
Figure 4 for MASKDROID: Robust Android Malware Detection with Masked Graph Representations
Viaarxiv icon

ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation

Add code
May 23, 2024
Figure 1 for ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
Figure 2 for ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
Figure 3 for ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
Figure 4 for ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
Viaarxiv icon

Robust Collaborative Filtering to Popularity Distribution Shift

Add code
Oct 16, 2023
Figure 1 for Robust Collaborative Filtering to Popularity Distribution Shift
Figure 2 for Robust Collaborative Filtering to Popularity Distribution Shift
Figure 3 for Robust Collaborative Filtering to Popularity Distribution Shift
Figure 4 for Robust Collaborative Filtering to Popularity Distribution Shift
Viaarxiv icon

Invariant Collaborative Filtering to Popularity Distribution Shift

Add code
Feb 13, 2023
Viaarxiv icon