Picture for Yilei Jiang

Yilei Jiang

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Add code
Jul 30, 2025
Viaarxiv icon

MSR-Align: Policy-Grounded Multimodal Alignment for Safety-Aware Reasoning in Vision-Language Models

Add code
Jun 24, 2025
Viaarxiv icon

Pushing the Limits of Safety: A Technical Report on the ATLAS Challenge 2025

Add code
Jun 14, 2025
Viaarxiv icon

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

Add code
May 27, 2025
Viaarxiv icon

HiddenDetect: Detecting Jailbreak Attacks against Large Vision-Language Models via Monitoring Hidden States

Add code
Feb 21, 2025
Viaarxiv icon

Equilibrate RLHF: Towards Balancing Helpfulness-Safety Trade-off in Large Language Models

Add code
Feb 17, 2025
Viaarxiv icon

RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting

Add code
Dec 25, 2024
Viaarxiv icon

DebiasDiff: Debiasing Text-to-image Diffusion Models with Self-discovering Latent Attribute Directions

Add code
Dec 25, 2024
Viaarxiv icon

Event-Customized Image Generation

Add code
Oct 03, 2024
Figure 1 for Event-Customized Image Generation
Figure 2 for Event-Customized Image Generation
Figure 3 for Event-Customized Image Generation
Figure 4 for Event-Customized Image Generation
Viaarxiv icon