Picture for Yingzi Ma

Yingzi Ma

MaskForge: Structure-Aware Adaptive Attacks for Jailbreaking Diffusion Large Language Models

Add code
Jun 01, 2026
Viaarxiv icon

GeoDrive-Bench: Benchmarking Region-Specific Multimodal Reasoning in Autonomous Driving

Add code
Jun 01, 2026
Viaarxiv icon

SafeGen-Bench: Benchmarking Safety in Image-Conditioned Text-to-Video Generation

Add code
May 31, 2026
Viaarxiv icon

When Are Teacher Tokens Reliable? Position-Weighted On-Policy Self-Distillation for Reasoning

Add code
May 20, 2026
Viaarxiv icon

Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset

Add code
Nov 05, 2024
Figure 1 for Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
Figure 2 for Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
Figure 3 for Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
Figure 4 for Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
Viaarxiv icon

MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA based Mixture of Experts

Add code
Apr 22, 2024
Figure 1 for MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA based Mixture of Experts
Figure 2 for MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA based Mixture of Experts
Figure 3 for MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA based Mixture of Experts
Figure 4 for MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA based Mixture of Experts
Viaarxiv icon

Multi: Multimodal Understanding Leaderboard with Text and Images

Add code
Feb 05, 2024
Figure 1 for Multi: Multimodal Understanding Leaderboard with Text and Images
Figure 2 for Multi: Multimodal Understanding Leaderboard with Text and Images
Figure 3 for Multi: Multimodal Understanding Leaderboard with Text and Images
Figure 4 for Multi: Multimodal Understanding Leaderboard with Text and Images
Viaarxiv icon

Dolphins: Multimodal Language Model for Driving

Add code
Dec 01, 2023
Figure 1 for Dolphins: Multimodal Language Model for Driving
Figure 2 for Dolphins: Multimodal Language Model for Driving
Figure 3 for Dolphins: Multimodal Language Model for Driving
Figure 4 for Dolphins: Multimodal Language Model for Driving
Viaarxiv icon