Picture for Min Zhang

Min Zhang

Jake

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization

Add code
May 25, 2025
Figure 1 for VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization
Figure 2 for VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization
Figure 3 for VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization
Figure 4 for VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization
Viaarxiv icon

Knowledge Grafting of Large Language Models

Add code
May 24, 2025
Viaarxiv icon

Neural Parameter Search for Slimmer Fine-Tuned Models and Better Transfer

Add code
May 24, 2025
Viaarxiv icon

Wolf Hidden in Sheep's Conversations: Toward Harmless Data-Based Backdoor Attacks for Jailbreaking Large Language Models

Add code
May 23, 2025
Viaarxiv icon

GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent

Add code
May 22, 2025
Viaarxiv icon

MTSA: Multi-turn Safety Alignment for LLMs through Multi-round Red-teaming

Add code
May 22, 2025
Viaarxiv icon

Dynamic Sampling that Adapts: Iterative DPO for Self-Aware Mathematical Reasoning

Add code
May 22, 2025
Viaarxiv icon

Multi-Modality Expansion and Retention for LLMs through Parameter Merging and Decoupling

Add code
May 21, 2025
Viaarxiv icon

Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors

Add code
May 21, 2025
Figure 1 for Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
Figure 2 for Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
Figure 3 for Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
Figure 4 for Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
Viaarxiv icon

Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification

Add code
May 19, 2025
Figure 1 for Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
Figure 2 for Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
Figure 3 for Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
Figure 4 for Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
Viaarxiv icon