Picture for Zhixin Zhang

Zhixin Zhang

Filter-And-Refine: A MLLM Based Cascade System for Industrial-Scale Video Content Moderation

Add code
Jul 23, 2025
Viaarxiv icon

Mitigating Fine-tuning Risks in LLMs via Safety-Aware Probing Optimization

Add code
May 22, 2025
Viaarxiv icon

Stackelberg Game Preference Optimization for Data-Efficient Alignment of Language Models

Add code
Feb 25, 2025
Viaarxiv icon

Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent

Add code
Feb 25, 2025
Figure 1 for Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent
Figure 2 for Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent
Figure 3 for Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent
Figure 4 for Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent
Viaarxiv icon

Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines

Add code
Oct 28, 2024
Figure 1 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Figure 2 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Figure 3 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Figure 4 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Viaarxiv icon

CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification

Add code
Oct 07, 2024
Figure 1 for CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification
Figure 2 for CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification
Figure 3 for CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification
Figure 4 for CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification
Viaarxiv icon

InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions

Add code
Feb 05, 2024
Viaarxiv icon

Online Vectorized HD Map Construction using Geometry

Add code
Dec 06, 2023
Figure 1 for Online Vectorized HD Map Construction using Geometry
Figure 2 for Online Vectorized HD Map Construction using Geometry
Figure 3 for Online Vectorized HD Map Construction using Geometry
Figure 4 for Online Vectorized HD Map Construction using Geometry
Viaarxiv icon

Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery

Add code
Oct 29, 2023
Figure 1 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery
Figure 2 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery
Figure 3 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery
Figure 4 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery
Viaarxiv icon

TransForensics: Image Forgery Localization with Dense Self-Attention

Add code
Aug 09, 2021
Figure 1 for TransForensics: Image Forgery Localization with Dense Self-Attention
Figure 2 for TransForensics: Image Forgery Localization with Dense Self-Attention
Figure 3 for TransForensics: Image Forgery Localization with Dense Self-Attention
Figure 4 for TransForensics: Image Forgery Localization with Dense Self-Attention
Viaarxiv icon