Dong Zhang

Decoupled Proxy Alignment: Mitigating Language Prior Conflict for Multimodal Alignment in MLLM

Sep 18, 2025
Via arXiv

UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets

Sep 18, 2025
Via arXiv

Resource-Efficient Glioma Segmentation on Sub-Saharan MRI

Sep 11, 2025
Via arXiv

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Aug 25, 2025
Via arXiv

Diffusion-Guided Knowledge Distillation for Weakly-Supervised Low-Light Semantic Segmentation

Jul 10, 2025
Via arXiv

Relaxation-Free Min-k-Partition for PCI Assignment in 5G Networks

Jun 12, 2025
Via arXiv

R-Genie: Reasoning-Guided Generative Image Editing

May 23, 2025
Via arXiv

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

May 12, 2025
Via arXiv

Divide-and-Conquer: Cold-Start Bundle Recommendation via Mixture of Diffusion Experts

May 08, 2025
Via arXiv

ExFace: Expressive Facial Control for Humanoid Robots with Diffusion Transformers and Bootstrap Training

Apr 20, 2025
Via arXiv