Picture for Dingkang Yang

Dingkang Yang

SAIL-Embedding Technical Report: Omni-modal Embedding Foundation Model

Add code
Oct 14, 2025
Viaarxiv icon

MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs

Add code
Oct 02, 2025
Viaarxiv icon

SAIL-VL2 Technical Report

Add code
Sep 18, 2025
Viaarxiv icon

PersonaAnimator: Personalized Motion Transfer from Unconstrained Videos

Add code
Aug 27, 2025
Viaarxiv icon

eMotions: A Large-Scale Dataset and Audio-Visual Fusion Network for Emotion Analysis in Short-form Videos

Add code
Aug 09, 2025
Viaarxiv icon

SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement

Add code
Jul 02, 2025
Figure 1 for SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement
Figure 2 for SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement
Figure 3 for SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement
Figure 4 for SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement
Viaarxiv icon

LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops

Add code
Jun 17, 2025
Viaarxiv icon

AdaLRS: Loss-Guided Adaptive Learning Rate Search for Efficient Foundation Model Pretraining

Add code
Jun 16, 2025
Viaarxiv icon

DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding

Add code
May 23, 2025
Viaarxiv icon

MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation

Add code
May 05, 2025
Viaarxiv icon