Picture for Kaixun Jiang

Kaixun Jiang

DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models

Add code
May 14, 2026
Viaarxiv icon

Unified Multimodal Visual Tracking with Dual Mixture-of-Experts

Add code
May 05, 2026
Viaarxiv icon

AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation

Add code
Mar 31, 2026
Viaarxiv icon

GenAgent: Scaling Text-to-Image Generation via Agentic Multimodal Reasoning

Add code
Jan 26, 2026
Viaarxiv icon

RSAgent: Learning to Reason and Act for Text-Guided Segmentation via Multi-Turn Tool Invocations

Add code
Dec 30, 2025
Viaarxiv icon

ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement

Add code
Dec 15, 2025
Viaarxiv icon

Seeing is Believing: Rich-Context Hallucination Detection for MLLMs via Backward Visual Grounding

Add code
Nov 15, 2025
Viaarxiv icon

Improving Multimodal Sentiment Analysis via Modality Optimization and Dynamic Primary Modality Selection

Add code
Nov 14, 2025
Viaarxiv icon

LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops

Add code
Jun 17, 2025
Viaarxiv icon

VideoPure: Diffusion-based Adversarial Purification for Video Recognition

Add code
Jan 25, 2025
Figure 1 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Figure 2 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Figure 3 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Figure 4 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Viaarxiv icon