Jinchao Zhang

Decompose and Transfer: CoT-Prompting Enhanced Alignment for Open-Vocabulary Temporal Action Detection

Mar 25, 2026
Via arXiv

Evaluating Generative Models via One-Dimensional Code Distributions

Mar 12, 2026
Via arXiv

Manifold-Optimal Guidance: A Unified Riemannian Control View of Diffusion Guidance

Mar 12, 2026
Via arXiv

Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity

Mar 11, 2026
Via arXiv

Tutti: Expressive Multi-Singer Synthesis via Structure-Level Timbre Control and Vocal Texture Modeling

Feb 09, 2026
Via arXiv

Exploring Specular Reflection Inconsistency for Generalizable Face Forgery Detection

Feb 06, 2026
Via arXiv

StyleDecoupler: Generalizable Artistic Style Disentanglement

Jan 25, 2026
Via arXiv

F2RVLM: Boosting Fine-grained Fragment Retrieval for Multi-Modal Long-form Dialogue with Vision Language Model

Aug 25, 2025
Via arXiv

Enhancing Visual Reliance in Text Generation: A Bayesian Perspective on Mitigating Hallucination in Large Vision-Language Models

May 26, 2025
Via arXiv

Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark

Apr 24, 2025
Via arXiv