Picture for Tianxiong Zhong

Tianxiong Zhong

Diffusing in the Right Space: A Systematic Study of Latent Diffusability

Add code
Jun 02, 2026
Viaarxiv icon

VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization

Add code
Jun 01, 2026
Viaarxiv icon

VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption

Add code
May 17, 2025
Figure 1 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Figure 2 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Figure 3 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Figure 4 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Viaarxiv icon

VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing

Add code
Nov 22, 2024
Figure 1 for VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Figure 2 for VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Figure 3 for VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Figure 4 for VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Viaarxiv icon

Preprocessing Enhanced Image Compression for Machine Vision

Add code
Jun 12, 2022
Figure 1 for Preprocessing Enhanced Image Compression for Machine Vision
Figure 2 for Preprocessing Enhanced Image Compression for Machine Vision
Figure 3 for Preprocessing Enhanced Image Compression for Machine Vision
Figure 4 for Preprocessing Enhanced Image Compression for Machine Vision
Viaarxiv icon