Picture for Hongzhi Zhang

Hongzhi Zhang

DynTok: Dynamic Compression of Visual Tokens for Efficient and Effective Video Understanding

Add code
Jun 04, 2025
Viaarxiv icon

TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos

Add code
May 26, 2025
Viaarxiv icon

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

Add code
May 26, 2025
Viaarxiv icon

Clapper: Compact Learning and Video Representation in VLMs

Add code
May 21, 2025
Viaarxiv icon

Pseudo-Label Guided Real-World Image De-weathering: A Learning Framework with Imperfect Supervision

Add code
Apr 14, 2025
Viaarxiv icon

Routing to the Right Expertise: A Trustworthy Judge for Instruction-based Image Editing

Add code
Apr 10, 2025
Viaarxiv icon

Small Molecule Drug Discovery Through Deep Learning:Progress, Challenges, and Opportunities

Add code
Feb 13, 2025
Viaarxiv icon

UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity

Add code
Dec 28, 2024
Figure 1 for UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity
Figure 2 for UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity
Figure 3 for UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity
Figure 4 for UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity
Viaarxiv icon

VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models

Add code
Oct 02, 2024
Figure 1 for VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models
Figure 2 for VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models
Figure 3 for VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models
Figure 4 for VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models
Viaarxiv icon

Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector

Add code
Jun 17, 2024
Figure 1 for Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Figure 2 for Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Figure 3 for Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Figure 4 for Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Viaarxiv icon