Picture for Zhi-Qi Cheng

Zhi-Qi Cheng

HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation

Add code
May 15, 2025
Viaarxiv icon

Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions

Add code
Apr 16, 2025
Viaarxiv icon

Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models

Add code
Apr 10, 2025
Viaarxiv icon

HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard

Add code
Mar 18, 2025
Viaarxiv icon

MaxSup: Overcoming Representation Collapse in Label Smoothing

Add code
Feb 18, 2025
Viaarxiv icon

A Video-grounded Dialogue Dataset and Metric for Event-driven Activities

Add code
Jan 30, 2025
Figure 1 for A Video-grounded Dialogue Dataset and Metric for Event-driven Activities
Figure 2 for A Video-grounded Dialogue Dataset and Metric for Event-driven Activities
Figure 3 for A Video-grounded Dialogue Dataset and Metric for Event-driven Activities
Figure 4 for A Video-grounded Dialogue Dataset and Metric for Event-driven Activities
Viaarxiv icon

UCDR-Adapter: Exploring Adaptation of Pre-Trained Vision-Language Models for Universal Cross-Domain Retrieval

Add code
Dec 14, 2024
Figure 1 for UCDR-Adapter: Exploring Adaptation of Pre-Trained Vision-Language Models for Universal Cross-Domain Retrieval
Figure 2 for UCDR-Adapter: Exploring Adaptation of Pre-Trained Vision-Language Models for Universal Cross-Domain Retrieval
Figure 3 for UCDR-Adapter: Exploring Adaptation of Pre-Trained Vision-Language Models for Universal Cross-Domain Retrieval
Figure 4 for UCDR-Adapter: Exploring Adaptation of Pre-Trained Vision-Language Models for Universal Cross-Domain Retrieval
Viaarxiv icon

StableAnimator: High-Quality Identity-Preserving Human Image Animation

Add code
Nov 26, 2024
Viaarxiv icon

ProMQA: Question Answering Dataset for Multimodal Procedural Activity Understanding

Add code
Oct 29, 2024
Viaarxiv icon

Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios

Add code
Oct 22, 2024
Figure 1 for Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
Figure 2 for Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
Figure 3 for Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
Figure 4 for Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
Viaarxiv icon