Picture for Feng Zheng

Feng Zheng

Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models

Add code
Jul 26, 2023
Viaarxiv icon

LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad

Add code
Jul 23, 2023
Viaarxiv icon

K-Space-Aware Cross-Modality Score for Synthesized Neuroimage Quality Assessment

Add code
Jul 10, 2023
Viaarxiv icon

LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning

Add code
Jun 17, 2023
Viaarxiv icon

Extremely large-scale Array Systems: Near-Filed Codebook Design and Performance Analysis

Add code
Jun 02, 2023
Viaarxiv icon

CageViT: Convolutional Activation Guided Efficient Vision Transformer

Add code
May 17, 2023
Viaarxiv icon

Track Anything: Segment Anything Meets Videos

Add code
Apr 28, 2023
Viaarxiv icon

Can Decentralized Stochastic Minimax Optimization Algorithms Converge Linearly for Finite-Sum Nonconvex-Nonconcave Problems?

Add code
Apr 24, 2023
Viaarxiv icon

What makes a good data augmentation for few-shot unsupervised image anomaly detection?

Add code
Apr 21, 2023
Viaarxiv icon

Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline

Add code
Mar 24, 2023
Figure 1 for Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline
Figure 2 for Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline
Figure 3 for Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline
Figure 4 for Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline
Viaarxiv icon