Quan Zhang

Can MLLMs Guide Weakly-Supervised Temporal Action Localization Tasks?

Nov 13, 2024
Via arXiv

Development of a Simple and Novel Digital Twin Framework for Industrial Robots in Intelligent robotics manufacturing

Oct 19, 2024
Via arXiv

A Novel Approach to Grasping Control of Soft Robotic Grippers based on Digital Twin

Oct 19, 2024
Via arXiv

GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization

Jun 24, 2024
Via arXiv

Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-task Attribution Map

May 24, 2024
Via arXiv

Human-Imperceptible Retrieval Poisoning Attacks in LLM-Powered Applications

Apr 26, 2024
Via arXiv

When Fuzzing Meets LLMs: Challenges and Opportunities

Apr 25, 2024
Via arXiv

Distilling Semantic Priors from SAM to Efficient Image Restoration Models

Apr 02, 2024
Via arXiv

View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network

Mar 21, 2024
Via arXiv

MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object Detection

Feb 18, 2024
Via arXiv