Picture for Cong Hu

Cong Hu

FB-CLIP: Fine-Grained Zero-Shot Anomaly Detection with Foreground-Background Disentanglement

Add code
Mar 20, 2026
Viaarxiv icon

Multi-Paradigm Collaborative Adversarial Attack Against Multi-Modal Large Language Models

Add code
Mar 05, 2026
Viaarxiv icon

Towards Highly Transferable Vision-Language Attack via Semantic-Augmented Dynamic Contrastive Interaction

Add code
Mar 05, 2026
Viaarxiv icon

Conditional Variational Autoencoder for Sign Language Translation with Cross-Modal Alignment

Add code
Dec 25, 2023
Figure 1 for Conditional Variational Autoencoder for Sign Language Translation with Cross-Modal Alignment
Figure 2 for Conditional Variational Autoencoder for Sign Language Translation with Cross-Modal Alignment
Figure 3 for Conditional Variational Autoencoder for Sign Language Translation with Cross-Modal Alignment
Figure 4 for Conditional Variational Autoencoder for Sign Language Translation with Cross-Modal Alignment
Viaarxiv icon

SDA-$x$Net: Selective Depth Attention Networks for Adaptive Multi-scale Feature Representation

Add code
Sep 21, 2022
Figure 1 for SDA-$x$Net: Selective Depth Attention Networks for Adaptive Multi-scale Feature Representation
Figure 2 for SDA-$x$Net: Selective Depth Attention Networks for Adaptive Multi-scale Feature Representation
Figure 3 for SDA-$x$Net: Selective Depth Attention Networks for Adaptive Multi-scale Feature Representation
Figure 4 for SDA-$x$Net: Selective Depth Attention Networks for Adaptive Multi-scale Feature Representation
Viaarxiv icon

ConSLT: A Token-level Contrastive Framework for Sign Language Translation

Add code
Apr 11, 2022
Figure 1 for ConSLT: A Token-level Contrastive Framework for Sign Language Translation
Figure 2 for ConSLT: A Token-level Contrastive Framework for Sign Language Translation
Figure 3 for ConSLT: A Token-level Contrastive Framework for Sign Language Translation
Figure 4 for ConSLT: A Token-level Contrastive Framework for Sign Language Translation
Viaarxiv icon

Dual Encoder-Decoder based Generative Adversarial Networks for Disentangled Facial Representation Learning

Add code
Sep 19, 2019
Figure 1 for Dual Encoder-Decoder based Generative Adversarial Networks for Disentangled Facial Representation Learning
Figure 2 for Dual Encoder-Decoder based Generative Adversarial Networks for Disentangled Facial Representation Learning
Figure 3 for Dual Encoder-Decoder based Generative Adversarial Networks for Disentangled Facial Representation Learning
Figure 4 for Dual Encoder-Decoder based Generative Adversarial Networks for Disentangled Facial Representation Learning
Viaarxiv icon

Combining Subgoal Graphs with Reinforcement Learning to Build a Rational Pathfinder

Add code
Nov 05, 2018
Figure 1 for Combining Subgoal Graphs with Reinforcement Learning to Build a Rational Pathfinder
Figure 2 for Combining Subgoal Graphs with Reinforcement Learning to Build a Rational Pathfinder
Figure 3 for Combining Subgoal Graphs with Reinforcement Learning to Build a Rational Pathfinder
Figure 4 for Combining Subgoal Graphs with Reinforcement Learning to Build a Rational Pathfinder
Viaarxiv icon