Yue Hu

Artificial Intelligence Lab, Department of Computer Systems Engineering, University of Engineering and Applied Sciences

Watermarking Vision-Language Pre-trained Models for Multi-modal Embedding as a Service

Nov 10, 2023

S2F-NER: Exploring Sequence-to-Forest Generation for Complex Entity Recognition

Oct 29, 2023

EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning

Oct 26, 2023

Exploiting Manifold Structured Data Priors for Improved MR Fingerprinting Reconstruction

Oct 17, 2023

Asynchrony-Robust Collaborative Perception via Bird's Eye View Flow

Oct 09, 2023

Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search

Sep 28, 2023

Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval

Sep 28, 2023

Let's Roll: Synthetic Dataset Analysis for Pedestrian Detection Across Different Shutter Types

Sep 15, 2023

Prompt-based Context- and Domain-aware Pretraining for Vision and Language Navigation

Sep 07, 2023

Towards Fast and Accurate Image-Text Retrieval with Self-Supervised Fine-Grained Alignment

Aug 27, 2023