photo


Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images

Jun 16, 2025
Via arXiv

UltraZoom: Generating Gigapixel Images from Regular Photos

Jun 16, 2025
Via arXiv

WildCAT3D: Appearance-Aware Multi-View Diffusion in the Wild

Jun 16, 2025
Via arXiv

Restoring Gaussian Blurred Face Images for Deanonymization Attacks

Jun 14, 2025
Via arXiv

CLIP the Landscape: Automated Tagging of Crowdsourced Landscape Images

Jun 13, 2025
Via arXiv

Deep Learning-based Multi Project InP Wafer Simulation for Unsupervised Surface Defect Detection

Jun 12, 2025
Via arXiv

Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning

Jun 12, 2025
Via arXiv

Using Sign Language Production as Data Augmentation to enhance Sign Language Translation

Jun 11, 2025
Via arXiv

SAMSelect: A Spectral Index Search for Marine Debris Visualization using Segment Anything

Jun 10, 2025
Via arXiv

SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents

Jun 05, 2025
Via arXiv