photo


Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images

Add code
Jun 16, 2025
Figure 1 for Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images
Figure 2 for Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images
Figure 3 for Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images
Figure 4 for Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images
Viaarxiv icon

Restoring Gaussian Blurred Face Images for Deanonymization Attacks

Add code
Jun 14, 2025
Viaarxiv icon

CLIP the Landscape: Automated Tagging of Crowdsourced Landscape Images

Add code
Jun 13, 2025
Viaarxiv icon

Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning

Add code
Jun 12, 2025
Figure 1 for Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning
Figure 2 for Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning
Figure 3 for Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning
Figure 4 for Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning
Viaarxiv icon

Deep Learning-based Multi Project InP Wafer Simulation for Unsupervised Surface Defect Detection

Add code
Jun 12, 2025
Viaarxiv icon

Using Sign Language Production as Data Augmentation to enhance Sign Language Translation

Add code
Jun 11, 2025
Figure 1 for Using Sign Language Production as Data Augmentation to enhance Sign Language Translation
Figure 2 for Using Sign Language Production as Data Augmentation to enhance Sign Language Translation
Figure 3 for Using Sign Language Production as Data Augmentation to enhance Sign Language Translation
Figure 4 for Using Sign Language Production as Data Augmentation to enhance Sign Language Translation
Viaarxiv icon

SAMSelect: A Spectral Index Search for Marine Debris Visualization using Segment Anything

Add code
Jun 10, 2025
Viaarxiv icon

On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images

Add code
Jun 05, 2025
Figure 1 for On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images
Figure 2 for On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images
Figure 3 for On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images
Figure 4 for On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images
Viaarxiv icon

SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents

Add code
Jun 05, 2025
Figure 1 for SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents
Figure 2 for SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents
Figure 3 for SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents
Figure 4 for SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents
Viaarxiv icon

Can Vision Language Models Infer Human Gaze Direction? A Controlled Study

Add code
Jun 04, 2025
Figure 1 for Can Vision Language Models Infer Human Gaze Direction? A Controlled Study
Figure 2 for Can Vision Language Models Infer Human Gaze Direction? A Controlled Study
Figure 3 for Can Vision Language Models Infer Human Gaze Direction? A Controlled Study
Figure 4 for Can Vision Language Models Infer Human Gaze Direction? A Controlled Study
Viaarxiv icon