photo


CLIP the Landscape: Automated Tagging of Crowdsourced Landscape Images

Add code
Jun 13, 2025
Viaarxiv icon

Deep Learning-based Multi Project InP Wafer Simulation for Unsupervised Surface Defect Detection

Add code
Jun 12, 2025
Viaarxiv icon

Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning

Add code
Jun 12, 2025
Viaarxiv icon

Using Sign Language Production as Data Augmentation to enhance Sign Language Translation

Add code
Jun 11, 2025
Viaarxiv icon

SAMSelect: A Spectral Index Search for Marine Debris Visualization using Segment Anything

Add code
Jun 10, 2025
Viaarxiv icon

On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images

Add code
Jun 05, 2025
Viaarxiv icon

SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents

Add code
Jun 05, 2025
Viaarxiv icon

Can Vision Language Models Infer Human Gaze Direction? A Controlled Study

Add code
Jun 04, 2025
Viaarxiv icon

Leveraging Intermediate Features of Vision Transformer for Face Anti-Spoofing

Add code
May 30, 2025
Viaarxiv icon

Dynamic Malware Classification of Windows PE Files using CNNs and Greyscale Images Derived from Runtime API Call Argument Conversion

Add code
May 30, 2025
Viaarxiv icon