photo


SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents

Add code
Jun 05, 2025
Figure 1 for SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents
Figure 2 for SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents
Figure 3 for SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents
Figure 4 for SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents
Viaarxiv icon

Can Vision Language Models Infer Human Gaze Direction? A Controlled Study

Add code
Jun 04, 2025
Figure 1 for Can Vision Language Models Infer Human Gaze Direction? A Controlled Study
Figure 2 for Can Vision Language Models Infer Human Gaze Direction? A Controlled Study
Figure 3 for Can Vision Language Models Infer Human Gaze Direction? A Controlled Study
Figure 4 for Can Vision Language Models Infer Human Gaze Direction? A Controlled Study
Viaarxiv icon

Leveraging Intermediate Features of Vision Transformer for Face Anti-Spoofing

Add code
May 30, 2025
Viaarxiv icon

Dynamic Malware Classification of Windows PE Files using CNNs and Greyscale Images Derived from Runtime API Call Argument Conversion

Add code
May 30, 2025
Figure 1 for Dynamic Malware Classification of Windows PE Files using CNNs and Greyscale Images Derived from Runtime API Call Argument Conversion
Figure 2 for Dynamic Malware Classification of Windows PE Files using CNNs and Greyscale Images Derived from Runtime API Call Argument Conversion
Figure 3 for Dynamic Malware Classification of Windows PE Files using CNNs and Greyscale Images Derived from Runtime API Call Argument Conversion
Figure 4 for Dynamic Malware Classification of Windows PE Files using CNNs and Greyscale Images Derived from Runtime API Call Argument Conversion
Viaarxiv icon

PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents

Add code
May 29, 2025
Viaarxiv icon

Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch

Add code
May 29, 2025
Figure 1 for Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch
Figure 2 for Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch
Figure 3 for Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch
Figure 4 for Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch
Viaarxiv icon

Generating Fit Check Videos with a Handheld Camera

Add code
May 29, 2025
Viaarxiv icon

DarkDiff: Advancing Low-Light Raw Enhancement by Retasking Diffusion Models for Camera ISP

Add code
May 29, 2025
Viaarxiv icon

PS4PRO: Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization

Add code
May 28, 2025
Viaarxiv icon

Guess the Age of Photos: An Interactive Web Platform for Historical Image Age Estimation

Add code
May 28, 2025
Viaarxiv icon