photo


MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills

Add code
May 09, 2025
Figure 1 for MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills
Figure 2 for MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills
Figure 3 for MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills
Figure 4 for MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills
Viaarxiv icon

My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing

Add code
May 09, 2025
Viaarxiv icon

Multi-turn Consistent Image Editing

Add code
May 07, 2025
Viaarxiv icon

Advancing Conversational Diagnostic AI with Multimodal Reasoning

Add code
May 06, 2025
Figure 1 for Advancing Conversational Diagnostic AI with Multimodal Reasoning
Figure 2 for Advancing Conversational Diagnostic AI with Multimodal Reasoning
Figure 3 for Advancing Conversational Diagnostic AI with Multimodal Reasoning
Figure 4 for Advancing Conversational Diagnostic AI with Multimodal Reasoning
Viaarxiv icon

Towards Smart Point-and-Shoot Photography

Add code
May 06, 2025
Figure 1 for Towards Smart Point-and-Shoot Photography
Figure 2 for Towards Smart Point-and-Shoot Photography
Figure 3 for Towards Smart Point-and-Shoot Photography
Figure 4 for Towards Smart Point-and-Shoot Photography
Viaarxiv icon

SimPRIVE: a Simulation framework for Physical Robot Interaction with Virtual Environments

Add code
Apr 30, 2025
Figure 1 for SimPRIVE: a Simulation framework for Physical Robot Interaction with Virtual Environments
Figure 2 for SimPRIVE: a Simulation framework for Physical Robot Interaction with Virtual Environments
Figure 3 for SimPRIVE: a Simulation framework for Physical Robot Interaction with Virtual Environments
Figure 4 for SimPRIVE: a Simulation framework for Physical Robot Interaction with Virtual Environments
Viaarxiv icon

CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos

Add code
Apr 24, 2025
Figure 1 for CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos
Figure 2 for CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos
Figure 3 for CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos
Figure 4 for CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos
Viaarxiv icon

A Clinician-Friendly Platform for Ophthalmic Image Analysis Without Technical Barriers

Add code
Apr 22, 2025
Viaarxiv icon

Fragile Watermarking for Image Certification Using Deep Steganographic Embedding

Add code
Apr 18, 2025
Viaarxiv icon

Green Robotic Mixed Reality with Gaussian Splatting

Add code
Apr 18, 2025
Figure 1 for Green Robotic Mixed Reality with Gaussian Splatting
Figure 2 for Green Robotic Mixed Reality with Gaussian Splatting
Figure 3 for Green Robotic Mixed Reality with Gaussian Splatting
Figure 4 for Green Robotic Mixed Reality with Gaussian Splatting
Viaarxiv icon