photo


Multimodal Integrated Knowledge Transfer to Large Language Models through Preference Optimization with Biomedical Applications

May 09, 2025

My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing

May 09, 2025

MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills

May 09, 2025

Multi-turn Consistent Image Editing

May 07, 2025

Towards Smart Point-and-Shoot Photography

May 06, 2025

Advancing Conversational Diagnostic AI with Multimodal Reasoning

May 06, 2025

SimPRIVE: a Simulation framework for Physical Robot Interaction with Virtual Environments

Apr 30, 2025

CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos

Apr 24, 2025

A Clinician-Friendly Platform for Ophthalmic Image Analysis Without Technical Barriers

Apr 22, 2025

Green Robotic Mixed Reality with Gaussian Splatting

Apr 18, 2025