The authors of Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think! have not publicly listed the code yet.
Request code directly from the authors:
Get an expert to implement this paper:
(OR if you have code to share with the community, please submit it here ✉️😊🙏)