Picture for Sjoerd van Steenkiste

Sjoerd van Steenkiste

Benchmarking Vision Language Models for Cultural Understanding

Add code
Jul 15, 2024
Figure 1 for Benchmarking Vision Language Models for Cultural Understanding
Figure 2 for Benchmarking Vision Language Models for Cultural Understanding
Figure 3 for Benchmarking Vision Language Models for Cultural Understanding
Figure 4 for Benchmarking Vision Language Models for Cultural Understanding
Viaarxiv icon

Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models

Add code
Jun 13, 2024
Viaarxiv icon

DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback

Add code
Nov 29, 2023
Figure 1 for DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback
Figure 2 for DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback
Figure 3 for DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback
Figure 4 for DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback
Viaarxiv icon

A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models

Add code
Nov 01, 2023
Figure 1 for A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models
Figure 2 for A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models
Figure 3 for A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models
Figure 4 for A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models
Viaarxiv icon

The Impact of Depth and Width on Transformer Language Model Generalization

Add code
Oct 30, 2023
Viaarxiv icon

DyST: Towards Dynamic Neural Scene Representations on Real-World Videos

Add code
Oct 09, 2023
Viaarxiv icon

DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$

Add code
Jun 13, 2023
Figure 1 for DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$
Figure 2 for DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$
Figure 3 for DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$
Figure 4 for DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$
Viaarxiv icon

Sensitivity of Slot-Based Object-Centric Models to their Number of Slots

Add code
May 30, 2023
Figure 1 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Figure 2 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Figure 3 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Figure 4 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Viaarxiv icon

Scaling Vision Transformers to 22 Billion Parameters

Add code
Feb 10, 2023
Figure 1 for Scaling Vision Transformers to 22 Billion Parameters
Figure 2 for Scaling Vision Transformers to 22 Billion Parameters
Figure 3 for Scaling Vision Transformers to 22 Billion Parameters
Figure 4 for Scaling Vision Transformers to 22 Billion Parameters
Viaarxiv icon

Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames

Add code
Feb 09, 2023
Figure 1 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Figure 2 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Figure 3 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Figure 4 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Viaarxiv icon