Alert button
Picture for Thomas Kipf

Thomas Kipf

Alert button

SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code

Add code
Bookmark button
Alert button
Mar 02, 2024
Ziniu Hu, Ahmet Iscen, Aashi Jain, Thomas Kipf, Yisong Yue, David A. Ross, Cordelia Schmid, Alireza Fathi

Figure 1 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 2 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 3 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 4 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Viaarxiv icon

Learning 3D Particle-based Simulators from RGB-D Videos

Add code
Bookmark button
Alert button
Dec 08, 2023
William F. Whitney, Tatiana Lopez-Guevara, Tobias Pfaff, Yulia Rubanova, Thomas Kipf, Kimberly Stachenfeld, Kelsey R. Allen

Viaarxiv icon

DyST: Towards Dynamic Neural Scene Representations on Real-World Videos

Add code
Bookmark button
Alert button
Oct 09, 2023
Maximilian Seitzer, Sjoerd van Steenkiste, Thomas Kipf, Klaus Greff, Mehdi S. M. Sajjadi

Viaarxiv icon

Video OWL-ViT: Temporally-consistent open-world localization in video

Add code
Bookmark button
Alert button
Aug 22, 2023
Georg Heigold, Matthias Minderer, Alexey Gritsenko, Alex Bewley, Daniel Keysers, Mario Lučić, Fisher Yu, Thomas Kipf

Figure 1 for Video OWL-ViT: Temporally-consistent open-world localization in video
Figure 2 for Video OWL-ViT: Temporally-consistent open-world localization in video
Figure 3 for Video OWL-ViT: Temporally-consistent open-world localization in video
Figure 4 for Video OWL-ViT: Temporally-consistent open-world localization in video
Viaarxiv icon

One-shot Imitation Learning via Interaction Warping

Add code
Bookmark button
Alert button
Jun 21, 2023
Ondrej Biza, Skye Thompson, Kishore Reddy Pagidi, Abhinav Kumar, Elise van der Pol, Robin Walters, Thomas Kipf, Jan-Willem van de Meent, Lawson L. S. Wong, Robert Platt

Figure 1 for One-shot Imitation Learning via Interaction Warping
Figure 2 for One-shot Imitation Learning via Interaction Warping
Figure 3 for One-shot Imitation Learning via Interaction Warping
Figure 4 for One-shot Imitation Learning via Interaction Warping
Viaarxiv icon

DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$

Add code
Bookmark button
Alert button
Jun 13, 2023
Allan Jabri, Sjoerd van Steenkiste, Emiel Hoogeboom, Mehdi S. M. Sajjadi, Thomas Kipf

Figure 1 for DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$
Figure 2 for DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$
Figure 3 for DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$
Figure 4 for DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$
Viaarxiv icon

Sensitivity of Slot-Based Object-Centric Models to their Number of Slots

Add code
Bookmark button
Alert button
May 30, 2023
Roland S. Zimmermann, Sjoerd van Steenkiste, Mehdi S. M. Sajjadi, Thomas Kipf, Klaus Greff

Figure 1 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Figure 2 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Figure 3 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Figure 4 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Viaarxiv icon

AudioSlots: A slot-centric generative model for audio separation

Add code
Bookmark button
Alert button
May 09, 2023
Pradyumna Reddy, Scott Wisdom, Klaus Greff, John R. Hershey, Thomas Kipf

Figure 1 for AudioSlots: A slot-centric generative model for audio separation
Figure 2 for AudioSlots: A slot-centric generative model for audio separation
Figure 3 for AudioSlots: A slot-centric generative model for audio separation
Viaarxiv icon

Scaling Vision Transformers to 22 Billion Parameters

Add code
Bookmark button
Alert button
Feb 10, 2023
Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd van Steenkiste, Gamaleldin F. Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver, Fantine Huot, Jasmijn Bastings, Mark Patrick Collier, Alexey Gritsenko, Vighnesh Birodkar, Cristina Vasconcelos, Yi Tay, Thomas Mensink, Alexander Kolesnikov, Filip Pavetić, Dustin Tran, Thomas Kipf, Mario Lučić, Xiaohua Zhai, Daniel Keysers, Jeremiah Harmsen, Neil Houlsby

Figure 1 for Scaling Vision Transformers to 22 Billion Parameters
Figure 2 for Scaling Vision Transformers to 22 Billion Parameters
Figure 3 for Scaling Vision Transformers to 22 Billion Parameters
Figure 4 for Scaling Vision Transformers to 22 Billion Parameters
Viaarxiv icon

Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames

Add code
Bookmark button
Alert button
Feb 09, 2023
Ondrej Biza, Sjoerd van Steenkiste, Mehdi S. M. Sajjadi, Gamaleldin F. Elsayed, Aravindh Mahendran, Thomas Kipf

Figure 1 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Figure 2 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Figure 3 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Figure 4 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Viaarxiv icon