Karl Willis

CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches

Sep 26, 2024

Sifan Wu, Amir Khasahmadi, Mor Katz, Pradeep Kumar Jayaraman, Yewen Pu, Karl Willis, Bang Liu

Abstract: Parametric Computer-Aided Design (CAD) is central to contemporary mechanical design. However, it encounters challenges in achieving precise parametric sketch modeling and lacks practical evaluation metrics suitable for mechanical design. We harness the capabilities of pre-trained foundation models, renowned for their successes in natural language processing and computer vision, to develop generative models specifically for CAD. These models are adept at understanding complex geometries and design reasoning, a crucial advancement in CAD technology. In this paper, we propose CadVLM, an end-to-end vision language model for CAD generation. Our approach involves adapting pre-trained foundation models to manipulate engineering sketches effectively, integrating both sketch primitive sequences and sketch images. Extensive experiments demonstrate superior performance on multiple CAD sketch generation tasks such as CAD autocompletion, CAD autoconstraint, and image conditional generation. To our knowledge, this is the first instance of a multimodal Large Language Model (LLM) being successfully applied to parametric CAD generation, representing a pioneering step in the field of computer-aided mechanical design.

TextCraft: Zero-Shot Generation of High-Fidelity and Diverse Shapes from Text

Nov 04, 2022

Aditya Sanghi, Rao Fu, Vivian Liu, Karl Willis, Hooman Shayani, Amir Hosein Khasahmadi, Srinath Sridhar, Daniel Ritchie

Abstract: Language is one of the primary means by which we describe the 3D world around us. While rapid progress has been made in text-to-2D-image synthesis, similar progress in text-to-3D-shape synthesis has been hindered by the lack of paired (text, shape) data. Moreover, extant methods for text-to-shape generation have limited shape diversity and fidelity. We introduce TextCraft, a method to address these limitations by producing high-fidelity and diverse 3D shapes without the need for (text, shape) pairs for training. TextCraft achieves this by using CLIP and using a multi-resolution approach by first generating in a low-dimensional latent space and then upscaling to a higher resolution, improving the fidelity of the generated shape. To improve shape diversity, we use a discrete latent space which is modelled using a bidirectional transformer conditioned on the interchangeable image-text embedding space induced by CLIP. Moreover, we present a novel variant of classifier-free guidance, which further improves the accuracy-diversity trade-off. Finally, we perform extensive experiments that demonstrate that TextCraft outperforms state-of-the-art baselines.

Mates2Motion: Learning How Mechanical CAD Assemblies Work

Aug 02, 2022

James Noeckel, Benjamin T. Jones, Karl Willis, Brian Curless, Adriana Schulz

Abstract: We describe our work on inferring the degrees of freedom between mated parts in mechanical assemblies using deep learning on CAD representations. We train our model using a large dataset of real-world mechanical assemblies consisting of CAD parts and mates joining them together. We present methods for re-defining these mates to make them better reflect the motion of the assembly, as well as narrowing down the possible axes of motion. We also conduct a user study to create a motion-annotated test set with more reliable labels.

* Contains 5 pages, 2 figures. Presented at the ICML 2022 Workshop on Machine Learning in Computational Design
