Yixiao Kang

Dreamcrafter: Immersive Editing of 3D Radiance Fields Through Flexible, Generative Inputs and Outputs

Dec 23, 2025

Cyrus Vachha, Yixiao Kang, Zach Dive, Ashwat Chidambaram, Anik Gupta, Eunice Jun, Bjoern Hartmann

Abstract: Authoring 3D scenes is a central task for spatial computing applications. Two competing visions for lowering existing barriers are to (1) focus on immersive, direct manipulation of 3D content or (2) leverage AI techniques that capture real scenes (3D Radiance Fields such as NeRFs and 3D Gaussian Splatting) and modify them at a higher level of abstraction, at the cost of high latency. We unify the complementary strengths of these approaches and investigate how to integrate generative AI advances into real-time, immersive 3D Radiance Field editing. We introduce Dreamcrafter, a VR-based 3D scene editing system that: (1) provides a modular architecture to integrate generative AI algorithms; (2) combines different levels of control for creating objects, including natural language and direct manipulation; and (3) introduces proxy representations that support interaction during high-latency operations. We contribute empirical findings on control preferences and discuss how generative AI interfaces beyond text input enhance creativity in scene editing and world building.

* CHI 2025, Project page: https://dream-crafter.github.io/

AI-Driven Stylization of 3D Environments

Nov 09, 2024

Yuanbo Chen, Yixiao Kang, Yukun Song, Cyrus Vachha, Sining Huang

Abstract: In this work, we discuss methods to stylize a scene of 3D primitive objects into a higher-fidelity 3D scene using novel 3D representations like NeRFs and 3D Gaussian Splatting. Our approach leverages existing image stylization systems and image-to-3D generative models to create a pipeline that iteratively stylizes and composites 3D objects into scenes. We show our results on adding generated objects into a scene and discuss limitations.

AR Overlay: Training Image Pose Estimation on Curved Surface in a Synthetic Way

Sep 22, 2024

Sining Huang, Yukun Song, Yixiao Kang, Chang Yu

Abstract: In the field of spatial computing, one of the most essential tasks is the pose estimation of 3D objects. While rigid transformations of arbitrary 3D objects are relatively hard to detect because varying environments introduce factors such as insufficient lighting or occlusion, objects with pre-defined shapes are often easy to track by leveraging geometric constraints. Curved images, with flexible dimensions but a confined shape, are an essential class of shapes often targeted in 3D tracking. Traditionally, proprietary algorithms require specific curvature measures as input, along with the original flattened images, to enable pose estimation for a single image target. In this paper, we propose a pipeline that can detect several logo images simultaneously and requires only the original images as input, unlocking more effects in downstream fields such as Augmented Reality (AR).

* 12th International Conference on Signal, Image Processing and Pattern Recognition (SIPP 2024)
