☆ 3.8 Proceedings Paper

SceneFormer: Indoor Scene Generation with Transformers

2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021) (2021)

Journal

2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021)

Volume -, Issue -, Pages 106-115

Publisher

IEEE COMPUTER SOC

DOI: 10.1109/3DV53792.2021.00021

Keywords

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This study focuses on indoor scene generation using transformers, without relying on appearance information. By using selfattention and cross-attention mechanisms, the model can generate scenes faster and with similar or improved realism compared to existing methods, conditioned on room layout or text descriptions.

We address the task of indoor scene generation by generating a sequence of objects, along with their locations and orientations conditioned on a room layout. Large-scale indoor scene datasets allow us to extract patterns from user-designed indoor scenes, and generate new scenes based on these patterns. Existing methods rely on the 2D or 3D appearance of these scenes in addition to object positions, and make assumptions about the possible relations between objects. In contrast, we do not use any appearance information, and implicitly learn object relations using the selfattention mechanism of transformers. We show that our model design leads to faster scene generation with similar or improved levels of realism compared to previous methods. Our method is also flexible, as it can be conditioned not only on the room layout but also on text descriptions of the room, using only the cross-attention mechanism of transformers. Our user study shows that our generated scenes are preferred to the state-of-the-art FastSynth scenes 53.9% and 56.7% of the time for bedroom and living room scenes, respectively. At the same time, we generate a scene in 1.48 seconds on average, 20% faster than FastSynth.

SceneFormer: Indoor Scene Generation with Transformers

Journal

2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021)

Publisher

IEEE COMPUTER SOC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

SceneFormer: Indoor Scene Generation with Transformers

Journal

2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021)

Publisher

IEEE COMPUTER SOC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper