4.7 Article

3D human body modeling with orthogonal human mask image based on multi-channel Swin transformer architecture

期刊

IMAGE AND VISION COMPUTING
卷 137, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.imavis.2023.104795

关键词

Body shape space; Human shape estimation; Orthogonal human mask; Swin transformer; Body shape classification

向作者/读者索取更多资源

A new method based on Swin transformer is proposed for reconstructing 3D human body shape from human orthogonal mask image, overcoming the limitations of reconstruction based on RGB images and naked 3D scanning. The method represents the reconstruction problem as solving regression mapping function, and innovatively represents the regression function as a piecewise function with the human body shape as the segmentation criterion.
The reconstruction based on RGB images of dressed human body lacks the shape information of the human body under clothing, while the naked 3D human body scanning will violate the user's privacy. To overcome these limitations, a new method, based on Swin transformer (Swin-T), for reconstructing 3D human body shape from human orthogonal mask image is proposed. Its core is to express the reconstruction problem as solving regression mapping function. A fast body shape type classification method based on the human front mask is proposed. The regression function is innovatively represented as a piecewise function, with the body shape of the human body as the segmentation criterion. A multi-channel Swin-T architecture is designed, which can not only extract features from front and side mask images, but also their mixed features to construct the regression mapping function. Different body types for different genders are predicted with separate regression function to help estimate an accurate human model. Extensive experimental results show that the proposed method effectively achieves visually realistic and accurate body reconstruction, and significantly outperforms the current state-ofthe-art methods. In addition, the classification of body types can compensate for the errors caused by partial clothing laxity in practical applications, which is beneficial for users to obtain a more accurate 3D human model.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据