4.5 Article

BTSwin-Unet: 3D U-shaped Symmetrical Swin Transformer-based Network for Brain Tumor Segmentation with Self-supervised Pre-training

Related references

Note: Only part of the references are listed.
Article Computer Science, Software Engineering

PVT v2: Improved baselines with Pyramid Vision Transformer

Wenhai Wang et al.

Summary: This work presents the improved Pyramid Vision Transformer v2 (PVT v2) by adding three designs, achieving significant improvements in fundamental vision tasks. PVT v2 performs comparably or better than recent work such as the Swin transformer.

COMPUTATIONAL VISUAL MEDIA (2022)

Proceedings Paper Computer Science, Artificial Intelligence

SimMIM: a Simple Framework for Masked Image Modeling

Zhenda Xie et al.

Summary: This paper presents SimMIM, a simple framework for masked image modeling. The study shows that the simple designs of each component have strong representation learning performance and achieve high accuracy on ImageNet-1K. Additionally, this approach addresses the data-hungry issue faced by large-scale model training.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Masked Autoencoders Are Scalable Vision Learners

Kaiming He et al.

Summary: This paper presents a self-supervised learning method for computer vision based on masked autoencoders. By masking a portion of the input image and reconstructing the missing pixels, large models can be trained efficiently and effectively. The approach achieves high generalization performance and outperforms supervised pretraining in transfer learning tasks.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows

Xiaoyi Dong et al.

Summary: CSWin Transformer is an efficient and effective Transformer-based backbone for general-purpose vision tasks. It achieves competitive performance by using the Cross-Shaped Window self-attention mechanism, Locally-enhanced Positional Encoding, and a hierarchical structure.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

UNETR: Transformers for 3D Medical Image Segmentation

Ali Hatamizadeh et al.

Summary: Fully Convolutional Neural Networks (FCNNs) have been successful in medical image segmentation, but their limited ability to learn long-range dependencies is a challenge. Inspired by transformers in NLP, we propose a novel architecture called UNet Transformers (UNETR) to redefine volumetric medical image segmentation as a sequence prediction problem. By combining transformers and U-shaped network design in the encoder and decoder, we effectively capture global information and achieve semantic segmentation output.

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022) (2022)

Article Computer Science, Artificial Intelligence

Brain Tumor Segmentation Using Deep Learning and Fuzzy K-Means Clustering for Magnetic Resonance Images

R. Pitchai et al.

Summary: This paper proposes a methodology for brain tumor segmentation using a combination of Artificial Neural Network and Fuzzy K-means algorithm, involving noise removal, attribute extraction and selection, classification, and segmentation stages. The proposed approach achieves high accuracy and sensitivity in segmenting brain tumor regions.

NEURAL PROCESSING LETTERS (2021)

Article Computer Science, Artificial Intelligence

Human Activity Recognition by Using Different Deep Learning Approaches for Wearable Sensors

Cagatay Berke Erdas et al.

Summary: The use of wearable sensors for activity recognition tasks has become widespread, with deep learning algorithms such as Convolutional Neural Networks proving to be effective. In this study, data from accelerometer sensors was used to feed deep learning algorithms, forming consecutive raw samples of the same activity to capture patterns and preserve the continuous structure of movement.

NEURAL PROCESSING LETTERS (2021)

Article Computer Science, Artificial Intelligence

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Liang-Chieh Chen et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2018)