4.6 Article

An adaptive loss weighting multi-task network with attention-guide proposal generation for small size defect inspection

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Software Engineering

Exploiting emotional concepts for image emotion recognition

Hansen Yang et al.

Summary: The research proposes a novel method for image emotion recognition, leveraging emotional concepts as intermediaries to connect images and emotions by organizing the relationship between concepts and emotions in the form of a knowledge graph. By exploring the relation between images and emotions in the semantic embedding space and using a multi-task learning deep model, the method successfully recognizes image emotions from a visual perspective, with the fusion strategy showing promising experimental results.

VISUAL COMPUTER (2023)

Article Geochemistry & Geophysics

SSPNet: Scale Selection Pyramid Network for Tiny Person Detection From UAV Images

Mingbo Hong et al.

Summary: This article proposes a scale selection pyramid network (SSPNet) for tiny person detection, which includes three components: context attention module (CAM), scale enhancement module (SEM), and scale selection module (SSM). The combination of these modules enhances feature representation for improved target detection performance.

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS (2022)

Article Computer Science, Artificial Intelligence

Two-Stage Copy-Move Forgery Detection With Self Deep Matching and Proposal SuperGlue

Yaqi Liu et al.

Summary: In this paper, a novel two-stage framework for copy-move forgery detection is proposed. The framework unifies end-to-end deep matching and keypoint matching by obtaining highly suspected proposals, leading to optimized detection results. Experiments demonstrate the effectiveness of the proposed framework.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Computer Science, Software Engineering

Fabric defect detection based on low-rank decomposition with structural constraints

Guohua Liu et al.

Summary: This paper proposes a fabric defect detection method based on low-rank decomposition with structural constraints. The method extracts energy features and constructs a fusion image to highlight defective regions, then builds a new low-rank decomposition model with structured sparsity-inducing norm introduced, and obtain the defect detection result through thresholding the sparse part. Experimental comparisons show the superiority of the proposed method over several state-of-the-art fabric defect detection methods.

VISUAL COMPUTER (2022)

Article Automation & Control Systems

A Survey of the Four Pillars for Small Object Detection: Multiscale Representation, Contextual Information, Super-Resolution, and Region Proposal

Guang Chen et al.

Summary: This article presents the first-ever survey of recent studies in deep learning-based small object detection. It provides an overview of the basic elements of small object detection, state-of-the-art datasets, performance of different methods, and the latest small object detection networks. The article also discusses promising directions and tasks for future work in small object detection.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2022)

Article Computer Science, Software Engineering

Sparse Attention Module for optimizing semantic segmentation performance combined with a multi-task feature extraction network

Min Jiang et al.

Summary: The paper proposes a Sparse Attention Model combined with a powerful multi-task feature extraction network to reduce computing resource consumption in semantic segmentation. By using a Class Attention Module, the model ensures that query vectors capture dense contextual information efficiently.

VISUAL COMPUTER (2022)

Article Radiology, Nuclear Medicine & Medical Imaging

Classification of Glaucoma Stages Using Image Empirical Mode Decomposition from Fundus Images

Deepak Parashar et al.

Summary: In this study, a Computer-Aided Diagnosis (CAD) method using Image Empirical Mode Decomposition (IEMD) was proposed for the classification of glaucoma stages. The preprocessed fundus photographs were decomposed into different Intrinsic Mode Functions (IMFs) to capture the pixel variations, and significant texture-based descriptors were computed from the IMFs. Dimensionality reduction using Principal Component Analysis (PCA) and feature ranking using Analysis of Variance (ANOVA) were employed. The LS-SVM classifier was used for glaucoma stage classification, achieving a high classification accuracy of 94.45% on the RIM-ONE r12 database.

JOURNAL OF DIGITAL IMAGING (2022)

Article Computer Science, Software Engineering

LE-MSFE-DDNet: a defect detection network based on low-light enhancement and multi-scale feature extraction

Weihua Hu et al.

Summary: This paper proposes a defect detection network based on low-light enhancement and multi-scale feature extraction, which introduces two blocks for low-light enhancement and combining channel dependencies for multi-scale feature extraction. This network can accurately locate defects of different scales in complex scenes, outperforming the state-of-the-art method for surface defect detection.

VISUAL COMPUTER (2022)

Article Engineering, Electrical & Electronic

Viewing Behavior Supported Visual Saliency Predictor for 360 Degree Videos

Yucheng Zhu et al.

Summary: This research aims to model visual attention in virtual reality (VR) to enhance user experience quality. By constructing datasets and proposing prediction frameworks, the researchers conducted experiments on panoramic videos and presented traditional and CNN-based models. The results demonstrate the significance of instantaneous viewing behavior in VR experiences.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Computer Science, Artificial Intelligence

Multi-Task Learning for Dense Prediction Tasks: A Survey

Simon Vandenhende et al.

Summary: With the advent of deep learning, dense prediction tasks have significantly improved. Recent multi-task learning techniques have shown promising results by jointly tackling multiple tasks. This survey provides a comprehensive view on state-of-the-art deep learning approaches for multi-task learning in computer vision, with a focus on dense prediction tasks.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Engineering, Electrical & Electronic

Efficient Fused-Attention Model for Steel Surface Defect Detection

Ching-Chi Yeung et al.

Summary: Steel surface defect detection is a crucial task in manufacturing. This article proposes a fused-attention network (FANet) to address the challenges of scale variations, shape variations, and detection efficiency in defect detection. The proposed method achieves state-of-the-art performance on two steel surface defect detection datasets by applying an attention mechanism, an adaptively balanced feature fusion method, and a fused-attention module to improve accuracy and speed.

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT (2022)

Article Computer Science, Theory & Methods

Screen Content Quality Assessment: Overview, Benchmark, and Beyond

Xiongkuo Min et al.

Summary: This article provides a systematic and timely review on the research field of screen content quality assessment, covering background, characteristics, methodologies and measures, state-of-the-art evaluation, generalizations to QoE assessment, and unresolved challenges and future research directions.

ACM COMPUTING SURVEYS (2022)

Article Computer Science, Information Systems

Fine localization and distortion resistant detection of multi-class barcode in complex environments

Jiahe Zhang et al.

Summary: The paper proposes a region-based end-to-end network to precisely localize and classify 1D and 2D barcodes in complex environments. Two special layers, quadrilateral regression layer and Multi-scale Spatial Pyramid Pooling layer, are designed to improve the accuracy of barcode detection. Extensive experiments validate the effectiveness of the proposed layers in resisting distortions and serving as a preprocessor for QR code decoding.

MULTIMEDIA TOOLS AND APPLICATIONS (2021)

Article Geochemistry & Geophysics

An Anchor-Free Method Based on Feature Balancing and Refinement Network for Multiscale Ship Detection in SAR Images

Jiamei Fu et al.

Summary: A novel detection method named FBR-Net is proposed in this article, which achieves efficient detection of multiscale SAR ships in complex scenes by eliminating the anchor effect, balancing multiple features, and refining object features.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2021)

Article Environmental Sciences

A Multi-Scale Spatial Attention Region Proposal Network for High-Resolution Optical Remote Sensing Imagery

Ruchan Dong et al.

Summary: In this study, a multi-scale spatial attention region proposal network (MSA-RPN) is proposed for high-resolution optical remote sensing imagery, which focuses on improving the recall rate for small targets in object detection and achieves higher accuracy.

REMOTE SENSING (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

Yu Quan et al.

Summary: This paper studies and analyzes the neural network of object detection algorithm in order to improve the performance of two-stage object detection and consider the importance of scene and semantic information for visual recognition. It proposes a scene level region proposal self-attention object detection model based on depth separable convolution, reconstructs the scene-level region proposal self-attention module, and constructs a deep separable convolutional network module to enhance the overall performance of the model.

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) (2021)

Article Engineering, Electrical & Electronic

Automatic Classification of Glaucoma Stages Using Two-Dimensional Tensor Empirical Wavelet Transform

Deepak Parashar et al.

Summary: Glaucoma is a chronic eye disease that may cause permanent vision loss. Existing automatic classification methods are not efficient for early-stage glaucoma detection. A novel glaucoma classification method based on 2D-T-EWT was proposed in this study, achieving high classification accuracy.

IEEE SIGNAL PROCESSING LETTERS (2021)

Article Computer Science, Artificial Intelligence

Defect identification of wind turbine blades based on defect semantic features with transfer feature extractor

Yajie Yu et al.

NEUROCOMPUTING (2020)

Article Automation & Control Systems

Deep-Learning-Based Small Surface Defect Detection via an Exaggerated Local Variation-Based Generative Adversarial Network

Jian Lian et al.

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS (2020)

Article Engineering, Electrical & Electronic

An End-to-End Steel Surface Defect Detection Approach via Fusing Multiple Hierarchical Features

Yu He et al.

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT (2020)

Review Computer Science, Information Systems

Perceptual image quality assessment: a survey

Zhai Guangtao et al.

SCIENCE CHINA-INFORMATION SCIENCES (2020)

Article Engineering, Electrical & Electronic

Tiny-BDN: An Efficient and Compact Barcode Detection Network

Jun Jia et al.

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING (2020)

Article Engineering, Electrical & Electronic

Detecting Small Objects Using a Channel-Aware Deconvolutional Network

Kaiwen Duan et al.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2020)

Article Engineering, Electrical & Electronic

Small Object Detection in Unmanned Aerial Vehicle Images Using Feature Fusion and Scaling-Based Single Shot Detector With Spatial Context Analysis

Xi Liang et al.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2020)

Article Computer Science, Information Systems

The Prediction of Saliency Map for Head and Eye Movements in 360 Degree Images

Yucheng Zhu et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2020)

Article Computer Science, Artificial Intelligence

Study of Subjective and Objective Quality Assessment of Audio-Visual Signals

Xiongkuo Min et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2020)

Article Computer Science, Artificial Intelligence

A Multimodal Saliency Model for Videos With High Audio-Visual Correspondence

Xiongkuo Min et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2020)

Article Computer Science, Artificial Intelligence

HyperFace: A Deep Multi-Task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition

Rajeev Ranjan et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2019)

Article Automation & Control Systems

A Two-Stage Data-Driven Approach for Image-Based Wind Turbine Blade Crack Inspections

Long Wang et al.

IEEE-ASME TRANSACTIONS ON MECHATRONICS (2019)

Article Computer Science, Information Systems

EMBDN: An Efficient Multiclass Barcode Detection Network for Complicated Environments

Jun Jia et al.

IEEE INTERNET OF THINGS JOURNAL (2019)

Proceedings Paper Computer Science, Artificial Intelligence

AttentionMask: Attentive, Efficient Object Proposal Generation Focusing on Small Objects

Christian Wilms et al.

COMPUTER VISION - ACCV 2018, PT II (2019)

Article Computer Science, Artificial Intelligence

Unified Blind Quality Assessment of Compressed Natural, Graphic, and Screen Content Images

Xiongkuo Min et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2017)

Article Computer Science, Artificial Intelligence

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Improving Small Object Detection

Harish Krishna et al.

PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR) (2017)

Review Behavioral Sciences

The role of context in object recognition

Aude Oliva et al.

TRENDS IN COGNITIVE SCIENCES (2007)