4.6 Article

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Related references

Note: Only part of the references are listed.
Article Computer Science, Artificial Intelligence

Exploiting enhanced and robust RGB-D face representation via progressive multi-modal learning

Yizhe Zhu et al.

Summary: This paper proposes a progressive multi-modal fusion framework to improve and enhance RGB-D facial recognition using low-cost RGB-D cameras. The framework includes a depth enhancement module, feature extraction and aggregation steps, as well as masked modeling and inter-modal feature interaction modules. Experimental results demonstrate the superior performance and robustness of the proposed solution compared to other face recognition methods on four challenging benchmark databases.

PATTERN RECOGNITION LETTERS (2023)

Article Computer Science, Information Systems

Deepfake Video Detection via Predictive Representation Learning

Shiming Ge et al.

Summary: This paper proposes a predictive representative learning approach called Latent Pattern Sensing for deepfake video detection. The approach captures semantic change characteristics in deepfake videos by analyzing temporal inconsistencies and uses a unified framework to describe latent patterns across frames. Experimental results show the effectiveness of the approach, achieving high AUC scores on benchmark datasets.

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2022)

Article Automation & Control Systems

DeepFakes detection across generations: Analysis of facial regions, fusion, and performance evaluation

Ruben Tolosana et al.

Summary: This study provides an in-depth analysis of the performance of fake detection in both 1st and 2nd generations of DeepFakes. Two different approaches are considered, and fusion techniques are applied to increase the robustness of the detectors. The findings highlight the effectiveness of using specific facial regions and fusion techniques, while emphasizing the need for further research on detecting unseen attacks.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2022)

Article Engineering, Electrical & Electronic

Student Network Learning via Evolutionary Knowledge Distillation

Kangkai Zhang et al.

Summary: Knowledge distillation is an effective way of transferring knowledge through teacher-student learning. However, traditional methods often result in a large capability gap between the teacher and student networks. Recent research has shown that a small capability gap can facilitate knowledge transfer. In this paper, we propose an evolutionary knowledge distillation approach that learns an evolving teacher to improve the effectiveness of knowledge transfer. We introduce simple guided modules to enhance intermediate knowledge representation and mimicking. Extensive experiments demonstrate the effectiveness and adaptability of our approach in low-resolution and few-sample scenarios.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Engineering, Electrical & Electronic

Detecting Compressed Deepfake Videos in Social Networks Using Frame-Temporality Two-Stream Convolutional Network

Juan Hu et al.

Summary: This paper discusses the development of Deepfake videos and their application in compressed videos. By analyzing the frame-level and temporality-level of compressed Deepfake videos, a two-stream method is proposed, which can better identify such videos.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Computer Science, Artificial Intelligence

Patch-DFD: Patch-based end-to-end DeepFake discriminator

Miaomiao Yu et al.

Summary: This paper introduces a novel deep forgery detector called Patch-DFD, which uses a patch-based solution and local voting strategy to more efficiently and accurately identify fake faces generated by deep generative models.

NEUROCOMPUTING (2022)

Article Engineering, Electrical & Electronic

Efficient Video Grounding With Which-Where Reading Comprehension

Jialin Gao et al.

Summary: Video grounding is a crucial technique for localizing temporal moments related to language descriptions. This paper presents an efficient framework that imitates the reading comprehension process to narrow the decision space, achieving accurate video grounding.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Computer Science, Artificial Intelligence

Learning to localize image forgery using end-to-end attention network

Iyyakutti Iyappan Ganapathi et al.

Summary: This article introduces a novel algorithm for localizing forged regions in digital images. The proposed algorithm utilizes channel attention convolutional blocks to infer the forged regions and extracts features in the spatial and frequency domains. Rigorous experiments demonstrate that the algorithm outperforms existing methods on benchmark datasets for localizing various manipulations.

NEUROCOMPUTING (2022)

Article Computer Science, Artificial Intelligence

A comprehensive overview of Deepfake: Generation, detection, datasets, and opportunities

Jia Wen Seow et al.

Summary: When used maliciously, deepfake technology can have detrimental implications on politics and society. Due to a lack of specific laws addressing deepfakes, detection plays a vital role. This paper provides a comprehensive overview of deepfake generation and detection, as well as discusses trends for future research.

NEUROCOMPUTING (2022)

Article Computer Science, Artificial Intelligence

QDL-CMFD: A Quality-independent and deep Learning-based Copy- Move image forgery detection method

Mehrad Aria et al.

Summary: This study presents an image forgery detection method called QDL-CMFD, which is based on deep learning. QDL-CMFD utilizes generative adversarial networks for image quality enhancement and convolutional neural networks for forgery detection. Unlike existing methods, QDL-CMFD is able to simultaneously detect multiple forged areas and determine the source and target of the forgery. Experimental results demonstrate its excellent performance in detecting low-quality forged images and small areas.

NEUROCOMPUTING (2022)

Proceedings Paper Computer Science, Information Systems

You Need to Read Again: Multi-granularity Perception Network for Moment Retrieval in Videos

Xin Sun et al.

Summary: This study proposes a Multi-Granularity Perception Network (MGPN) for moment retrieval in videos. By integrating human reading strategies and introducing fine-grained feature encoder and conditioned interaction module, the network is able to more accurately retrieve relevant video moments.

PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22) (2022)

Article Computer Science, Artificial Intelligence

Intensity enhancement via GAN for multimodal face expression recognition

Hongyu Yang et al.

Summary: This paper proposes a novel multimodal approach based on GAN that jointly models intensity enhancement and expression recognition, resulting in improved FER performance. Experimental results on multiple datasets validate the effectiveness of the method in both low expression intensity and general FER scenarios.

NEUROCOMPUTING (2021)

Article Computer Science, Hardware & Architecture

Generative Adversarial Networks

Ian Goodfellow et al.

COMMUNICATIONS OF THE ACM (2020)

Article Computer Science, Software Engineering

Deferred Neural Rendering: Image Synthesis using Neural Textures

Justus Thies et al.

ACM TRANSACTIONS ON GRAPHICS (2019)

Article Computer Science, Software Engineering

Synthesizing Obama: Learning Lip Sync from Audio

Supasorn Suwajanakorn et al.

ACM TRANSACTIONS ON GRAPHICS (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Densely Connected Convolutional Networks

Gao Huang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Xception: Deep Learning with Depthwise Separable Convolutions

Francois Chollet

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Exploiting Spatial Structure for Localizing Manipulated Image Regions

Jawadul H. Bappy et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)