4.7 Article

DPTVAE: Data-driven prior-based tabular variational autoencoder for credit data synthesizing

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Artificial Intelligence

Protecting the anonymity of online users through Bayesian data synthesis

Matthew J. Schneider et al.

Summary: Privacy concerns arise when online users of popular user-generated content platforms are identified through a combination of their structured data and textual content. To address this, we propose a Bayesian sequential synthesis methodology for organizations to share structured data along with textual content. Our approach allows platforms to control the privacy level of their released data using a single shrinkage parameter. Our results demonstrate that our synthesis strategy reduces the probability of user identification while preserving much of the textual content in the structured data. Moreover, we find that sharing protected data offers greater value than sharing the unprotected structured data and textual content separately. These findings encourage UGC platforms to protect online user anonymity by using synthetic data.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Deep Neural Networks and Tabular Data: A Survey

Vadim Borisov et al.

Summary: This work provides an overview of state-of-the-art deep learning methods for tabular data, covering data transformations, specialized architectures, and regularization models. It also discusses deep learning approaches for generating tabular data and strategies for explaining deep models on tabular data. The results suggest that gradient-boosted tree ensembles still outperform deep learning models on supervised learning tasks for tabular data, indicating a stagnation in the research progress of competitive deep learning models in this area. This study serves as a valuable starting point for researchers and practitioners interested in deep learning with tabular data.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

A Diverse Domain Generative Adversarial Network for Style Transfer on Face Photographs

Rabia Tahir et al.

Summary: The paper presents a Diverse Domain Generative Adversarial Network (DD-GAN) that performs fast diverse domain style translation on human face images. The research work is highly efficient and focused on applying different attractive and unique painting styles to human photographs while keeping the content preserved after translation.

INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE (2022)

Article Computer Science, Artificial Intelligence

ED-Dehaze Net: Encoder and Decoder Dehaze Network

Hongqi Zhang et al.

Summary: This study introduces a novel end-to-end dehazing method, using Encoder and Decoder Dehaze Network, trained by a Generator and a Discriminator, achieving high-quality dehazing of hazy images.

INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE (2022)

Article Mathematics, Interdisciplinary Applications

Synthetic Data

Trivellore E. Raghunathan

Summary: Synthetic data sets are an attractive framework to provide widespread access to data for analysis while mitigating privacy and confidentiality concerns. This article aims to review various methods for generating and analyzing synthetic data sets, inferential justification, limitations of the approaches, and future research directions.

ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 8, 2021 (2021)

Article Computer Science, Artificial Intelligence

Learning latent representations of bank customers with the Variational Autoencoder

Rogelio A. Mancisidor et al.

Summary: This research demonstrates that steering data representations in the latent space of the Variational Autoencoder (VAE) is possible using a semi-supervised learning framework and Weight of Evidence (WoE) method. The proposed method successfully learns a well-defined clustering structure of data representation, capturing customers' creditworthiness.

EXPERT SYSTEMS WITH APPLICATIONS (2021)

Article Computer Science, Information Systems

Relevance aggregation for neural networks interpretability and knowledge discovery on tabular data

Bruno Iochins Grisci et al.

Summary: The study introduces a relevance aggregation algorithm that combines the relevance computed from multiple samples by a neural network to generate scores for each input feature. Two visualization methods for learned patterns were presented to enhance model comprehension. The method accurately identifies the most important features for network predictions.

INFORMATION SCIENCES (2021)

Article Business, Finance

COVID-19 pandemic risk and probability of loan default: evidence from marketplace lending market

Asror Nigmonov et al.

Summary: The research shows that the COVID-19 pandemic has a significant negative impact on the marketplace lending sector, leading to an increase in default risk, especially in May and June of 2020. The impact of COVID-19 risk is greater for borrowers with lower credit ratings and in countries with lower levels of FinTech adoption.

FINANCIAL INNOVATION (2021)

Article Engineering, Biomedical

MedGAN: Medical image translation using GANs

Karim Armanious et al.

COMPUTERIZED MEDICAL IMAGING AND GRAPHICS (2020)

Article Computer Science, Hardware & Architecture

Generative Adversarial Networks

Ian Goodfellow et al.

COMMUNICATIONS OF THE ACM (2020)

Article Computer Science, Artificial Intelligence

Improving classification accuracy using data augmentation on small data sets

Francisco J. Moreno-Barea et al.

EXPERT SYSTEMS WITH APPLICATIONS (2020)

Article Computer Science, Artificial Intelligence

k-means as a variational EM approximation of Gaussian mixture models

Joerg Luecke et al.

PATTERN RECOGNITION LETTERS (2019)

Article Computer Science, Interdisciplinary Applications

The synthesis of data from instrumented structures and physics-based models via Gaussian processes

Alastair Gregory et al.

JOURNAL OF COMPUTATIONAL PHYSICS (2019)

Article Engineering, Electrical & Electronic

Optimal data-based binning for histograms and histogram-based probability density models

Kevin H. Knuth

DIGITAL SIGNAL PROCESSING (2019)

Article Computer Science, Information Systems

The optimal combination of feature selection and data discretization: An empirical study

Chih-Fong Tsai et al.

INFORMATION SCIENCES (2019)

Article Computer Science, Artificial Intelligence

Effective data generation for imbalanced learning using conditional generative adversarial networks

Georgios Douzas et al.

EXPERT SYSTEMS WITH APPLICATIONS (2018)

Article Mathematics, Interdisciplinary Applications

Finite Mixture Models

Geoffrey J. McLachlan et al.

Annual Review of Statistics and Its Application (2018)

Article Computer Science, Information Systems

Data Synthesis based on Generative Adversarial Networks

Noseong Park et al.

PROCEEDINGS OF THE VLDB ENDOWMENT (2018)

Article Automation & Control Systems

Next-Generation Big Data Analytics: State of the Art, Challenges, and Future Research Topics

Zhihan Lv et al.

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS (2017)

Review Statistics & Probability

Variational Inference: A Review for Statisticians

David M. Blei et al.

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (2017)

Article Computer Science, Information Systems

PrivBayes: Private Data Release via Bayesian Networks

Jun Zhang et al.

ACM TRANSACTIONS ON DATABASE SYSTEMS (2017)

Article Computer Science, Information Systems

Data-intensive applications, challenges, techniques and technologies: A survey on Big Data

C. L. Philip Chen et al.

INFORMATION SCIENCES (2014)

Article Computer Science, Artificial Intelligence

A Survey of Discretization Techniques: Taxonomy and Empirical Analysis in Supervised Learning

Salvador Garcia et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2013)

Article Statistics & Probability

Towards Unrestricted Public Use Business Microdata: The Synthetic Longitudinal Business Database

Satkartar K. Kinney et al.

INTERNATIONAL STATISTICAL REVIEW (2011)

Article Computer Science, Information Systems

Hybrid microdata using microaggregation

Josep Domingo-Ferrer et al.

INFORMATION SCIENCES (2010)

Article Computer Science, Artificial Intelligence

An efficient k-means clustering algorithm:: Analysis and implementation

T Kanungo et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2002)