3.8 Proceedings Paper

Parallel Rule Discovery from Large Datasets by Sampling

Related references

Note: Only part of the references are listed.
Article Computer Science, Theory & Methods

An Overview of End-to-End Entity Resolution for Big Data

Vassilis Christophides et al.

Summary: Entity Resolution (ER) is a critical task for improving data quality and reliability of data analytics by identifying different descriptions referring to the same real-world entity. Despite decades of research, ER remains a challenging problem. This survey highlights the novel aspects of resolving Big Data entities while satisfying multiple Big Data characteristics, and provides an overview of basic concepts, processing steps, and execution strategies proposed by different communities.

ACM COMPUTING SURVEYS (2021)

Article Computer Science, Information Systems

Efficient Discovery of Matching Dependencies

Philipp Schirmer et al.

ACM TRANSACTIONS ON DATABASE SYSTEMS (2020)

Proceedings Paper Computer Science, Information Systems

A Statistical Perspective on Discovering Functional Dependencies in Noisy Data

Yunjia Zhang et al.

SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (2020)

Article Computer Science, Information Systems

Pattern Functional Dependencies for Data Cleaning

Abdulhakim Qahtan et al.

PROCEEDINGS OF THE VLDB ENDOWMENT (2020)

Article Computer Science, Information Systems

Approximate Denial Constraints

Ester Livshits et al.

PROCEEDINGS OF THE VLDB ENDOWMENT (2020)

Article Computer Science, Information Systems

Wander Join and XDB: Online Aggregation via Random Walks

Feifei Li et al.

ACM TRANSACTIONS ON DATABASE SYSTEMS (2019)

Article Computer Science, Information Systems

Discovery of Approximate (and Exact) Denial Constraints

Eduardo H. M. Pena et al.

PROCEEDINGS OF THE VLDB ENDOWMENT (2019)

Article Computer Science, Information Systems

Secure Multi-Party Functional Dependency Discovery

Chang Ge et al.

PROCEEDINGS OF THE VLDB ENDOWMENT (2019)

Proceedings Paper Computer Science, Information Systems

HoloDetect: Few-Shot Learning for Error Detection

Alireza Heidari et al.

SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (2019)

Article Computer Science, Information Systems

Distributed implementations of dependency discovery algorithms

Hemant Saxena et al.

PROCEEDINGS OF THE VLDB ENDOWMENT (2019)

Article Computer Science, Information Systems

Efficient Denial Constraint Discovery with Hydra

Tobias Bleifuss et al.

PROCEEDINGS OF THE VLDB ENDOWMENT (2017)

Article Computer Science, Information Systems

Synthesizing Entity Matching Rules by Examples

Rohit Singh et al.

PROCEEDINGS OF THE VLDB ENDOWMENT (2017)

Article Computer Science, Artificial Intelligence

A new approach for generating efficient sample from market basket data

B. Chandra et al.

EXPERT SYSTEMS WITH APPLICATIONS (2011)

Article Computer Science, Artificial Intelligence

Locality sensitive hashing for sampling-based algorithms in association rule mining

Chyouhwa Chen et al.

EXPERT SYSTEMS WITH APPLICATIONS (2011)

Article Computer Science, Artificial Intelligence

Discovering Conditional Functional Dependencies

Wenfei Fan et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2011)

Article Computer Science, Hardware & Architecture

Dynamic constraints for record matching

Wenfei Fan et al.

VLDB JOURNAL (2011)

Article Computer Science, Information Systems

A new sampling technique for association rule mining

Basel A. Mahafzah et al.

JOURNAL OF INFORMATION SCIENCE (2009)

Article Computer Science, Information Systems

Conditional functional dependencies for capturing data inconsistencies

Wenfei Fan et al.

ACM TRANSACTIONS ON DATABASE SYSTEMS (2008)

Review Physics, Multidisciplinary

Power laws, Pareto distributions and Zipf's law

MEJ Newman

CONTEMPORARY PHYSICS (2005)

Article Computer Science, Artificial Intelligence

Identifying approximate itemsets of interest in large databases

CQ Zhang et al.

APPLIED INTELLIGENCE (2003)