4.5 Article

Prediction and Visualisation of SICONV Project Profiles Using Machine Learning

Journal

SYSTEMS
Volume 10, Issue 6, Pages -

Publisher

MDPI
DOI: 10.3390/systems10060252

Keywords

accountability; machine learning; t-SNE; PCA; BPMN; SICONV; MLR3

Funding

  1. CNPq, Brazil [304818/2018-6, 305223/2014-3, 309525/2021-7]
  2. Ministry of Agriculture, Livestock and Supply-MAPA within Ministry of Agriculture
  3. Instituto Federal Goiano-IF Goiano [TED 05/2020-MAPA/IF Goiano]

Ask authors/readers for more resources

This study describes the conception and evaluation of a system for defining and predicting project profiles. By analyzing data from government-funded SICONV projects in Brazil, data clustering was achieved and ten project profiles were defined. Among multiple prediction models, the k-Nearest-Neighbor model performed the best with high accuracy.
Background: Inefficient use of public funds can have a negative impact on the lives of citizens. The development of machine learning-based technologies for data visualisation and prediction has opened the possibility of evaluating the accountability of publicly funded projects. Methods: This study describes the conception and evaluation of the architecture of a system that can be utilised for project profile definition and prediction. The system was used to analyse data from 20,942 System of Management of Agreements and Transfer Contracts (SICONV) projects in Brazil, which are government-funded projects. SICONV is a Brazilian Government initiative that records the entire life cycle of agreements, transfer contracts, and partnership terms, from proposal formalisation to final accountability. The projects were represented by seven variables, all of which were related to the timeline and budget of the project. Data statistics and clustering in a lower-dimensional space calculated using t-SNE were used to generate project profiles. Performance measures were used to test and compare several project-profile prediction models based on classifiers. Results: Data clustering was achieved, and ten project profiles were defined as a result. Among 25 prediction models, k-Nearest-Neighbor (kknn) was the one that yielded the highest accuracy (0.991 +/- 0.002). Conclusions: The system predicted SICONV project profiles accurately. This system can help auditors and citizens evaluate new and ongoing project profiles, identifying inappropriate public funding.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available