4.5 Article

Content Based Automated File Organization Using Machine Learning Approaches

Journal

CMC-COMPUTERS MATERIALS & CONTINUA
Volume 73, Issue 1, Pages 1927-1942

Publisher

TECH SCIENCE PRESS
DOI: 10.32604/cmc.2022.029400

Keywords

File organization; natural language processing; machine learning

Ask authors/readers for more resources

In the world of big data, organizing files based on their similarities is a challenging task. This research proposes an automated file organization system that categorizes files based on their content similarities using supervised and unsupervised machine learning approaches, and demonstrates its effectiveness and efficiency in real-world experiments.
In the world of big data, it???s quite a task to organize different files based on their similarities. Dealing with heterogeneous data and keeping a record of every single file stored in any folder is one of the biggest problems encountered by almost every computer user. Much of file management related tasks will be solved if the files on any operating system are somehow categorized according to their similarities. Then, the browsing process can be performed quickly and easily. This research aims to design a system to automatically organize files based on their similarities in terms of content. The proposed methodology is based on a novel strategy that employs the charactaristics of both supervised and unsupervised machine learning approaches for learning categories of digital files stored on any computer system. The results demonstrate that the proposed architecture can effectively and efficiently address the file organization challenges using real-world user files. The results suggest that the proposed system has great potential to automatically categorize almost all of the user files based on their content. The proposed system is completely automated and does not require any human effort in managing the files and the task of file organization become more efficient as the number of files grows.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available