☆ 3.8 Article

Teaching Tale Types to a Computer: A First Experiment with the Annotated Folktales Collection

FABULA (2023)

期刊

FABULA

卷 64, 期 1-2, 页码 92-106

出版社

WALTER DE GRUYTER GMBH

DOI: 10.1515/fabula-2023-0005

关键词

类别

Folklore

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Computational motif detection in folk narratives is a challenging task due to the fluid nature of motifs and the lack of adequate training data. This study uses the Support Vector Machine algorithm on a test collection of annotated folktales to predict text membership in different categories. The results show high F-1 scores for most tale types, except for type 275, which has a low precision rate despite a perfect recall rate.

Computational motif detection in folk narratives is an unresolved problem, partly because motifs are formally fluid, and because test collections to teach machine learning algorithms are not generally available or big enough to yield robust predictions for expert confirmation. As a result, standard tale typology based on texts as motif strings renders its computational reproduction an automatic classification exercise. In this brief communication, to report work in progress we use the Support Vector Machine algorithm on the ten best populated classes of the Annotated Folktales test collection, to predict text membership in their internationally accepted categories. The classification result was evaluated using recall, precision, and F-1 scores. The F-1 score was in the range 0.8-1.0 for all the selected tale types except for type 275 (The Race between Two Animals), which, although its recall rate was 1.0, suffered from a low precision.

Teaching Tale Types to a Computer: A First Experiment with the Annotated Folktales Collection

期刊

FABULA

出版社

WALTER DE GRUYTER GMBH

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Teaching Tale Types to a Computer: A First Experiment with the Annotated Folktales Collection

期刊

FABULA

出版社

WALTER DE GRUYTER GMBH

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文