4.0 Article

Explorations in automated language classification

期刊

FOLIA LINGUISTICA
卷 42, 期 2, 页码 331-354

出版社

WALTER DE GRUYTER GMBH
DOI: 10.1515/FLIN.2008.331

关键词

language classification; lexicostatistics; word stabilities; Swadesh list

向作者/读者索取更多资源

An earlier paper, to which some authors of the present paper have contributed (Brown et al. 2008), describes a method for automating language classification based on the 100-item referent list of Swadesh (1955). Here we discuss a refinement of the method, involving calculation of relative stabilities of list items and reduction of the list to a shorter one by eliminating least stable items. The result is a 40-item referent list. The method for determining stabilities is explained, as well as a method for comparing the classificatory performance of different-sized reduced lists with that of the full 100-item list. A statistical investigation of the relationship of lexical similarity of languages to their geographical proximity is presented. Finally, we test the possibility that information involving typological features of languages can be combined with lexical data to enhance classificatory accuracy.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据