☆ 4.0 Article

Explorations in automated language classification

FOLIA LINGUISTICA (2008)

Journal

FOLIA LINGUISTICA

Volume 42, Issue 2, Pages 331-354

Publisher

WALTER DE GRUYTER GMBH

DOI: 10.1515/FLIN.2008.331

Keywords

language classification; lexicostatistics; word stabilities; Swadesh list

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

An earlier paper, to which some authors of the present paper have contributed (Brown et al. 2008), describes a method for automating language classification based on the 100-item referent list of Swadesh (1955). Here we discuss a refinement of the method, involving calculation of relative stabilities of list items and reduction of the list to a shorter one by eliminating least stable items. The result is a 40-item referent list. The method for determining stabilities is explained, as well as a method for comparing the classificatory performance of different-sized reduced lists with that of the full 100-item list. A statistical investigation of the relationship of lexical similarity of languages to their geographical proximity is presented. Finally, we test the possibility that information involving typological features of languages can be combined with lexical data to enhance classificatory accuracy.

Explorations in automated language classification

Journal

FOLIA LINGUISTICA

Publisher

WALTER DE GRUYTER GMBH

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Explorations in automated language classification

Journal

FOLIA LINGUISTICA

Publisher

WALTER DE GRUYTER GMBH

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper