Chapter Black box approaches to genealogical classification and their shortcomings

Bibliographic Details
Main Author: Prokić, Jelena (auth)
Other Authors: Moran, Steven (auth), Saxena, Anju (Editor), Borin, Lars (Editor)
Format: Electronic Book Chapter
Language: English
Published: Berlin/Boston: De Gruyter, 2013
Online Access: OAPEN Library: download the publication
OAPEN Library: description of the publication
Description
Summary: In the past 20 years, the application of quantitative methods in historical linguistics has received a lot of attention. Traditional historical linguistics relies on the comparative method in order to determine the genealogical relatedness of languages. More recent quantitative approaches attempt to automate this process, either by developing computational tools that complement the comparative method (Steiner et al. 2010) or by applying fully automatized methods that take into account very limited or no linguistic knowledge, e.g. the Levenshtein approach. The Levenshtein method has been extensively used in dialectometry to measure the distances between various dialects (Kessler 1995; Heeringa 2004; Nerbonne 1996). It has also been frequently used to analyze the relatedness between languages, such as Indo-European (Serva and Petroni 2008; Blanchard et al. 2010), Austronesian (Petroni and Serva 2008), and a very large sample of 3002 languages (Holman 2010). In this paper we will examine the performance of the Levenshtein distance against n-gram models and a zipping approach by applying these methods to the same set of language data.
ISBN: 9783110305258.429
9783110488081
Access: Open Access