NFDI4DS | UHH-SEMS - Publication Details

Language clustering with word co-occurrence networks based on parallel texts

0602 languages and literature 06 humanities and the arts General

DOI: 10.1007/s11434-013-5711-8

Publication Date: 2013-03-22T10:13:12Z

AUTHORS (2)

HaiTao Liu

Jin Cong

ABSTRACT

This study investigates two questions: whether complex networks can be applied to fine-grained language classification, and whether word co-occurrence networks based on parallel texts can substitute for syntactic dependency networks in complex-network-based language classification. Fourteen word co-occurrence networks were constructed from parallel texts of 12 Slavic languages and 2 non-Slavic languages, one network per language. With appropriate combinations of the major parameters of these networks, cluster analysis distinguished the Slavic languages from the non-Slavic ones and correctly grouped the Slavic languages into their respective sub-branches. The clustering also captured the genetic relationships of some of these Slavic languages within their sub-branches. The results show that word co-occurrence networks based on parallel texts are applicable to fine-grained language classification and constitute a more convenient substitute for syntactic dependency networks in complex-network-based language classification.
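
To make the idea of a word co-occurrence network concrete, the following is a minimal sketch, not the authors' procedure. The abstract does not say how words are linked or which network parameters were used, so linking adjacent words within a sentence, treating the network as undirected and unweighted, and the choice of parameters (node count, edge count, average degree, average clustering coefficient) are all assumptions made for illustration. The function names are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_network(sentences):
    """Link each pair of adjacent words in a sentence (assumed linking rule)."""
    graph = defaultdict(set)
    for sentence in sentences:
        words = sentence.lower().split()
        for word in words:
            graph[word]  # register every word as a node, even if isolated
        for left, right in zip(words, words[1:]):
            if left != right:  # skip self-loops from repeated words
                graph[left].add(right)
                graph[right].add(left)
    return dict(graph)


def average_degree(graph):
    """Mean number of distinct neighbours per word."""
    return sum(len(nbrs) for nbrs in graph.values()) / len(graph)


def average_clustering(graph):
    """Mean local clustering coefficient; nodes with < 2 neighbours count as 0."""
    total = 0.0
    for nbrs in graph.values():
        k = len(nbrs)
        if k < 2:
            continue
        # Count linked pairs among this node's neighbours.
        links = sum(1 for a, b in combinations(nbrs, 2) if b in graph[a])
        total += 2 * links / (k * (k - 1))
    return total / len(graph)


def network_parameters(sentences):
    """Summarise one language's text as a small vector of network parameters."""
    graph = build_cooccurrence_network(sentences)
    return {
        "nodes": len(graph),
        "edges": sum(len(nbrs) for nbrs in graph.values()) // 2,
        "average_degree": average_degree(graph),
        "average_clustering": average_clustering(graph),
    }
```

Under these assumptions, calling `network_parameters` on the same parallel text in each language would give one parameter vector per language, and those vectors could then be passed to any standard hierarchical clustering routine.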

REFERENCES (23)

CITATIONS (72)