Correlation dendrograms based on word adjacency co-occurrence language network parameters

  • Jan Parvin Bat-og Zoluaga, National Institute of Physics, University of the Philippines Diliman
  • Giovanni Tapang, National Institute of Physics, University of the Philippines Diliman

Abstract

Word adjacency co-occurrence language networks were constructed from Chinese, English, Filipino, German, Japanese, and Korean translations of two texts: the first 20 chapters of Genesis and the Universal Declaration of Human Rights. Parameters measured from these networks were compared across languages using Pearson's r as a measure of similarity. The languages were sorted by correlation value, and dendrograms were constructed to show similarities in network structure. The Korean language network correlated more weakly with the other language networks than those networks did with one another, particularly in average clustering coefficient and network diameter.
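
The pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not give implementation details, so everything here is an assumption for illustration only: the network is taken to be undirected and unweighted, with an edge between any two distinct words that appear next to each other; tokenization is assumed to have been done already; and the function names (build_adjacency_network, average_clustering, diameter, pearson_r) are hypothetical. How the authors assembled the parameter values that were correlated is not stated, so pearson_r simply accepts any two equal-length lists of numbers.

```python
from collections import defaultdict, deque
from math import sqrt


def build_adjacency_network(tokens):
    """Link each word to the word that immediately follows it.

    Assumption: undirected, unweighted edges; repeated pairs and
    self-loops (a word followed by itself) are ignored.
    """
    graph = defaultdict(set)
    for a, b in zip(tokens, tokens[1:]):
        if a != b:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def average_clustering(graph):
    """Mean local clustering coefficient over all nodes.

    Nodes with fewer than two neighbours contribute zero.
    """
    if not graph:
        return 0.0
    total = 0.0
    for neighbours in graph.values():
        k = len(neighbours)
        if k < 2:
            continue
        # Count edges that exist between pairs of neighbours.
        links = sum(
            1 for u in neighbours for v in neighbours
            if u < v and v in graph[u]
        )
        total += 2.0 * links / (k * (k - 1))
    return total / len(graph)


def diameter(graph):
    """Longest shortest path, found by breadth-first search from every node.

    Assumption: the graph is connected, which holds for a network
    built from one continuous token sequence.
    """
    longest = 0
    for source in graph:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in graph[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        longest = max(longest, max(dist.values()))
    return longest


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences.

    Raises ZeroDivisionError if either sequence is constant.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


# Toy usage on a made-up sentence (not data from the study).
toy_tokens = "the cat saw the dog and the cat ran".split()
toy_network = build_adjacency_network(toy_tokens)
toy_parameters = (average_clustering(toy_network), diameter(toy_network))
```

A correlation value r between two languages could then be turned into a distance such as 1 - r and passed to a hierarchical clustering routine to draw a dendrogram; the specific linkage method used in the paper is not given in the abstract.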

Published
2019-05-24
How to Cite
[1] J. P. Zoluaga and G. Tapang. Correlation dendrograms based on word adjacency co-occurrence language network parameters, Proceedings of the Samahang Pisika ng Pilipinas 37, SPP-2019-PB-26 (2019). URL: https://paperview.spp-online.org/proceedings/article/view/SPP-2019-PB-26.