Sura Length and Lexical Probability Estimation in Cluster Analysis of the Qur’an

NU author(s): Dr Hermann Moisl

Abstract

Thabet [2005] applied cluster analysis to the Qur'an in the hope of generating a classification of the suras that is useful for understanding its thematic structure. The result was positive, but variation in sura length was a problem because clustering of the shorter suras was found to be unreliable. The present discussion addresses this problem in four parts. The first part summarizes Thabet's work. The second argues that unreliable clustering of the shorter suras is a consequence of poor estimation of lexical population probabilities in those suras. The third proposes a solution based on calculating a minimum sura length threshold using concepts from statistical sampling theory, then selecting suras and lexical variables based on that threshold. The fourth applies the proposed solution to a reanalysis of the Qur'an.
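The idea of a minimum length threshold can be illustrated with a short sketch. The abstract does not state which sampling-theory formula the article uses, so the code below assumes the textbook sample-size formula for estimating a proportion, n = z² p(1 − p) / e². The function names, the default margin of error, the confidence level and the toy sura lengths are all hypothetical and are not taken from the article.

```python
import math


def min_sample_size(p, margin=0.05, z=1.96):
    """Smallest sample size n at which a proportion p can be estimated
    to within +/- margin at the confidence level implied by z.

    Assumption: textbook formula n = z^2 * p * (1 - p) / margin^2.
    The article's own calculation may differ.
    """
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    if margin <= 0.0:
        raise ValueError("margin must be positive")
    return math.ceil(z ** 2 * p * (1.0 - p) / margin ** 2)


def select_by_length(lengths, threshold):
    """Keep only the texts whose length (in word tokens) meets the threshold."""
    return {name: n for name, n in lengths.items() if n >= threshold}


# Toy, made-up lengths for illustration only; not real sura word counts.
toy_lengths = {"text_a": 100, "text_b": 500, "text_c": 2000}

threshold = min_sample_size(p=0.5)  # worst case, p = 0.5
kept = select_by_length(toy_lengths, threshold)
print(threshold, sorted(kept))
```

Under these assumptions, texts shorter than the threshold are set aside because the relative frequencies of their words would be unreliable estimates of the underlying probabilities. A rarer word (smaller p) needs a tighter margin to be estimated usefully, which raises the required length.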


Publication metadata

Author(s): Moisl H

Publication type: Article

Publication status: Published

Journal: ACM Transactions on Asian Language Information Processing

Year: 2009

Volume: 8

Issue: 4

Date deposited: 15/07/2010

ISSN (print): 1530-0226

ISSN (electronic): 1558-3430

Publisher: Association for Computing Machinery, Inc.

URL: http://doi.acm.org/10.1145/1644879.1644886

DOI: 10.1145/1644879.1644886

