The Quranic Arabic Corpus

http://corpus.quran.com/

 

Welcome to the Quranic Arabic Corpus, an annotated linguistic resource which shows the Arabic grammar, syntax and morphology for each word in the Holy Quran. The corpus provides three levels of analysis: morphological annotation, a syntactic treebank and a semantic ontology.

The Quranic Arabic CorpusWhat is the Quran? A significant religious text written in Quranic Arabic and followed by believers of the Islamic faith. The Quran contains 6236 numbered verses (ayāt) and is divided into 114 chapters. An example verse from the Quran:

Verse (21:30) - Have those who disbelieved not considered that the heavens and the earth were a joined entity, and We separated them and made from water every living thing? Then will they not believe?

 

  • Word-by-Word Quran - maps out the syntax of the entire Quran, with analysis and translation
  • Quranic Grammar - traditional Arabic grammar (إعراب) illustrated using dependency graphs

Ontology of Quranic Concepts

The Quranic Ontology uses knowledge representation to define the key concepts in the Quran, and shows the relationships between these concepts using predicate logic. Named entities in verses, such as the names of historic people and places mentioned in the Quran, are linked to concepts in the ontology.

Ontology of Quranic Concepts

Interested in Joining the Project? (يد الله مع الجماعة)

This project contributes to the research of the Quran by applying natural language computing technology to analyze the Arabic text of each verse. The word-by-word grammar is very accurate, but ensuring complete accuracy is not possible without your help. If you come across a word and you feel that a better analysis could be provided, you can suggest a correction online by clicking on an Arabic word.

Interested in Joining the Project

World map of visitors to the Quranic Arabic Corpus provided by Google Analytics.
Countries with the highest number of visitors are shown in dark green.

The map above shows world-wide interest in the Quranic Arabic Corpus. Each day on average, the website receives 10,000 page views and over 1,500 visitors from 135 different countries. Help us review the information on this website so that together we can build the most accurate linguistic resource for Quranic Arabic.

Quranic Arabic Dependency Treebank (QADT)

The Quranic treebank is an effort to map out the entire grammar of the Quran by linking Arabic words through dependencies. The linguistic structure of verses is represented using mathematical graph theory. The annotated corpus provides a novel visualization of Quranic syntax using dependency graphs.

Quranic Arabic Dependency Treebank

See Also
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值