Skip to content

AhmadM-DL/Hadith-Segmentation-Using-Ngrams

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

68 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hadith-Segmentation-Using-Ngrams

image

Islamic ahadith are historical narrations that describe the acts and statements of Prophet Mohammad (PBUH), his household, and his Companions. Those ahadith are collected and compiled into different books by early Islamic scholars. They are still enjoying the attention and are of much importance for historical/Islamic studies. And they are being used by Islamic scholars to extract the sharia law as the Holy Quran is not the only resource for Islamic laws.

Throughout the years, Islamic hadith collectors have developed a unique structure to record hadith. A single hadith is mainly composed of two parts:

  • Sanad (Arabic.<السند>): Which lists the human channel in which the hadith was propagated and is usually called the narrators chain. In other words, it mentions that person x told person y ... that he heard person z said.
  • Maten (Arabic. <المتن> ): Which contain the body of the hadith/statement original sayer (in this project we targeted Mohammad (PBUH) hadith only) that the narrators propagated.

This repository showcases a Natural Language Processing (NLP) Technique that uses Ngrams to predict the split between sanad and maten for a given hadith. As in the below example:

image

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published