This repository contains a structured collection of sacred texts from Hinduism, formatted in JSON for easy access and analysis. It includes texts from the 'Ramcharitmanas' , 'Srimad Bhagavad Gita', 'Valmiki Ramayana' , 'Rigveda' , 'Yajurveda' and the 'Atharvaveda'.
The dataset is organized into two main directories: Ramcharitmanas
, Srimad Bhagavad Gita
, Mahabharata
, Valmiki Ramayana
, Rigveda
, Yajurveda
, and Atharvaveda
, each containing JSON files for different chapters or काण्ड.
Text | Structure Details | Estimated Number of Verses | Source |
---|---|---|---|
Ramcharitmanas | Composed of chaupais, dohas (sorthas), and various chhands | Approximately 10,000 chaupais | IIT Kanpur Ramcharitmanas Project |
Srimad Bhagavad Gita | Verses (shlokas) in classical Sanskrit | Approximately 700 | Gita Supersite by IIT Kanpur |
Mahabharata | 18 books divided into multiple chapters | Approximately 1,00,000 | Sacred Texts Mahabharata |
Valmiki Ramayana | Verses (shlokas) in classical Sanskrit | Approximately 24,000 | GitHub Repository - Ramayana Book |
Rigveda | Hymns (suktas) divided into mandalas | Over 10,000 (including all Mandalas) | Vedic Heritage Portal |
Yajurveda Shukla | - Vajasaneyi Madhyandina Samhita: Prose and verses (shlokas) - Vajasaneyi Kanva Samhita: Prose and verses (shlokas) |
Madhyandina: Approximately 1,900 Kanva: Approximately 1,980 |
Vedic Heritage Portal |
Yajurveda Krishna (to be added) | Prose and verses (shlokas), often mixed with Brahmana and Aranyaka sections | Varies widely | - |
Atharvaveda | Hymns and prose | Approximately 6,000 | Vedic Heritage Portal |
The Ramcharitmanas
directory contains the following files representing each of the seven काण्डs:
- 1_बाल_काण्ड_data.json - Bal Kānd
- 2_अयोध्या_काण्ड_data.json - Ayodhya Kānd
- 3_अरण्य_काण्ड_data.json - Aranya Kānd
- 4_किष्किंधा_काण्ड_data.json - Kishkindha Kānd
- 5_सुंदर_काण्ड_data.json - Sundar Kānd
- 6_लंका_काण्ड_data.json - Lanka Kānd
- 7_उत्तर_काण्ड_data.json - Uttar Kānd
The SrimadBhagvadGita
directory contains files for each of the 18 chapters of the Bhagavad Gita:
- bhagavad_gita_chapter_1.json - Chapter 1
- bhagavad_gita_chapter_2.json - Chapter 2
- bhagavad_gita_chapter_3.json - Chapter 3
- bhagavad_gita_chapter_4.json - Chapter 4
- bhagavad_gita_chapter_5.json - Chapter 5
- bhagavad_gita_chapter_6.json - Chapter 6
- bhagavad_gita_chapter_7.json - Chapter 7
- bhagavad_gita_chapter_8.json - Chapter 8
- bhagavad_gita_chapter_9.json - Chapter 9
- bhagavad_gita_chapter_10.json - Chapter 10
- bhagavad_gita_chapter_11.json - Chapter 11
- bhagavad_gita_chapter_12.json - Chapter 12
- bhagavad_gita_chapter_13.json - Chapter 13
- bhagavad_gita_chapter_14.json - Chapter 14
- bhagavad_gita_chapter_15.json - Chapter 15
- bhagavad_gita_chapter_16.json - Chapter 16
- bhagavad_gita_chapter_17.json - Chapter 17
- bhagavad_gita_chapter_18.json - Chapter 18
There are 18 books in mahabharata consiting of around 1 Lakhs shlokas, available in
Mahabharata folder
All 7 kaands (~24,000 shlokas) of Valmiki Ramayana are present in the directory. Each kānd is available as a separate JSON file, detailing the various phases of Lord Rama's life:
- Balakanda - बालकाण्ड: Describes the birth of Rama, his childhood and marriage to Sita.
- Ayodhyakanda - अयोध्याकाण्ड: Details Rama's exile, the preparations for his coronation, and his departure to the forest.
- Aranyakanda - अरण्यकाण्ड: Chronicles the forest life of Rama and his encounters with sages and demons.
- Kishkindhakanda - किष्किंधाकाण्ड: Covers the meeting of Rama with Hanuman and the vanara (monkey) kingdom of Kishkindha.
- Sundarakanda - सुंदरकाण्ड: Depicts Hanuman's journey to Lanka, his meeting with Sita, and his fiery escape.
- Yudhhakanda - युद्धकाण्ड: Describes the great war between Rama's army and the forces of Ravana.
- Uttarakanda - उत्तरकाण्ड: Talks about Rama's life after returning to Ayodhya, his coronation, and the banishment of Sita.
Credits to Ramayana book for providing the data.
The Rigveda
directory has been added to the repository, containing JSON files for each of the ten Mandalas:
- rigveda_mandala_1.json - Mandala 1
- rigveda_mandala_2.json - Mandala 2
- rigveda_mandala_3.json - Mandala 3
- rigveda_mandala_4.json - Mandala 4
- rigveda_mandala_5.json - Mandala 5
- rigveda_mandala_6.json - Mandala 6
- rigveda_mandala_7.json - Mandala 7
- rigveda_mandala_8.json - Mandala 8
- rigveda_mandala_9.json - Mandala 9
- rigveda_mandala_10.json - Mandala 10
The Yajurveda
directory has been added to the repository, containing JSON files for the Shukla Yajurveda Samhitas:
- Vajasaneyi Madhyandina Samhita
- Vajasaneyi Kanva Samhita
The Vajasaneyi Madhyandina Samhita
directory contains JSON files for the chapters of the Vajasaneyi Madhyandina Samhita:
- vajasaneyi_madhyandina_samhita.json - Complete Samhita
The Vajasaneyi Kanva Samhita
directory contains JSON files for the chapters of the Vajasaneyi Kanva Samhita:
- vajasaneyi_kanva_samhita_chapters.json - Chapters 2 to 40 (Note: Chapter 1 is missing)
There are 20 kaandas in atharvaveda which are present in Atharavaveda Repo
To use this dataset, simply clone the repository and refer to the individual JSON files as needed. The files are named in a way that should be self-explanatory, with each file corresponding to a specific chapter or काण्ड.
We plan to expand this collection by including the following texts:
- Sam Veda
- Krishna Yajurveda Samhita
- The missing chapter 1 of the Vajasaneyi Kanva Samhita
These will be added in a similar structured format, making it easy for users to access and study these ancient texts.
Contributions to this dataset are welcome. Please submit a pull request or raise an issue if you find any errors or have suggestions for improvements.
This dataset is made available under the Open Database License (ODbL). By using this dataset, you agree to the terms of the license.