Welcome to this meticulously curated list of resources dedicated to Natural Language Processing (NLP) in the Arabic language! Arabic is known for its complexity and richness, making advancements in NLP for this language a challenging yet rewarding endeavor.
In this repository, youโll find a wide array of resources including academic papers, tools, datasets, libraries, and best practices, all specifically tailored to Arabic NLP. Whether you are a researcher, developer, or someone simply interested in applying NLP techniques to Arabic text, this list is an invaluable resource.
The resources included cover a broad spectrum of topics ranging from syntactic analysis, machine translation, named entity recognition, text classification, and much more, all while addressing the unique challenges and characteristics of the Arabic language.
Feel free to contribute to this awesome list by submitting a pull request or suggesting new resources. Together, we can build a comprehensive and up-to-date repository that benefits the entire community working on Arabic NLP. Enjoy your learning journey!
- GitHub projects
- Articles & Blogs
- Online Courses
- Books
- Research Papers
- Videos
- Tools & Software
- Conferences & Events
- Slides & Presentations
- Podcasts
- arabert : Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA) ๐
- ARBML : Implementation of many Arabic NLP and CV projects. Providing real-time experience using many interfaces like web, command line, and notebooks. ๐ป
- Shakkala : Deep learning for AR text Vocalization - ุงูุชุดููู ุงูุงูู ูููุตูุต ุงูุนุฑุจูุฉ ๐
- arabic-stop-words : Largest list of Arabic stop words on GitHub. ุฃูุจุฑ ูุงุฆู ุฉ ูู ุณุชุจุนุฏุงุช ุงูููุฑุณุฉ ุงูุนุฑุจูุฉ ุนูู ุฌูุช ูุงุจ ๐
- Hadith-Data-Sets : All Hadith With Tashkil and Without Tashkeel from the Nine Books that are 62,169 Hadith. ๐
- ar-php : Set of functionalities enable Arabic website developers to serve professional search, present, and process Arabic content in PHP ๐
- Maha : Maha is a text processing library specially developed to deal with Arabic text. ๐
- tajmeeaton : ุชุฌู ูุนุฉ ู ู ุงูู ุดุงุฑูุนุ ูุฎุตูุตุง ู ูุชูุญุฉ ุงูู ุตุฏุฑุ ูููููุถ ุจุงููุบุฉ ุงูุนุฑุจูุฉ ูุงูุฃู ุฉ. ๐จโ๐ป ๐จโ๐ฌ๐จโ๐ซ๐ง
- SOQAL : Arabic Open Domain Question Answering System using Neural Reading Comprehension โ
- Arabic-BERT : Arabic edition of BERT pretrained language models
- ARBML/masader ๐๐๐ The largest public catalogue for Arabic NLP and speech datasets. Includes +500 datasets annotated with more than 25 attributes.
- Qutuf/Qutuf ๐๐๐ Qutuf (ููุทูููู): An Arabic Morphological analyzer and Part-Of-Speech tagger as an Expert System.
- motazsaad/process-arabic-text ๐๐๐ Pre-process Arabic text (remove diacritics, punctuations, and repeating characters).
- UBC-NLP/marbert ๐๐๐ UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic.
- MagedSaeed/farasapy ๐๐๐ A Python implementation of the Farasa toolkit.
- saidziani/Arabic-News-Article-Classification ๐๐๐ Automatic categorization of documents based on their content using Supervised Machine Learning.
- iamaziz/ar-embeddings ๐๐๐ Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec.
- mohabmes/Arabycia ๐๐ Arabic NLP tool for Text Search, POS tagging, Translation, auto-diacritization, and more.
- adhaamehab/textblob-ar ๐๐ Arabic support for textblob library.
- motazsaad/arabic-sentiment-analysis ๐๐ Sentiment Analysis in Arabic tweets.
- GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP: A large-scale evaluation of ChatGPT on various Arabic NLP tasks, showcasing its performance compared to dedicated models finetuned on Arabic.
- Diving into Arabic NLP: A Beginner's Guide to Dialect Identification: An exploration of the challenges and dynamics of Arabic dialect identification in NLP.
- Camel_tools, a Python Toolkit for Arabic NLP: A step-by-step explanation of how to use camel_tools for sentiment analysis in Arabic comments.
- Taqyim: Evaluating Arabic NLP Tasks Using ChatGPT Models: An assessment of GPT-3.5 and GPT-4 models on seven distinct Arabic NLP tasks, highlighting their performance and insights into sentiment analysis.
- Masader Plus: A New Interface for Exploring+ 500 Arabic NLP: Introducing Masader Plus, a web interface for browsing and exploring the Masader catalog of Arabic NLP datasets.
- Machine learning advancements in Arabic NLP: An overview of effective tools and code examples for Arabic NLP, focusing on social media text.
- Automate Arabic NLP with Apache Spark: Tutorial with code on combining big data with Arabic NLP using Apache Spark and Spark NLP for distributed computing.
- Arabic NLP: Unique Challenges and Their Solutions: A concise overview of the challenges and tools for working with Arabic text in NLP projects, including pre-processing using the camel-tools Python package.
- Building Yarub Library for Arabic NLP Purposes: Insights and interesting code snippets on building a library for Arabic NLP using Yarub.
- Arabic Sentence Embeddings with Multi-Task Learning: Tutorial on creating Arabic sentence embeddings with Multi-Task Learning for efficient Semantic Textual Similarity.
- AraBERT: Transformer-based Model for Arabic Language: This paper introduces AraBERT, a BERT-based model specifically pre-trained for Arabic language understanding, achieving state-of-the-art results in various Arabic NLP tasks like Sentiment Analysis and Named Entity Recognition.
- A Panoramic Survey of Natural Language Processing in the Arab: This survey explores the field of Natural Language Processing (NLP) in Arabic, discussing its unique challenges compared to artificially constructed languages. It covers a wide range of NLP applications and their connections to different disciplines.
- Post-hoc analysis of Arabic transformer models: This article analyzes and compares transformer models trained on different dialects of Arabic. The study investigates how linguistic information is encoded in these models, revealing insights about word morphology, syntactic dependencies, and capturing nuances of Arabic dialects.
- Machine Learning and NLP For Arabic: Part Of Speech Tagging: This article discusses the use of Deep Learning, specifically Recurrent Neural Networks, for Part-Of-Speech tagging in Arabic texts. It focuses on building open-source NLP libraries and tools for the Arabic language.
- AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural Language Processing: This work presents AraMUS, the largest Arabic Pre-trained Language Model (PLM) with 11B parameters, trained on a vast amount of high-quality Arabic textual data. AraMUS achieves state-of-the-art performances on various Arabic classification and generative tasks.
- Arabic Sentiment Analysis. An Illustrative guide on how to perform sentiment analysis on Arabic texts: This article provides a guide on performing sentiment analysis specifically on Arabic texts, discussing the process and techniques involved.
- How to Detect and Translate Languages for NLP Project: This article discusses language challenges that Data Scientists or Machine Learning Engineers may face while working on NLP projects, focusing on detecting and translating multiple languages to a single language.
- Arabic natural language processing: An overview: This survey provides an overview of the work done on Arabic natural language processing, covering all three main varieties of Arabic and discussing recent resources and tools. It classifies the research papers based on the language variety and associated techniques.
- Comparing Arabic dialect datasets: This blog post compares Arabic dialect datasets for classification tasks, highlighting the use of embeddings and data flow visualization.
- Sentiment Analysis in Arabic: This article compares open-source libraries for sentiment analysis in Arabic and explores analyzing text in Arabic using Google AutoML tools.
- ููุบุฉ ุงูุนุฑุจูุฉ NLP - Arabic Natural Language Processing | Udemy course that will prepare you to use Natural Language Processing for the Arabic Language. We will cover the fundamentals of Machine Learning/ NLP.
- ุงูุจุฑู ุฌุฉ ุงููุบููุฉ ุงูุนุตุจูุฉ | NLP | Coursera
- Natural Language Processing Specialization 4 courses Offered by deeplearning.ai. Natural Language Processing (NLP) uses algorithms to understand and manipulate human language. This technology is one of the most broadly applied areas of machine learning. By the end of this Specialization, you will be ready to design NLP applications that perform question-answering and sentiment analysis, create tools to translate languages and summarize text, and even build chatbots.
- Natural Language Processing Course | Udacity
- Natural Language Processing with Attention Models Course
- Natural Language Processing in TensorFlow | Coursera
- NLP - Natural Language Processing with Python | Udemy
- NLP ุงูุจุฑู ุฌุฉ ุงููุบููุฉ ุงูุนุตุจูุฉ | Udemy
- Learn BERT - most powerful NLP algorithm by Google | Udemy: Understand and apply Google's game-changing NLP algorithm, BERT, to real-world tasks. Build 2 NLP applications.
- Introduction to Arabic Natural Language Processing | SpringerLink โญ๏ธ This book provides system developers and researchers in NLP and computational linguistics with necessary information for working with the Arabic language.
- Recent Advances in NLP: The Case of Arabic Language โญ๏ธ This book addresses common challenges in intelligent natural language processing related to the Arabic language.
- Arabic natural language processing for Qur'anic research: a โญ๏ธ his article focuses on Arabic natural language processing for Qur'anic research and its implications.
- Arabic Language Processing: From Theory to Practice: 6th โญ๏ธ This book includes revised selected papers from the 6th International Conference on Arabic Language Processing, covering theory and practice.
- ุงูุจุฑู ุฌุฉ ุงููุบููุฉ ุงูุนุตุจูุฉ NLP (Arabic Edition) eBook - Amazon.com Arabic edition of the book "Neural Linguistic Programming (NLP)", specifically tailored to Arabic NLP.
- Natural language processing: state of the art, current trends and A research article discussing the state of the art in natural language processing, including work done on Arabic language analysis.
- Improve Your Writing With NLP: Judith E Pearson: 9781845908614 This book explores NLP strategies that can transform novice writers into accomplished ones, utilizing resourceful states, beliefs, cognitive strategies, and behaviors of highly skilled writers.
- Advances in Natural Language Processing: 7th International A book in the Lecture Notes in Computer Science series, covering advances in natural language processing.
- Anaphor Resolution in Arabic using A Hybrid approach: A Machine This book focuses on Arabic anaphora resolution, an important task in NLP applications such as machine translation and information retrieval.
- A Lexical Database for Modern Standard Arabic Interoperable with a Presents a lexical database for Modern Standard Arabic that addresses the issue of including both obsolete and current words.
- Tharawat: A Vision for a Comprehensive Resource for Arabic A paper describing a vision for a unified lexical resource for computational processing of Arabic, including its variants.
- Multilingual Natural Language Processing Applications: From This book covers important contributions in multilingual NLP applications from leading researchers in the field.
- Support Vector Machines based Arabic Language Text Explores the effectiveness of feature selection methods for Arabic language text classification using Support Vector Machines.
- Arabic aspect sentiment polarity classification using BERT | Journal Discusses the application of Neural Networks for sentiment analysis in Arabic NLP.
- GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP : A large-scale evaluation of ChatGPT on a broad range of Arabic NLP tasks, revealing its performance on 32 diverse natural language understanding and generation tasks.
- Taqyim: Evaluating Arabic NLP Tasks Using ChatGPT Models : Assessing the performance of GPT-3.5 and GPT-4 models on seven distinct Arabic NLP tasks, including sentiment analysis, translation, and summarization.
- Masader Plus: A New Interface for Exploring 500+ Arabic NLP Datasets : Introducing Masader Plus, a web interface for browsing and exploring Arabic NLP datasets, offering data exploration, filtration, and a simple API.
- AraVec: A set of Arabic Word Embedding Models for use in Arabic NLP : AraVec provides pre-trained distributed word representation models for use in Arabic NLP research.
- A Comparative Study on Various Deep Learning Techniques for NLP : Comparing the performance of different deep learning techniques, such as LSTM and LSTM-CRF, for tasks at the syntactic level in NLP, specifically focusing on Arabic.
- Arabic Natural Language Processing: An Overview : A survey focusing on recent research papers that present and classify advancements in Arabic NLP.
- A Panoramic Survey of Natural Language Processing in the Arab World : An overview of natural language processing (NLP) and its application to natural languages such as Arabic, highlighting the interdisciplinary nature of NLP.
- Arabic Natural Language Processing for Qur'anic Research : Leveraging NLP techniques for facilitating Qur'anic research.
- Imam: Word Embedding Model for Islamic Arabic NLP : Introducing an efficient distributed word representation model for various NLP tasks related to the Islamic domain.
- The Quest for NLP Applications and Tools: The Case of Standard Arabic and Dialects : Exploring research efforts in Arabic NLP, focusing on tools, applications, and resources related to both modern standard Arabic and Arabic dialects.
- AraBERT: Transformer-based Model for Arabic Language - This paper presents AraBERT, a pre-trained BERT model specifically designed for Arabic language understanding. It achieves state-of-the-art performance on various Arabic NLP tasks.
- Introduction to Arabic Natural Language Processing - This book provides an introduction to Arabic Natural Language Processing (NLP) by Nizar Habash, a research scientist at Columbia University.
- Arabic NLP tools for ontology construction from Arabic text - This paper discusses the use of NLP techniques for ontology construction from Arabic text, focusing on the effectiveness of various preprocessing techniques.
- Graph-based Arabic NLP Techniques: A Survey - This paper provides a survey of graph-based techniques used in Arabic NLP, highlighting their role in improving NLP applications.
- A Review Study for Arabic Machine Learning and Deep Learning - This research survey focuses on the use of Deep Learning (DL) methods in Arabic NLP and highlights the evolution of research in this field.
- AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural Language Processing - This paper presents AraMUS, the largest Arabic Pre-trained Language Model (PLM) with state-of-the-art performance on Arabic classification and generative tasks.
- Exploring NLP web APIs for building Arabic systems - This paper explores available NLP Web APIs that support the Arabic language and discusses their strengths, weaknesses, and potential future use.
- Comparative Study of the Most Useful Arabic-supporting Natural - This paper presents a comparative study of Python and Java libraries for Arabic Natural Language Processing (ANLP) and deep learning, focusing on their suitability for handling Arabic language specificities.
- Deep learning for Arabic NLP: A survey - This survey explores the use of Deep Learning (DL) techniques for NLP, focusing on the Arabic language due to its importance and scarcity of research in this area.
- Arabic Natural Language Processing: Challenges and Solutions - This paper discusses the challenges faced by researchers and developers in building NLP applications for Arabic text and speech.
- Sakhar AlKhereyf - Intro to Arabic NLP | PyData Jeddah - YouTube
- Grammarly AI-NLP Club #8 - Arabic Natural Language Processing - YouTube - Apr 3, 2019 Speaker: Nizar Habash, an Associate Professor of Computer Science at New York University Abu Dhabi Presentation:\xa0
- A Short Introduction to Arabic Natural Language Processing by Dr - YouTube
- A Short Introduction to Arabic Natural Language Processing (Nizar - YouTube - Oct 16, 2022 PLENARY PRESENTATION A Short Introduction to Arabic Natural Language Processing Nizar Habash New York University Abu Dhabi Abstract: Arabic\xa0
- Introduction to Arabic Natural Language Processing (Nizar Habash - YouTube - Nov 6, 2022 Learn more.
- NLP with Friends, Featured Friend: Zaid Alyafeai - YouTube - Aug 11, 2021 Fostering Research in Arabic NLP via Open Source Collaboration.
- Nizar Habash: Introduction to Arabic Natural Language Processing - YouTube - Nov 6, 2022 Learn more. Open App. Nizar Habash: Introduction to Arabic Natural Language Processing - Part 2. 65 views\xa0
- Stanford CS224N: Natural Language Processing with Deep - YouTube - Jun 22, 2022 For more information about Stanford's Artificial Intelligence professional and graduate programs, visit: https://stanford.io/ai. For more information about Stanfordโs Artificial Intelligence professional and graduate programs, visit: https://stanford.io/ai
- Grammarly AI-NLP Club - YouTube - The idea of Grammarly AI-NLP Club is to bring together Ukrainian and international leaders in the field, specialists interested in AI, Machine Learning,\xa0
- Conversational AI / NLP - YouTube - Aug 28, 2023 Resources for building and deploying state-of-the-art GPU-accelerated conversational #AI applications. . . . #speechrecognition #NLP #texttospeech
- Games and NLP - YouTube: Stroll with a Scroll - Coreference Annotation of an Arabic Corpus using a Virtual World Game. Play games and help with research in Natural Language Processing.
- Games and NLP - YouTube: Learn about LingoTowns, a game-with-a-purpose developed by DALI that features mini games for Natural Language Processing research.
- ChrisMcCormickAI - YouTube: Easy-to-follow tutorials on Machine Learning and NLP topics.
- Nizar Habash: Introduction to Arabic Natural Language Processing - Part 2: Introduction to Arabic NLP by Nizar Habash.
- Weekly Research Seminar - Prof. Nizar Habash - Arabic & AI: Research seminar on Arabic NLP by Prof. Nizar Habash.
- TensorFlow Meetup tunis - Arabic Natural Language Processing: Exploring various NLP applications in Arabic using TensorFlow.
- 13 - NLP POS Tagging Arabic NLTK - YouTube: Lecture on POS tagging in Arabic NLP.
- Arabic - NLP (Arabic Text classification - Multiclass - Keras - Neural): Video tutorial on Arabic text classification using Keras and Neural Networks.
- Nizar Habash Introduction to Arabic Natural Language Processing - Part 1: Part 1 of the introduction to Arabic NLP by Nizar Habash.
- KALIMAT Multipurpose Arabic Corpus: A corpus that could be of help for researchers working on Arabic NLP. It consists of 20,291 Arabic articles collected from the Omani newspaper Alwatan.
- EASC (Essex Arabic Summaries Corpus): Arabic natural language resources containing 153 Arabic articles and 765 human-generated extractive summaries of those articles.
- Khawas: An Arabic Corpora Processing Tool for analyzing Arabic corpora.
- NLTK: A leading platform for building Python programs to work with human language data, including NLP libraries and an active discussion forum.
- Stanford CoreNLP: A Java suite of core NLP tools, providing linguistic annotations such as tokenization, parts of speech, named entities, sentiment analysis, and more.
- Arabic Corpus: A collection of more than 460 Arab books that can be used for language engineering applications.
- Osman Arabic Text Readability: An open-source tool for measuring Arabic text readability, allowing users to calculate readability for Arabic text with or without diacritics.
- Alkhalil Morpho Sys: A morphosyntactic parser for Arabic words that can process both vocalized and non-vocalized texts.
- Best Natural Language Understanding (NLU) Software in 2023 | G2: Natural language understanding (NLU), a form of natural language processing (NLP), allows users to better understand text through machine learning. This website provides real-time, up-to-date product reviews from verified users to help you choose the right NLU software.
- spaCy download | SourceForge.net: spaCy is an industrial-strength NLP library built on the latest research for advanced NLP in Python and Cython. It is designed for real-world applications and can be used for building products and gaining insights.
- Natural Language Toolkit download | SourceForge.net: The Natural Language Toolkit (NLTK) is a library for NLP. It provides tools and resources for tasks such as tokenization, stemming, lemmatization, parsing, semantic reasoning, and more.
- Find Intellectual Discussion Events & Groups in Dubai, AE: Intellectual discussion events near Dubai, AE. Connect with people who share your interests and attend online or in-person events.
- Education & Training Events in Israel, List of all Israel Educational: Discover upcoming education and training events in Israel. Find conferences, talks, and seminars related to NLP for Hebrew and Arabic.
- Find Learn Coding Events & Groups in Dubai, AE: Explore learn coding events near Dubai, AE. Connect with like-minded individuals and join online or in-person events.
- 10th World Machine Learning and Deep learning Conference : November 09-10, 2023 Stockholm, Sweden : Renowned Speakers : Dr. Ebrahim Al Alkeem Al Zaabi.
- Conference in Prague : June 28, 2022 : A conference discussing various topics related to NLP and Arabic text analysis.
- MoM2010: Arabic natural language processing: A presentation on the challenges and advancements in Arabic Natural Language Processing, covering grammar, syntactic analysis, and more. ๐
- Grammarly AI-NLP Club #8 - Arabic Natural Language Processing: Challenges and Solutions: A comprehensive discussion on the unique challenges faced in Arabic NLP and the potential solutions. ๐
- Processing Arabic Text: Slides introducing Natural Language Understanding (NLU) with a focus on Arabic text. ๐
- Madad: Arabic Annotation Tool for Arabic Text: An overview of the Madad annotation tool specifically designed for Arabic NLP tasks. ๐
- Benchmarking transfer learning approaches for NLP: A presentation calling for collaboration in applied transfer learning for Arabic text classification tasks. ๐
- Spark NLP: State of the Art Natural Language Processing at Scale: Slides exploring the capabilities and advancements of Spark NLP for Arabic NLP. ๐
- Language Detection Library for Java: An informative presentation on a language detection library for Java, with a focus on distinguishing between Persian and Arabic languages. ๐
- Towards Responsible NLP: Walking the walk: A thought-provoking presentation on responsible NLP practices with insights into how they can be applied to a polyglossic language like Arabic. ๐
- NLP Literature Survey with focus on Computerized Deception Detection: A literature survey on NLP techniques in the context of computerized deception detection. ๐
- Applied NLP - Poster: A poster presentation showcasing the application of NLP techniques, including Arabic natural language processing. ๐
- NLP Literature Survey with focus on Computerized Deception Detection - A literature survey on NLP with a focus on computerized deception detection.
- Habash: Arabic Natural Language Processing - A presentation on Arabic natural language processing, highlighting its challenges and techniques.
- Lightweight Natural Language Processing (NLP) - A presentation introducing lightweight approaches to natural language processing.
- Usage of regular expressions in NLP - A presentation discussing the use of regular expressions in NLP and their applications.
- Grammarly AI-NLP Club #4 - Understanding and assessing language with neural network models - Marek Rei - A presentation on understanding and assessing language using neural network models.
- NLP Based Text Summarization Using Semantic Analysis - A presentation on text summarization using semantic analysis techniques in NLP.
- NLP communication skills! - A presentation focusing on NLP techniques for effective communication and building rapport.
- Stemming And Lemmatization Tutorial | Natural Language Processing (NLP) With Python - A tutorial on stemming and lemmatization in NLP using Python.
- NLP pipeline in machine translation - A presentation discussing the components and process of NLP pipeline in machine translation.
- NLP - An overview of NLP basics, including focus on POS tagging and token classification in Bengali language.
- NLP Highlights: In this podcast, researchers from the AllenNLP team at Allen Institute for AI discuss their work in various areas of natural language processing.
- NLP MasterCLASS: Hosted by NLP Master Trainers Tina Taylor and Steve Crabb, this podcast explores the amazing ways to use NLP for personal and professional changes.
- NLP Talks with Laura Evans: Hosted by Laura Evans, an International Trainer of NLP, this podcast features insightful interviews with people who know how NLP changes lives and offers tips and strategies for success.
- The Brain Language Podcast: This podcast introduces NLP concepts that enhance personal and business life, providing golden nuggets of NLP knowledge.
- WARA Media & Language Podcast: In this podcast, you will hear the latest research within AI in the field of Media, Language, and Gaming, with insights from industry leaders and tech companies.
- Microsoft Research Podcast: This podcast brings you conversations with researchers at Microsoft, discussing cutting-edge advancements in technology.
- NLP Talks: A podcast in Greek about Neuro-Linguistic Programming (NLP) by Athens NLP Studies, aiming to help individuals create positive and lasting changes in their lives.
- Women in AI: A biweekly podcast featuring leading female minds in AI, Deep Learning, and Machine Learning, discussing cutting-edge work, technological advancements, and the impact of AI for social good and diversity in the workplace.
This initial version of the Awesome List was generated with the help of the Awesome List Generator. It's an open-source Python package that uses the power of GPT models to automatically curate and generate starting points for resource lists related to a specific topic.