- Using tfidf vectorizer, Logistic Regression, and other Machine Learning models aside with preprocessing techniques used for Arabic Language Tweets to classify Arabic Dialect from text.
- Using AraBERT model version 2 for Deep Learning approach and comparing results with Machine Learning Approach using Confussion Matrix, F1-score.
for more info about the repo please check pdf slides, and check Models Directory for results
- First need to download related packages in conda envirnoment
PyTorch Pandas matplotlib scikit-learn transformers pyarabic emoji nltk
- Make sure you activate env where all packages are downloaded.
- Run
ModelTraining_ML.ipynb
,ModelPrediction-AraBert.ipynb
to get models pickle files - After that go the saved pickle files and copy (or cut) paste to static folder for the FastAPI server within folders for ML models, and other for AraBERT model.
static
│ ├───ML_models
│ └───output_dir
- Run
python main.py
- After running your server go to
localhost:5000/docs
in browser, and try out different POST methods with different text - Enjoy your server app