covid-19-classification

Overview

This work showcases a binary classifier for chest X-ray images that uses a convolutional neural network to distinguish between COVID-19 and no finding (healthy). In addition, LIME and Grad-CAM explainers are integrated into a web interface.

Prerequisites

python3 and node.js
a set of x-ray images
a trained model for covid-19 classification (see Training)
a model for segmentation of the lungs (trained_model.hdf5 from https://github.com/imlab-uiip/lung-segmentation-2d)

Install

pip install -r requirements.txt
npm install
cd src/frontend
npm install

Training

The classification model is trained from a Jupyter notebook, which also documents the training steps and the required datasets and models.

See training.ipynb.

Usage

The application is divided into a backend and a frontend.

The backend consists of python workers which perform classifications and explanations of chest x-ray images. The API itself is written in javascript (node.js) and merely forwards requests to a python worker (server.py) which starts a thread for each classification/explanation task. The communication between the API and the python worker uses the standard input/output streams and is structured as follows:

A request for a method, e.g. classification, containing an x-ray image payload is sent to the corresponding endpoint, e.g. POST /v1/classifier.
The node.js API (index.js) validates the request, stores the image on disk and assigns a unique identifier to it.
The node.js API sends a single line to the stdin of the server.py-process, consisting of METHOD ID, e.g. classify f00091ff-cb7a.
The server.py-process starts a thread for the specific task and prints METHOD ID RESULT on stdout once the task finishes.
Finally, the API can answer the HTTP-request.

The server.py-process is initialized on startup and kept running for the entire lifecycle of the API-process.
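
The message flow described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the actual server.py: `run_task`, `handle_line` and `serve` are hypothetical names, the `done` result is a placeholder for a real model output, and in-memory streams stand in for the real stdin/stdout.

```python
import io
import threading

ALLOWED_METHODS = {"classify", "explain_lime", "explain_gradcam"}


def run_task(method, image_id):
    """Hypothetical stand-in for the real model call; returns a dummy result."""
    return "done"


def handle_line(line, out, lock):
    """Parse one 'METHOD ID' line and run the task in its own thread."""
    parts = line.split()
    if len(parts) != 2 or parts[0] not in ALLOWED_METHODS:
        return None  # ignore malformed or unknown requests
    method, image_id = parts

    def worker():
        result = run_task(method, image_id)
        # The lock keeps replies from concurrent threads on separate lines.
        with lock:
            out.write(f"{method} {image_id} {result}\n")
            out.flush()

    thread = threading.Thread(target=worker)
    thread.start()
    return thread


def serve(inp, out):
    """Read requests until EOF, then wait for all running tasks."""
    lock = threading.Lock()
    threads = []
    for line in inp:
        thread = handle_line(line, out, lock)
        if thread is not None:
            threads.append(thread)
    for thread in threads:
        thread.join()


# Demo with in-memory streams instead of the real stdin/stdout.
demo_out = io.StringIO()
serve(io.StringIO("classify f00091ff-cb7a\n"), demo_out)
print(demo_out.getvalue(), end="")
```

Because every reply repeats the METHOD and ID of its request, the API can match replies to pending HTTP requests even when tasks finish out of order.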

The frontend is based on react.js.

Start the backend/API:
```
$ node src/index
usage: index [-h] -c MODEL_PATH -s SEGMENTATION_MODEL_PATH --cache-dir-path
            CACHE_DIR_PATH [--disable-api-cache]
            [--api-cache-lifetime API_CACHE_LIFETIME] [-p PORT] [-ip HOST]

Covid-19 Classification API

optional arguments:
  -h, --help            show this help message and exit
  -c MODEL_PATH, --model-path MODEL_PATH
                        path to classification model
  -s SEGMENTATION_MODEL_PATH, --segmentation-model-path SEGMENTATION_MODEL_PATH
                        path to segmentation model (U-Net)
  --cache-dir-path CACHE_DIR_PATH
                        path to cache dir
  --training-dir-path TRAINING_DIR_PATH
                        path to training queue dir
  --disable-api-cache   path to cache dir
  --api-cache-lifetime API_CACHE_LIFETIME
                        api cache lifetime in minutes
  -p PORT, --port PORT  api port
  -ip HOST, --host HOST
                        api host
```
--model-path: required, contains the path to the CNN / classification model.
--segmentation-model-path: required, contains the path to the U-Net, which is used to segment the lungs prior to classification.
--cache-dir-path: required, contains the path to a cache directory. Some tasks such as segmentation need a directory where artifacts such as masks can be stored. The path needs to point to an arbitrary writeable directory.
--training-dir-path: required, contains the path to a training directory. A directory is needed to store new training images, which can be uploaded via the web interface. The path needs to point to an arbitrary writeable directory.
--disable-api-cache: optional, use this flag to disable the api cache. Usually, identical requests (e.g. classification of the same image) are resolved using a cache.
--api-cache-lifetime: optional, default: 5 minutes, use this parameter to change the lifetime of the cache entries.
--port: optional, default: 3005, change the port of the api.
--host: optional, default: localhost, change the host of the api.
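
The effect of the cache options can be illustrated with a small lifetime-based cache. The sketch below is written in Python for illustration only; the real cache lives in the node.js API and may be keyed and stored differently. `ApiCache`, its methods and the SHA-256 key are assumptions, not names from this repository.

```python
import hashlib
import time


class ApiCache:
    """In-memory cache whose entries expire after a fixed lifetime."""

    def __init__(self, lifetime_minutes=5, clock=time.monotonic):
        self.lifetime_seconds = lifetime_minutes * 60
        self.clock = clock  # injectable so the expiry logic can be tested
        self.entries = {}   # key -> (expiry time, result)

    @staticmethod
    def key(method, image_bytes):
        """Identical method and image content give the same key."""
        return method + ":" + hashlib.sha256(image_bytes).hexdigest()

    def get(self, key):
        """Return the cached result, or None if missing or expired."""
        entry = self.entries.get(key)
        if entry is None:
            return None
        expires_at, result = entry
        if self.clock() >= expires_at:
            del self.entries[key]  # entry outlived its lifetime
            return None
        return result

    def put(self, key, result):
        """Store a result that stays valid for the configured lifetime."""
        self.entries[key] = (self.clock() + self.lifetime_seconds, result)
```

With such a scheme, a second classification request for the same image within the lifetime is answered from the cache without contacting the python worker.
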
Start the frontend:
```
cd src/frontend
npm start
```
The frontend will try to start on port 3000 and forward API requests to http://localhost:3005/ if possible. You can change your proxy path to the actual API host/port in frontend/package.json:
```
"proxy": "http://localhost:3005",
```

Python worker interface

Internally, the node.js-API starts the python worker and manages the communication, so there is normally no need to access the worker directly. However, the worker can be run on its own to perform a set of classifications and explanations on many images. Its command-line parameters are a subset of the API-parameters and provide the model files; the actual tasks are passed via stdin.

$ ./src/server.py
usage: server.py [-h] -c MODEL_PATH -s SEGMENTATION_MODEL_PATH
                 --cache-dir-path CACHE_DIR_PATH

Covid-19-Classification Server

The server accepts messages in the form of
"command image_id" e.g. "explain_lime f00091ff-cb7a"
on stdin. Once a command finished, the server
replies with the same message on stdout, 
followed by optional response parameters.
Allowed message types are: "classify", 
"explain_lime" and "explain_gradcam". 

The images have to be located in
"CACHE_DIR_PATH/IMAGE_ID.png".

optional arguments:
  -h, --help            show this help message and exit
  -c MODEL_PATH, --model-path MODEL_PATH
                        path to classification model
  -s SEGMENTATION_MODEL_PATH, --segmentation-model-path SEGMENTATION_MODEL_PATH
                        path to segmentation model (U-Net)
  --cache-dir-path CACHE_DIR_PATH
                        path to cache dir

A single image can be classified using:

echo "classify f00091ff-cb7a" | ./src/server.py -c data/model20200905-193900.h5 -s data/trained_model.hdf5 --cache-dir-path cache
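
For many images, the same approach can be scripted. The following is a minimal sketch, assuming that server.py processes all lines received on stdin and exits after the last task has finished, and that every reply line starts with the method and image ID. `build_commands`, `parse_reply` and `run_batch` are hypothetical helpers, and the format of the result part is not specified here, so it is returned as a raw string.

```python
import subprocess

METHODS = ("classify", "explain_lime", "explain_gradcam")


def build_commands(image_ids, methods=("classify",)):
    """Return one 'METHOD ID' line per image and method."""
    return "".join(f"{m} {i}\n" for i in image_ids for m in methods)


def parse_reply(line):
    """Split 'METHOD ID RESULT' into a tuple; None for unrelated lines."""
    parts = line.split(maxsplit=2)
    if len(parts) < 2 or parts[0] not in METHODS:
        return None
    result = parts[2] if len(parts) == 3 else ""
    return parts[0], parts[1], result


def run_batch(image_ids, methods, model, seg_model, cache_dir):
    """Feed all commands to one server.py process and collect the replies."""
    proc = subprocess.run(
        ["./src/server.py", "-c", model, "-s", seg_model,
         "--cache-dir-path", cache_dir],
        input=build_commands(image_ids, methods),
        capture_output=True, text=True, check=True,
    )
    replies = (parse_reply(line) for line in proc.stdout.splitlines())
    return [reply for reply in replies if reply is not None]
```

A call such as `run_batch(["f00091ff-cb7a"], ("classify", "explain_gradcam"), "data/model20200905-193900.h5", "data/trained_model.hdf5", "cache")` would then load the models once and reuse them for all tasks. As described in the help text above, the images have to be located in CACHE_DIR_PATH/IMAGE_ID.png beforehand.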

Attributions

Icon made by Freepik from www.flaticon.com
