MTP multi gpu version with only one scratch pad
This release follow up over the merging of the branch test into master.
There is a slight difference as this time, the several gpu loop are performed before checking for work update.
ccminer.exe is compiled with cuda 10 and compute_61,sm_61
ccminer_cuda8.exe is for older card (compiled with compute_52; sm_52)