-
Notifications
You must be signed in to change notification settings - Fork 0
/
readme.txt
59 lines (30 loc) · 1.46 KB
/
readme.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
hello,
this is a code for getting similarity score of two labels in your datasets.
I mean the excels and final excel.
okay.
this is final labels in your final excel.
and these are labels in your old excel,, ---database
okay?
then I will try it.
it will show your the most similar labels.
for example
Unit: ['Post Handover Months Number', 'Status'] Post Handover Months Number
this means that Unit is simialr to each elements of ['Post Handover Months Number', 'Status'] and finally most similar 'Post Handover Months Number'
But as you can see, it is not similar in real world, so in that case I will not exchange Unit as the other.
I mean, odd it .
But look:
Status: ['Handover Date', 'Post Handover Months Number', 'Status'] Status
here you can see status is similar to status.
It means my code works well.
okay?
look another output for that.
Price: ['Min Price AED', 'Max Price AED', 'Max Size SQF', 'Handover Date'] Max Price AED
here , 'Price' in Old excel is similar to ['Min Price AED', 'Max Price AED', 'Max Size SQF', 'Handover Date'] and my choice is 'Max Price AED'.
It looks good with real world concepts.
this is my result.
So from now, I will make the code for all of excel files in your database.
And using it, I will exchange labels in database with labels in final excel file.
it is my purpose..good.
then, I will merge all files using pandas library.
after that, you will get one file which you want to get.
thanks for you time~~~