-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathreadme.txt
37 lines (21 loc) · 1.24 KB
/
readme.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
requirements.txt file is available
methodology_bs4.py = Script to fetch the PMCIDs using Bio Python Package.
Using the PMCIDs , medical articles are webscrapped using
selenium and beautiful_soup
#change the directory of chrome_driver as per your directory
sent_rank.py = script of implementation of Pytextrank as per our usecase
summarygenerator.py = google T5 + sent_rank(text rank algorithm) to generate Extractive
Summarization
keywordsgenerator.py =Gensim model to generate keywords and cosine similarity using spacy
english large model to generate Medical keywords
daysstnadard.py = standardizes all the time and days quantities into days
main_file.py = to run the entire script without streamlit GUI
app_1.py = Streamlit GUI
CLI command from the directory :
streamlit run app_1.py runserver
Transformer models on line 43,44 and 57,58 if not available locally
set local_files_only=False
line 101 articles to be fetched from Pubmed is set to 5,
uncomment line 100 to allow user to decide how many articles to be searched
line 82, change the document save directory and uncomment 128 and 159 lines
correlated_words_0.py and gpt2method.py are not in use