This repository was archived by the owner on Feb 11, 2025. It is now read-only.
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Final Finishing Touches before Final Presentation.
[NEW] AutoQPGen :: /Assets/QuestionBank_FORMAT/*.pdf :: Added two new question banks for demo purposes; both have been pre-tested to extract correctly.
[NEW] AutoQPGen :: /Assets/QuestionBank_FORMAT/*.docx :: Added the source files (DOCX) for the newly added demo question banks.
[NEW] AutoQPGen :: /Assets/QuestionBank_FORMAT/Template/QBank-TEMPLATE.pdf :: Added a demo template showing the newly accepted question bank format, which improves readability for both humans and AutoQPGen.
[NEW] AutoQPGen :: /Assets/QuestionBank_FORMAT/Template/QBank-TEMPLATE.docx :: Added the source file (DOCX) for the demo template question bank.
[NEW] AutoQPGen :: /models/QScanEngine.py :: Introduced the PDFPlumber PDF extraction library, which extracts readable text from PDFs without redundant whitespace or other unwanted characters and focuses on "STRUCTURED READING" of PDF content, the way a human would read it.
[FIX] AutoQPGen :: /app.py :: Fixed a fatal error that caused the web app to crash while purging redundant documents.
[FIX] AutoQPGen :: /Assets/TestQBanks/ :: Removed this folder as a result of moving to the new format; the older question bank format is no longer supported.
[FIX] AutoQPGen :: /models/QScanEngine.py :: Replaced the FITZ data extraction library with the new PDFPlumber library.
[FIX] AutoQPGen :: /models/QScanEngine.py :: Redesigned the questionSetter() function to sort the processed questions before display and group them by module in ascending order.
[FIX] AutoQPGen :: /models/QScanEngine.py :: Rerouted the extract_text_from_pdf() function to read data through PDFPlumber, reflecting the new accepted format.
[FIX] AutoQPGen :: /models/QScanEngine.py :: Reconfigured the SpaCy model to better identify Subject data in the question bank.
[FIX] AutoQPGen :: /README.md :: Removed the fitz/PyMuPDF dependency issue section, as the library is no longer used.
[FIX] AutoQPGen :: /README.md :: Added a (PDF Structured Extraction) entry to the Technologies used section.
[FIX] AutoQPGen :: /README.md :: Fixed typos in the setup instructions.
[FIX] AutoQPGen :: /requirements.txt :: Added libraries {PDFPlumber, GoogleGenAI} and removed libraries {PDFkit, fitz, PyPDF2, secrets}.
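
The commit message says questionSetter() now sorts questions and groups them by module in ascending order, but the function body is not shown on this page. The following is only a minimal sketch of that idea, not the actual implementation: the helper name `group_questions_by_module` and the `module` and `text` fields are hypothetical and chosen purely for illustration.

```python
from collections import defaultdict


def group_questions_by_module(questions):
    """Group question dicts by module number, with modules in ascending order.

    Hypothetical helper: the real questionSetter() may use a different
    data layout. Each question is assumed to be a dict with a "module" key.
    """
    grouped = defaultdict(list)
    for question in questions:
        grouped[question["module"]].append(question)
    # Dicts preserve insertion order, so inserting the keys in sorted order
    # yields a mapping whose modules iterate in ascending order.
    return {module: grouped[module] for module in sorted(grouped)}


# Made-up sample data, deliberately out of order.
sample = [
    {"module": 3, "text": "Explain normalization."},
    {"module": 1, "text": "Define a database."},
    {"module": 3, "text": "What is a foreign key?"},
    {"module": 2, "text": "List the ACID properties."},
]

ordered = group_questions_by_module(sample)
for module, items in ordered.items():
    print(f"Module {module}")
    for question in items:
        print(f"  - {question['text']}")
```

Within each module the questions keep their original relative order, since the grouping only appends and never reorders inside a group.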
Showing 19 changed files with 17 additions and 30 deletions.
Binary file not shown.
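| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | `@@ -1,13 +1,11 @@` |
| 1 | | `-pdfkit` |
| 2 | | `-fitz` |
| 3 | | `-PyPDF2` |
| | 1 | `+pdfplumber` |
| 4 | 2 | `flask` |
| 5 | 3 | `pathlib` |
| 6 | | `-tools` |
| 7 | | `-frontend` |
| 8 | | `-secrets` |
| 9 | 4 | `spacy` |
| 10 | 5 | `docxtpl` |
| 11 | 6 | `docx2pdf` |
| 12 | | `-pywin32` |
| | 7 | `+google-generativeai` |
| 13 | 8 | `tk` |
| | 9 | `+pywin32` |
| | 10 | `+tools` |
| | 11 | `+frontend` |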
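
The hunk header (13 original lines, 11 new lines) and the commit message together allow the before and after package lists to be inferred from the flattened diff. The sketch below encodes that inference and checks it against the commit message's claim about added and removed libraries. Both lists are a reconstruction, not a copy of the actual files, and `diff_requirements` is a hypothetical helper.

```python
# Reconstructed package lists (an inference from the hunk header and the
# commit message, not copied from the real requirements.txt revisions).
old_requirements = [
    "pdfkit", "fitz", "PyPDF2", "flask", "pathlib", "tools", "frontend",
    "secrets", "spacy", "docxtpl", "docx2pdf", "pywin32", "tk",
]
new_requirements = [
    "pdfplumber", "flask", "pathlib", "spacy", "docxtpl", "docx2pdf",
    "google-generativeai", "tk", "pywin32", "tools", "frontend",
]


def diff_requirements(old, new):
    """Return (added, removed) package names, each sorted case-insensitively.

    Packages that merely moved to a different line (here pywin32, tools
    and frontend) appear in neither list.
    """
    old_set, new_set = set(old), set(new)
    added = sorted(new_set - old_set, key=str.lower)
    removed = sorted(old_set - new_set, key=str.lower)
    return added, removed


added, removed = diff_requirements(old_requirements, new_requirements)
print("added:  ", added)
print("removed:", removed)
```

Comparing sets rather than lines is deliberate: a line-based diff reports pywin32, tools and frontend as both removed and added because they changed position, whereas the set comparison reports only the packages that actually entered or left the file.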