This pull request includes several changes to improve the installation process, update dependencies, and enhance the functionality of the `stream_topic` package. The most important changes include modifying the installation instructions, updating the `setup.py` file to handle additional requirements and post-installation tasks, and making various improvements to the models.

Installation and Dependencies:
- `README.md`: Updated the installation instructions to use `stream-topic` instead of `stream_topic` and added instructions for installing additional requirements for add-ons like visualizations, BERTopic, and DCTE.
- `setup.py`: Added a `PostInstallCommand` class to handle downloading NLTK resources after installation and included `extras_require` for optional dependencies. [1] [2] [3] [4]

Version Update:
- `stream_topic/__version__.py`: Updated the version from `0.1.9` to `0.2.0`.

Model Improvements:
- `stream_topic/metrics/coherence_metrics.py`: Added a `get_info` method to provide information about the metric and removed the `_load_default_texts` function. [1] [2]
- `stream_topic/models/DCTE.py`: Implemented lazy imports for SetFit components to handle missing dependencies more gracefully and updated the `fit` method to use these components. [1] [2] [3]

Miscellaneous Changes:
- `stream_topic/models/KmeansTM.py`, `stream_topic/models/bertopicTM.py`, `stream_topic/models/cbc.py`: Updated the `OneHotEncoder` to use `sparse_output=False` instead of `sparse=False` for compatibility with newer versions of scikit-learn. [1] [2] [3]
- `stream_topic/models/abstract_helper_models/base.py`: Removed direct imports of `optuna` and added dynamic importing to handle missing dependencies. [1] [2]
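
For reviewers unfamiliar with the pattern, the lazy and dynamic imports mentioned for SetFit and `optuna` can be sketched roughly as below. This is a minimal illustration, not the code in this PR: the helper name `optional_import` and its `install_hint` parameter are hypothetical, and the actual changes in `DCTE.py` and `base.py` may be structured differently.

```python
import importlib


def optional_import(module_name, install_hint=None):
    """Import a module only when it is needed.

    Hypothetical helper: raises ImportError with an installation hint
    if the optional dependency is missing, instead of failing when the
    package itself is imported.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        message = f"Optional dependency '{module_name}' is not installed."
        if install_hint:
            message += f" Try: {install_hint}"
        raise ImportError(message) from exc


# Example: the import happens inside the function that needs it,
# so importing the surrounding module never requires the dependency.
def dump_config(config):
    json = optional_import("json")  # stdlib module used as a stand-in
    return json.dumps(config, sort_keys=True)
```

The point of deferring the import is that users who never call the optional code path do not need the extra package, and those who do get a clear message instead of a bare `ModuleNotFoundError`.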
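
On the `OneHotEncoder` change: scikit-learn 1.2 renamed the `sparse` parameter to `sparse_output`, and the old name was later removed. This PR switches directly to `sparse_output=False`. If older scikit-learn versions also had to be supported, one possible approach (an assumption, not part of this PR) would be to pick the keyword from the constructor signature. The helper name `dense_output_kwargs` is hypothetical.

```python
import inspect


def dense_output_kwargs(encoder_cls):
    """Return the keyword argument that requests dense output.

    Hypothetical helper: prefers `sparse_output` when the constructor
    accepts it, and falls back to the older `sparse` name otherwise.
    """
    params = inspect.signature(encoder_cls).parameters
    if "sparse_output" in params:
        return {"sparse_output": False}
    return {"sparse": False}


# Intended usage (requires scikit-learn, so it is left commented out):
# from sklearn.preprocessing import OneHotEncoder
# encoder = OneHotEncoder(**dense_output_kwargs(OneHotEncoder))
```

Hard-coding `sparse_output=False`, as this PR does, is simpler and is sufficient when the minimum supported scikit-learn version is 1.2 or later.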