Version 0.2.0 includes several changes to improve the installation process, update dependencies, and enhance the functionality of the stream_topic
package. The most important changes include modifying the installation instructions, updating the setup.py
file to handle additional requirements and post-installation tasks, and making various improvements to the models.
Installation and Dependencies:
README.md
: Updated the installation instructions to usestream-topic
instead ofstream_topic
and added instructions for installing additional requirements for add-ons like visualizations, BERTopic, and DCTE.setup.py
: Added aPostInstallCommand
class to handle downloading NLTK resources after installation and includedextras_require
for optional dependencies. [1] [2] [3] [4]
Model Improvements:
stream_topic/metrics/coherence_metrics.py
: Added aget_info
method to provide information about the metric and removed the_load_default_texts
function. [1] [2]stream_topic/models/DCTE.py
: Implemented lazy imports for SetFit components to handle missing dependencies more gracefully and updated thefit
method to use these components. [1] [2] [3]
Miscellaneous Changes:
stream_topic/models/KmeansTM.py
,stream_topic/models/bertopicTM.py
,stream_topic/models/cbc.py
: Updated theOneHotEncoder
to usesparse_output=False
instead ofsparse=False
for compatibility with newer versions of scikit-learn. [1] [2] [3]stream_topic/models/abstract_helper_models/base.py
: Removed direct imports ofoptuna
and added dynamic importing to handle missing dependencies. [1] [2]