WellJoea/MLkit


Machine Learning Kit

MLkit is a toolkit for traditional machine learning built on sklearn.
You can use most of the estimators in sklearn through MLkit, with automatic standardization, feature selection, fitting, and prediction.
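
Conceptually, the Auto workflow (standardization, feature selection, fitting, prediction) matches a plain scikit-learn Pipeline. A minimal sketch using only scikit-learn (MLkit's internal API may differ):

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

pipe = Pipeline([
    ("scale", StandardScaler()),                        # standardization
    ("select", SelectKBest(f_classif, k=10)),           # feature selection
    ("model", DecisionTreeClassifier(random_state=0)),  # fitting
])
pipe.fit(X_train, y_train)
pred = pipe.predict(X_test)                             # prediction
print(pipe.score(X_test, y_test))
```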


Installation

Dependencies

        joblib >= 0.13.2
        matplotlib >= 3.0.3
        mlxtend >= 0.16.0
        numpy >= 1.16.4
        pandas >= 0.24.2
        scikit-learn >= 0.21.2
        scikit-plot >= 0.3.7
        scipy >= 1.3.0
        seaborn >= 0.9.0
        sklearn-pandas >= 1.8.0

User installation


Usage

MLkit.py -h
usage: MLkit.py [-h] [-V] {Common,Fselect,Fitting,Predict,Score,Auto} ...

The traditional machine learning analysis is based on the sklearn package:

1. positional arguments:

{Common,Fselect,Fitting,Predict,Score,Auto}

                        machine learning models help.
    Common              The common parameters used for other models.
    Fselect             Feature selection from standardized data.
    Fitting             Fitting and predicting the training and testing
                        set from estimators.
    Predict             Predict new data using the fitted model.
    Score               Scoring the samples.
    Auto                Auto-processing: standardization, feature
                        selection, scoring, fitting, and/or prediction.

2. optional arguments:

-h, --help            show this help message and exit
-V, --version         show program's version number and exit
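
A subcommand interface like this ({Common,Fselect,Fitting,Predict,Score,Auto}) is the standard argparse-subparsers pattern. A minimal sketch of the shape only, not MLkit's actual source (the version string and the wiring of the `Auto` flags below are placeholders):

```python
import argparse

parser = argparse.ArgumentParser(
    description="The traditional machine learning analysis is based on the sklearn package:")
parser.add_argument("-V", "--version", action="version", version="MLkit (placeholder version)")
sub = parser.add_subparsers(
    dest="command", metavar="{Common,Fselect,Fitting,Predict,Score,Auto}")

auto = sub.add_parser(
    "Auto",
    help="auto-processing: standardization, feature selection, scoring, "
         "fitting and/or prediction")
auto.add_argument("-i", "--input")
auto.add_argument("-g", "--group")
auto.add_argument("-o", "--outdir")
auto.add_argument("-m", "--model", default="XGB")

# Simulate: MLkit.py Auto -i data.traintest.txt -m DT
args = parser.parse_args(["Auto", "-i", "data.traintest.txt", "-m", "DT"])
print(args.command, args.model)
```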

3. Example:

MLkit.py Auto -h

usage: MLkit.py Auto 
		[-h] [-i INPUT] [-g GROUP] [-o OUTDIR] [-m MODEL]
		[-t POOL] [-sc {GSCV,RSCV}] [-nt N_ITER] [-mr MISSROW]
		[-mc MISSCOL] [-mv {mean,median,most_frequent,constant}]
		[-fv FILLVALUE] [-pp] [-nj N_JOB] [-vm CVMODEL]
		[-cm CVFIT] [-s SCALER [SCALER ...]]
		[-qr QUANTILERANGE [QUANTILERANGE ...]]
		[-pt PCATHRESHOLD] [-sb [SELECTB [SELECTB ...]]]
		[-kb SELECTK] [-rf] [-sf] [-cs]
		[-sm [SPECIFM [SPECIFM ...]]] [-st {parallel,serial}]
		[-sp SELECTCV_REP] [-rr RFE_RATE] [-sr SFS_RATE]
		[-cr CFS_RATE] [-kf K_FEATURES [K_FEATURES ...]] [-rm]
		[-rmf RMFISRT] [-rms RMSECOND] [-rmt RMTHIRD] [-tz TESTS]
		[-cv CROSSV] [-pc] [-lr LRMODE] [-mi MAX_INTERVALS]
		[-bp BASEPOINTS] [-bd BASEODDS] [-pd PDO] [-ms MODSCORE]
		[-p PREDICT] [+P PIPELINE [PIPELINE ...]]
		[+M MODEL [MODEL ...]]

Examples:
	MLkit.py Auto -i data.traintest.txt -g group.new.txt -p data.predict.txt -o testdt/ -m DT
	MLkit.py Auto -i data.traintest.txt -g group.new.txt -o testdt/ -m DT -pc -s S M
	MLkit.py Common -i data.traintest.txt -g group.new.txt -o testdt/ -m DT
	MLkit.py Fselect -i data.traintest.txt -g group.new.txt -o testdt/ -m DT -s S
	MLkit.py Predict -p data.predict.txt -g group.new.txt -o testdt/ -m DT

4. Abbreviations:

All of the estimators you can use are listed below (default: XGB):

Classification:
        RF.................RandomForestClassifier
        GBDT...............GradientBoostingClassifier
        XGB................XGBClassifier(+LR/LRCV)
        MLP................MLPClassifier
        DT.................DecisionTreeClassifier
        AdaB_DT............AdaBoostClassifier(DT)
        LinearSVM..........LinearSVC(penalty='l1')
        LinearSVMil2.......LinearSVC(penalty='l2')
        SVMlinear..........SVC(kernel="linear")
        SVM................SVC(non-linear kernels)
        nuSVMrbf...........NuSVC(kernel='rbf')
        SGD................SGDClassifier
        KNN................KNeighborsClassifier
        RNN................RadiusNeighborsClassifier
        MNB................MultinomialNB
        CNB................ComplementNB
        BNB................BernoulliNB
        GNB................GaussianNB
        LR.................LogisticRegression
        LRCV...............LogisticRegressionCV
Regression:
        RF.................RandomForestRegressor
        GBDT...............GradientBoostingRegressor
        XGB................XGBRegressor
        MLP................MLPRegressor
        DT.................DecisionTreeRegressor
        AdaB_DT............AdaBoostRegressor(DT)
        LinearSVM..........LinearSVR
        SVMlinear..........SVR(kernel="linear")
        SVMrbf.............SVR(kernel="rbf")
        nuSVMrbf...........NuSVC(kernel='rbf')
        SGD................SGDRegressor
        KNN................KNeighborsRegressor
        RNN................RadiusNeighborsRegressor
        LG.................LogisticRegression
        LassCV.............LassoCV
        Lasso..............Lasso
        ENet...............ElasticNet
        ENetCV.............ElasticNetCV
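
Internally, abbreviation-to-estimator dispatch amounts to a lookup table. A sketch of a few of the classification entries using plain scikit-learn (the dictionary below is illustrative, not MLkit's actual mapping; `dual=False` is added because scikit-learn requires it with `penalty='l1'`):

```python
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression, LogisticRegressionCV
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC, LinearSVC
from sklearn.tree import DecisionTreeClassifier

# Illustrative abbreviation -> estimator table (subset of the list above)
CLASSIFIERS = {
    "RF": RandomForestClassifier(),
    "GBDT": GradientBoostingClassifier(),
    "DT": DecisionTreeClassifier(),
    "LinearSVM": LinearSVC(penalty="l1", dual=False),
    "LinearSVMil2": LinearSVC(penalty="l2"),
    "SVMlinear": SVC(kernel="linear"),
    "KNN": KNeighborsClassifier(n_neighbors=1),
    "GNB": GaussianNB(),
    "LR": LogisticRegression(),
    "LRCV": LogisticRegressionCV(),
}

estimator = CLASSIFIERS["DT"]   # e.g. `-m DT` on the command line
estimator.fit([[0, 0], [1, 1], [0, 1], [1, 0]], [0, 1, 0, 1])
print(type(estimator).__name__)
```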

Feature selection with traditional statistical methods:

-sb --SelectB:
          (default: ['ANVF', 'MI', 'RS', 'MWU', 'TTI', 'PS'])
Classification:
        VTh................VarianceThreshold
        ANVF...............f_classif
        Chi2...............chi2
        MI.................mutual_info_classif
        WC.................wilcoxon
        RS.................ranksums
        MWU................mannwhitneyu
        TTI................ttest_ind
Regression:
        VTh................VarianceThreshold
        ANVF...............f_regression
        PS.................pearsonr
        MI.................mutual_info_classif
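
These abbreviations map to standard scikit-learn and SciPy routines. The sketch below shows how ANVF, MI, RS, MWU, and TTI scores might be computed per feature for a classification problem (illustrative only, not MLkit's implementation):

```python
from scipy.stats import mannwhitneyu, ranksums, ttest_ind
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif, mutual_info_classif

X, y = make_classification(n_samples=100, n_features=8, random_state=0)

# ANVF / MI: sklearn univariate scores, usable directly inside SelectKBest
anova_scores, _ = f_classif(X, y)
mi_scores = mutual_info_classif(X, y, random_state=0)

# RS / MWU / TTI: per-feature two-sample tests between the two classes
a, b = X[y == 0], X[y == 1]
pvals = {
    "RS":  [ranksums(a[:, j], b[:, j]).pvalue for j in range(X.shape[1])],
    "MWU": [mannwhitneyu(a[:, j], b[:, j]).pvalue for j in range(X.shape[1])],
    "TTI": [ttest_ind(a[:, j], b[:, j]).pvalue for j in range(X.shape[1])],
}

# Keep the 4 features with the highest ANOVA F-scores
X_best = SelectKBest(f_classif, k=4).fit_transform(X, y)
print(X_best.shape)
```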
