Missing information and duplicated lines on splitting.py #62

vultor33 · 2019-05-13T20:12:16Z

Instructions

There is some missing information and duplicated lines on splitting.py documentation.
Path: .\fklearn\src\fklearn\preprocessing\splitting.py

Describe the documentation issue

It is in space_time_split_dataset function:

    Returns
    ----------
    train_set : pandas.DataFrame
        The in ID sample and in time training set.

    intime_outspace_hdout : pandas.DataFrame
        The out of ID sample and in time hold out set.  #duplicated line

    outime_inspace_hdout : pandas.DataFrame
         The out of ID sample and in time hold out set. #duplicated line

    holdout_space : pandas.DataFrame    
         The out of ID sample and in time hold out set. #duplicated line



#Should it return holdout_space?

Possible solutions

The following text is my guess of what this function should return:

   Returns
    ----------
    train_set : pandas.DataFrame
        Samples with timestamp >= train_start_date and timestamp < train_end_date
        All IDs are included except from those selected for validation (holdout_space)

    intime_outspace_hdout : pandas.DataFrame
        Samples with same timestamps of train_set
        IDs are selected in holdout_space array
        All rows with selected ID and in specified timestamps are included

    outime_inspace_hdout : pandas.DataFrame
        Samples with timestamp >= train_end_date and timestamp < holdout_end_date
        All IDs are included

    outime_outspace_hdout : pandas.DataFrame
        Samples with same timestamps of outime_inspace_hdout.
        IDs are selected in holdout_space array 
        All rows with selected ID and in specified timestamps are included

The text was updated successfully, but these errors were encountered:

vultor33 added the documentation Missing documentation or improvements in the existing one label May 13, 2019

caique-lima added this to the 1.16.x milestone Sep 2, 2019

caique-lima mentioned this issue Sep 2, 2019

Out of Time In space split is wrong on space_time_split_dataset function #91

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Missing information and duplicated lines on splitting.py #62

Missing information and duplicated lines on splitting.py #62

vultor33 commented May 13, 2019

Missing information and duplicated lines on splitting.py #62

Missing information and duplicated lines on splitting.py #62

Comments

vultor33 commented May 13, 2019

Instructions

Describe the documentation issue

Possible solutions