# Put the data together

# Take the numerical columns from the original data
X_num = data[['age', 'bmi', 'children']].copy()

# Append the encoded categorical columns to the numerical data, column-wise
X_final = pd.concat([X_num, region, sex, smoker], axis=1)

# Define y as the "charges" column from the original dataset
y_final = data[['charges']].copy()

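# Train/test split: hold out 33% of the rows for testing; random_state=0 makes the shuffle reproducible
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X_final, y_final, test_size=0.33, random_state=0)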
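
To make explicit what a shuffled hold-out split like the one above does, here is a minimal pure-Python sketch. It is not scikit-learn's implementation: `simple_train_test_split` is a hypothetical helper, the toy rows are made-up values standing in for the insurance features, and rounding the test share up is an assumption of this sketch.

```python
import math
import random


def simple_train_test_split(rows, targets, test_size=0.33, seed=0):
    """Hypothetical helper: shuffle row indices, then hold out a fraction for testing."""
    if len(rows) != len(targets):
        raise ValueError("rows and targets must have the same length")
    indices = list(range(len(rows)))
    random.Random(seed).shuffle(indices)  # seeded, so the split is reproducible
    n_test = math.ceil(test_size * len(rows))  # assumption: round the test share up
    test_idx, train_idx = indices[:n_test], indices[n_test:]
    # Features and targets are selected with the same indices, so they stay aligned
    X_train = [rows[i] for i in train_idx]
    X_test = [rows[i] for i in test_idx]
    y_train = [targets[i] for i in train_idx]
    y_test = [targets[i] for i in test_idx]
    return X_train, X_test, y_train, y_test


# Toy data (made-up values): [age, bmi, children] rows and a fake charges target
toy_X = [[age, 25.0 + age % 7, age % 3] for age in range(20, 30)]
toy_y = [1000.0 * row[0] for row in toy_X]

X_tr, X_te, y_tr, y_te = simple_train_test_split(toy_X, toy_y)
print(len(X_tr), len(X_te))
```

The point of the sketch is that every row lands in exactly one of the two sets, and that each target follows its feature row. Fixing the seed plays the same role as `random_state` in the call above: rerunning the split gives the same partition.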
