Skip to content

Latest commit

 

History

History
101 lines (57 loc) · 2.31 KB

README.md

File metadata and controls

101 lines (57 loc) · 2.31 KB

Statistics-for-Data-Science

Statistics for Data Science and Machine Learning Handwritten Notes

image

Types of Statistics:

1) Descriptive statistics:

It includes analyzing, summarizing, organizing data in the form of numbers and graphs.

Types: Measure of central tendency- Mean, mode, median Measure of dispersion- range, variance, standard deviation

Graphs: bar plot, histogram, pie chart, pdf, cdf, normal distribution etc.

Here we take population or sample data and analyze, summarize, explore , visualize the data in the form of number and graphs.

2) Inferential statistics:

It is a technique wherein we use the data that we have measured to form conclusions.

eg: To get exit poll results during elections, we can take different samples from different locations from entire population , we are taking some sample and make inferences about population and coming to a conclusion like which party will win.

We take a sample from population and then we try to do some kind of test and then we come up with some inference and conclusion for that population.

It includes confidence interval , z- test, t-test, chi-square test etc.

Topics Covered in Hand Written Notes are:-

  1. Statistics and its type

  2. Variables i. Quantitative variable and its type ii. Qualitative variable and its type

  3. Graph i. Bar-graph ii. Histogram

  4. Measure of central Tendency i. Mean ii. Median iii. Mode

  5. Measure of Dispersion i. Variance ii. Standard Deviation

  6. Percentiles and Quartiles

  7. 5-Number Summary

  8. Boxplot

  9. Normal Distribution

  10. Standard Normal Distribution

  11. Standardization

  12. Normalization i. Min-max scaler

  13. Covariance

  14. Pearson Correlation Coefficient

  15. Spearman's rank correlation coefficient

  16. Poisson Distribution

  17. Bernoulli Distribution

  18. Binomial Distribution

  19. Q-Q plot

  20. Chi-square Test

  21. Hypothesis Testing

  22. Annova Test

  23. 1-sample t-test and 2-sample t-test

  24. Kernel density estimation

  25. Probability density function

  26. P value

  27. Bell curve

References: