Skip to content

Latest commit

 

History

History
27 lines (14 loc) · 1.15 KB

README.md

File metadata and controls

27 lines (14 loc) · 1.15 KB

Loan Exploratory Data Analysis

This project is an Exploratory Data Analysis (EDA) with R. The project was created for Udacity's Data Analyst Nanodegree.

Access the final report here: https://sbsousa.github.io/LoanEDA

Project Description

Per Udacity, the goal of this project is to "use R and apply exploratory data analysis techniques to explore relationships in one variable to multiple variables and to explore a selected data set for distributions, outliers, and anomalies". 10-15 variables should be explored.

I selected the Prosper Loans data set from the list of available choices. The data set contains 113,937 loans with 81 variables on each loan.

Approach

The EDA was conducted in R-Studio using multiple R packages such as knitr, diplyr, and ggplot2. The R Markdown file is included as well as the final knitted report in HTML format.

License

This project is publicly available for educational purposes. Please acknowledge this source if you use it.

Sources

The CSV file and EDA project outline were provided by Udacity:

https://www.udacity.com/course/data-analyst-nanodegree--nd002

Additional sources are acknowledged in the code and report.