- Description
- How To Get The Repository on Your Machine
- Running Jupyter Notebook
- Technologies Used
- Libraries Used
- nbview
This document relates to the numpy.random assessment for the Programming For Data Analysis module. The aim of this project is to research and investigate the numpy.random package.
This README contains the documentation for technologies and libraries used for the project.
The Jupyter Notebook and images used can be found at https://github.com/kmcd14/ProgrammingDA-assessment.
- Explain the overall purpose of the package.
- Explain the use of the “Simple random data” and “Permutations” functions.
- Explain the use and purpose of at least five “Distributions” functions.
- Explain the use of seeds in generating pseudorandom numbers.
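As a rough illustration of the seeding and permutation points above, the sketch below assumes NumPy 1.17 or later (for `default_rng`). The seed value 42 and the variable names are arbitrary choices for this example and are not taken from the notebook.

```python
import numpy as np

# Two generators created from the same seed produce the same stream of numbers.
rng_a = np.random.default_rng(seed=42)
rng_b = np.random.default_rng(seed=42)

# Simple random data: five integers drawn from 0 to 9 (high is exclusive).
sample_a = rng_a.integers(low=0, high=10, size=5)
sample_b = rng_b.integers(low=0, high=10, size=5)
same_seed_matches = np.array_equal(sample_a, sample_b)

# Permutations: a shuffled copy of the integers 0 to 4.
perm = rng_a.permutation(5)

# Distributions: five draws from a standard normal distribution.
normal_draws = rng_a.normal(loc=0.0, scale=1.0, size=5)

print("Same seed, same output:", same_seed_matches)
print("Permutation:", perm)
print("Normal draws:", normal_draws)
```

Re-running the script with the same seed reproduces the same values, which is what makes seeded pseudorandom results repeatable.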
- Create a folder on your desktop where you wish to store the code
- Navigate to the repository https://github.com/kmcd14/ProgrammingDA-assessment.
- Copy the repository address, using either SSH or HTTPS, as seen in the above picture.
- Open the terminal on your desktop and navigate to the folder you created earlier.
- Once in the folder, run:
$ git clone git@github.com:kmcd14/ProgrammingDA-assessment
- Set up pull mode and pull down the contents
----
To access the notebook (numpy-random.ipynb), you will need to install Python, preferably through the Anaconda distribution.
Google Docs: an online word processor used to write my documentation before transferring it into this README file. https://www.google.com/docs/about/
Anaconda: the easiest way to perform Python data science and machine learning on Windows, Linux and Mac OS. This script was created using Version 4.9.2. https://www.anaconda.com/distribution/
Python: an interpreted, object-oriented, high-level programming language with dynamic semantics. This script was created using Version 3.8.5. https://www.python.org/
GitHub: a code hosting platform for collaboration and version control. https://github.com/
Jupyter Lab/Jupyter Notebook: a web-based interactive development environment for Jupyter notebooks, code, and data. https://jupyter.org/
“Python is a widely-used, interpreted, object-oriented, and high-level programming language with dynamic semantics, used for general-purpose programming. It was created by Guido van Rossum, and first released on February 20, 1991” (Python Institute, 2021). It is suitable for research, prototyping and building production systems, so a single language can be used across all of these stages. Python has a vast and continuously growing collection of libraries, such as NumPy and Pandas, which makes it well suited to data analysis. It is a robust, flexible and efficient language which provides many solutions and avenues to approach and solve problems.
NumPy is a Python library used for working with arrays. Its core object is the ndarray. NumPy arrays are faster and more memory-efficient than Python lists because their elements are stored in one contiguous block of memory, so they can be accessed and manipulated quickly. http://www.numpy.org/
Matplotlib is a Python library used to create plots, graphs, charts etc. https://matplotlib.org/
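To make the comparison with lists concrete, here is a minimal sketch. The variable names are made up for this example, and it only shows the difference in style between a list and an array rather than measuring speed.

```python
import numpy as np

values = [0, 1, 2, 3, 4]

# With a plain list, each element is doubled one at a time in a Python loop.
doubled_list = [v * 2 for v in values]

# With an ndarray, the operation is applied to the whole array at once.
arr = np.array(values)
doubled_arr = arr * 2

# A freshly created array is stored in one contiguous block of memory.
is_contiguous = arr.flags["C_CONTIGUOUS"]

print(type(arr).__name__)    # ndarray
print(doubled_arr.tolist())  # [0, 2, 4, 6, 8]
print(is_contiguous)         # True
```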
Seaborn is based on matplotlib and is also used to create data visualisations. “It provides a high-level interface for drawing attractive and informative statistical graphics.” https://seaborn.pydata.org/
If your system does not have these libraries installed, enter the below command from the command line:
$ pip install <library name>
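If you are unsure which libraries are already present, a short check like the one below may help. It is only a sketch: it assumes the import names match the library names listed above (numpy, matplotlib, seaborn) and uses the standard library to look for them without importing them.

```python
import importlib.util

# Libraries used by the notebook, as listed in this README.
required = ["numpy", "matplotlib", "seaborn"]

# find_spec returns None when a top-level package cannot be found.
missing = [name for name in required if importlib.util.find_spec(name) is None]

if missing:
    print("Missing libraries:", ", ".join(missing))
else:
    print("All required libraries are installed.")
```

Any name reported as missing can then be installed with the pip command shown above.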
Additionally, you can view the notebook by clicking on the following icon: