Data-Science-Crate

A one-stop repository for newcomers to data science.

Description

What are the projects?

This repository has two projects:

- Classification-based project on cancer prediction: Cancer_prediction.ipynb
- Regression-based project on stock price prediction: L&T_Stock_Price_prediction.ipynb

How can this project help?

Cancer Prediction

Machine learning is not new to cancer research. Artificial neural networks (ANNs) and decision trees (DTs) have been used in cancer detection and diagnosis for nearly 20 years. The fundamental goals of cancer prediction and prognosis are distinct from the goals of cancer detection and diagnosis.

Stock Price Prediction

Stock market prediction aims to determine the future movement of a stock's value on a financial exchange. The more accurately share price movement is predicted, the more profit investors can make.

The idea

Cancer Prediction

The idea is to predict whether a cell is cancerous or non-cancerous based on different features of the cell, using different machine learning algorithms or deep learning techniques.
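As a rough illustration of this idea, here is a minimal classification sketch. It assumes scikit-learn's bundled breast cancer dataset (load_breast_cancer) as a stand-in for cancer_data.csv, and logistic regression as just one possible algorithm. Neither choice is necessarily what Cancer_prediction.ipynb uses.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Assumption: scikit-learn's bundled dataset stands in for cancer_data.csv.
X, y = load_breast_cancer(return_X_y=True)  # y: 0 = malignant, 1 = benign

# Hold out 20% of the samples to measure how well the model generalizes.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

# Scale the features, then fit a simple linear classifier.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X_train, y_train)

accuracy = accuracy_score(y_test, model.predict(X_test))
print(f"Test accuracy: {accuracy:.3f}")
```

Swapping LogisticRegression for another scikit-learn classifier only changes the line that builds the pipeline, which makes it easy to try different algorithms on the same split.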

Stock Price Prediction

The idea is to predict future stock prices based on the different factors a stock's price depends on, using different machine learning algorithms or deep learning techniques.
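A minimal regression sketch of this idea follows. It assumes a synthetic random walk in place of the prices in LT.csv, lagged prices as the only features, and scikit-learn's LinearRegression as one possible model. These are illustrative choices, not necessarily those made in L&T_Stock_Price_prediction.ipynb.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error

# Assumption: a synthetic random walk stands in for the prices in LT.csv.
rng = np.random.default_rng(0)
prices = 1000 + np.cumsum(rng.normal(0, 5, size=300))

# Build lag features: predict each day's price from the previous n_lags days.
n_lags = 5
X = np.column_stack([prices[i : len(prices) - n_lags + i] for i in range(n_lags)])
y = prices[n_lags:]

# Split chronologically (no shuffling) so the model never sees future prices.
split = int(len(X) * 0.8)
X_train, X_test = X[:split], X[split:]
y_train, y_test = y[:split], y[split:]

model = LinearRegression().fit(X_train, y_train)
mae = mean_absolute_error(y_test, model.predict(X_test))
print(f"Mean absolute error on the held-out days: {mae:.2f}")
```

The chronological split matters here: shuffling time-ordered rows before splitting would let the model train on days that come after the ones it is tested on.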

Project structure

.
├── Classification
│   ├── Cancer_prediction.ipynb                   Jupyter notebook for Cancer prediction
│   ├── Datasets                                  Dataset for Cancer prediction
│   │   ├── cancer_data.csv
│   │   └── dataset.txt
│   └── classification.txt                        Basic information about Classification
├── Regression
│   ├── Datasets                                  Dataset for L&T stock price prediction
│   │   ├── LT.csv
│   │   └── dataset.txt
│   ├── L&T_Stock_Price_prediction.ipynb          Jupyter notebook for Stock price prediction
│   └── regression.txt                            Basic information about Regression
├── LICENSE
├── code_of_conduct.md
├── contributing.md
└── readme.md

Project roadmap

The project currently covers the following:

- Data cleaning
- Data preprocessing
- A few machine learning algorithms or deep learning techniques
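The cleaning and preprocessing steps could look roughly like the pandas sketch below. The toy DataFrame and its column names (radius, texture, diagnosis) are hypothetical and are not taken from the repository's datasets, and the notebooks may handle these steps differently.

```python
import pandas as pd

# Hypothetical toy data; the column names are illustrative only.
raw = pd.DataFrame(
    {
        "radius": [14.2, None, 20.1, 20.1, 11.8],
        "texture": [19.0, 21.5, 25.3, 25.3, 17.2],
        "diagnosis": ["B", "M", "M", "M", "B"],
    }
)

# Data cleaning: drop exact duplicate rows, then fill missing values
# with the column median.
clean = raw.drop_duplicates().reset_index(drop=True)
clean["radius"] = clean["radius"].fillna(clean["radius"].median())

# Data preprocessing: encode the label as 0/1 and standardize the
# numeric features to zero mean and unit variance.
clean["diagnosis"] = clean["diagnosis"].map({"B": 0, "M": 1})
features = clean[["radius", "texture"]]
scaled = (features - features.mean()) / features.std()

print(clean)
print(scaled.round(2))
```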

The following can still be implemented:

- Data augmentation or manipulation
- Better data visualization
- Different machine learning algorithms or deep learning techniques to achieve better prediction results

Getting started

Prerequisites

A very basic understanding of Git and GitHub:
1. What repositories (local, remote, upstream), issues, and pull requests are
2. How to clone a repository, fork a repository, and set upstreams
3. Adding, committing, pulling, and pushing changes to remote repositories

For EDA and visualisation:
1. Basic Python syntax and usage. (This is a must.)
2. Basic knowledge of the pandas library. Reading this blog might help.
3. Basic knowledge of the matplotlib library. Reading this blog might help.
4. Basic knowledge of the seaborn library. Reading this blog might help.
5. Basic knowledge of the scikit-learn library. Reading this blog might help.
6. Basic knowledge of the TensorFlow library. Reading this blog might help.

However, the code is well explained, so anyone who knows the basics of Python can get an idea of what's happening and contribute.

Installing

A step-by-step guide to getting a development environment running.

There are two ways to run the code.

Run the code in a web browser with Google Colab [Recommended]
- Go to Google Colab.
- Click the Upload Notebook tab.
- Upload the notebook that you got from this repo.
- Connect to the runtime.
- Upload your dataset.
- Click Run All.
- Start editing.

Run the code locally on your computer by installing Anaconda
- Install Anaconda. Follow these steps to install Anaconda on your computer.
- Install Jupyter Notebook using conda. Follow these steps to install Jupyter Notebook.
- Make sure to install pandas, matplotlib, seaborn, and scikit-learn to run the notebook.
- Start editing.
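For a local setup, a quick way to confirm that the required packages are available is a small check like the one below. This is only a sketch: the helper name missing_packages is made up for illustration, and the list covers just the four packages named above.

```python
import importlib.util

# Import names of the required packages (scikit-learn is imported as "sklearn").
REQUIRED = ["pandas", "matplotlib", "seaborn", "sklearn"]


def missing_packages(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_packages(REQUIRED)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are installed.")
```

Run it from the same environment that Jupyter uses, since packages installed in a different conda environment will not be visible to the notebook.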

Preview Notebooks

The notebooks will open in Google Colab.

Built with

Google Colab

Contributing

Please read contributing.md for details on our code of conduct, and the process for submitting pull requests to us.

Authors

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Neel Shah
