Welcome to my collection of data science and machine learning tasks. This repository contains a series of single-notebook solutions to various data-related problems.
These tasks represent a range of data analysis and machine learning exercises, typically completed within a single Jupyter notebook. Some of these may be derived from interview challenges I've encountered, while others are practice exercises or small personal projects.
Key points about this collection:
- Each notebook focuses on a specific data analysis or machine learning task
- Tasks vary in complexity but are generally designed to be concise and focused
- The solutions demonstrate my approach to data problems and coding style
- Data used is either synthetic, from public datasets, or modified to ensure privacy
Feel free to explore the notebooks. If you have any questions about the approaches used or would like to discuss any of the tasks, please don't hesitate to reach out!
- Clone the repository:
git clone https://github.com/bab-git/data-science-interviews.git
- Create a virtual environment:
python -m venv .env
souce .env/bin/activate
- Install the required packages: Using Python v3.12.0:
pyenv version 3.12.0
pip install -r requirements_strict.txt
Note: If you are using another version of Python, the above dependency versions may fail to install. In that case, you can use the flexible version below. But then you may need to adjust the notebook code where you find a syntax error.
pip install -r requirements.txt
- Navigate to a specific project folder and open the Jupyter notebook:
cd tasks/task_name
jupyter notebook task_name_solution.ipynb
- Predict Advertisement Response - A supervised model to predict the response of residents to direct mailing advertisements.
- Predictive Modeling for Material Strength - a regression-based predictive model to estimate the material strength.
- Recipe Recommender for Grocery Apps: Presentation slides detailing the business proposal, system architecture, and implementation strategy for a Recipe Recommender System.
- Customer Satisfaction Prediction: A classification model to predict customer satisfaction of an online store based on demographic, transactional, and behavioral data.
- [Other tasks will be added soon]
- Data Cleaning and Preprocessing
- Exploratory Data Analysis (EDA)
- Feature Engineering
- Machine Learning (Supervised and Unsupervised)
- Deep Learning
- Time Series Analysis
- Data Visualization
- Python
- Pandas, NumPy
- Scikit-learn
- TensorFlow, PyTorch
- Matplotlib, Seaborn
- Jupyter Notebooks
While this repository is primarily for showcasing my work, I welcome discussions and suggestions. Feel free to open an issue if you have any questions or ideas for improvement.
This project is licensed under the MIT License - see the LICENSE file for details.
LinkedIn: https://www.linkedin.com/in/bhosseini/
GitHub: https://github.com/bab-git/