Skip to content

codewithawr/NIC-hydrabad-Startups-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

NIC Hyderabad Startups Data Scraper

This repository contains a Python script and Jupyter Notebook for scraping startup data from the National Incubation Center (NIC) Hyderabad, Sindh, Pakistan website. The data includes information about startups incubated in various cohorts at NIC Hyderabad.

Table of Contents

Introduction

The National Incubation Center (NIC) Hyderabad is a hub for fostering innovation and entrepreneurship in Sindh, Pakistan. This project aims to scrape and compile data on startups from the NIC Hyderabad website. The objective is to gather information on the startups, including their names, descriptions, and links to their detailed profiles, which can be used for further analysis and research.

Dataset

The dataset generated from this project includes:

  • Startup names
  • NIC profile URLs
  • Cohort numbers
  • Additional details from their profile pages

The dataset can be accessed from GitHub: NIC Startups Data

Installation

To run the script and notebook in this repository, you'll need to have Python and the following libraries installed:

  • numpy
  • pandas
  • requests
  • beautifulsoup4

You can install the required libraries using pip:

pip install numpy pandas requests beautifulsoup4

Usage

  1. Clone the repository:

    git clone https://github.com/YOUR_USERNAME/YOUR_REPOSITORY.git
    cd YOUR_REPOSITORY
  2. Run the Jupyter Notebook:

    • Open the Jupyter Notebook NIC_Hyderabad_Scraper.ipynb using JupyterLab or Jupyter Notebook.
    • Execute the cells in the notebook to scrape data from the NIC Hyderabad website and save it to a CSV file.
  3. Run the Python script:

    • Alternatively, you can run the Python script scrape_nic_data.py directly:
      python scrape_nic_data.py
  4. Access the Data:

    • Once the script or notebook has been executed, the scraped data will be saved as NIC_startups_data.csv in the project directory.
    • You can open this CSV file using any spreadsheet software or load it into a data analysis tool such as Pandas for further analysis.

Acknowledgements

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published