Scrape website Data

Project Objective

Scrape data from cars.com and store the data in a spreadsheet.

The data scraped is Name, Mileage, Dealer Name, Ratings, Number of reviews, Price.Python and Jupyter Notebook was used this project. The libraries used are BeautifulSoup,Pandas, Requests and Openpyxl.I have applied 3 filters, certified BMW cars for a particular Zipcode to limit the data scraped for this project

Why is data scraping important

Data scraping, or web scraping, is important for:

Business Intelligence: Gathering market data, competitor analysis, and customer insights.
Market Research: Understanding customer preferences and behavior.
Lead Generation: Collecting contact information for potential customers.
Price Monitoring: Tracking competitor prices and optimizing pricing strategies.
Content Aggregation: Gathering relevant content for marketing and trend analysis.
Academic Research: Gathering large datasets for analysis and study.
Government and Public Data Analysis: Analyzing public datasets and social media data for policy-making and trend identification.

The fields scraped from the website are saved into a Pandas dataframe and it is saved to excel using the Openpyxl library.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
cars_data.xlsx		cars_data.xlsx
cars_data_singlepage.xlsx		cars_data_singlepage.xlsx
scraped_data.ipynb		scraped_data.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scrape website Data

Project Objective

Why is data scraping important

About

Releases

Packages

Languages

DataCounsel/DataScraper

Folders and files

Latest commit

History

Repository files navigation

Scrape website Data

Project Objective

Why is data scraping important

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages