This project is designed to scrape listing and detail pages from a website and store the extracted data into Google Sheets. Additionally, it downloads images from the detail pages and uploads them to an FTP server.
- Clone the repository:

  `git clone https://github.com/claudioandriaan/Python_web_scraping_test.git`
- Install dependencies:

  `pip install beautifulsoup4 selenium gspread ftputil requests`

  (Note: BeautifulSoup is published on PyPI as `beautifulsoup4`.)
- Set up Google API credentials:
  - Obtain Google API credentials (JSON file) and save the file as `dot.json` in the project directory.
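Before running the scraper, it can help to verify that the credentials file is in place and looks like a valid key. The sketch below is a hypothetical sanity check, not part of the project; the field names (`type`, `client_email`, `private_key`) are those found in a Google service-account key, which is what `gspread` commonly consumes, and the `check_credentials` helper is an assumption.

```python
import json
import pathlib

# Fields typically present in a Google service-account key file
# (assumption: the project authenticates gspread with such a key).
REQUIRED_FIELDS = {"type", "client_email", "private_key"}

def check_credentials(path="dot.json"):
    """Return True if the JSON key file exists and has the expected fields."""
    p = pathlib.Path(path)
    if not p.is_file():
        return False
    data = json.loads(p.read_text())
    # set.issubset over a dict checks its keys
    return REQUIRED_FIELDS.issubset(data)
```

Running `check_credentials()` before the scrape gives a clearer error than a failed API call deep inside the script.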
- Configure FTP credentials:
  - Update the FTP server details (host, username, password) in the `scrape_data_from_link()` function in the script.
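The upload step configured above can be sketched as follows. The project itself uses `ftputil`; this minimal alternative uses the standard-library `ftplib` so it runs without extra dependencies. The `upload_image` function name and the argument layout are assumptions for illustration, not the script's actual API.

```python
import ftplib

def upload_image(host, user, password, local_path, remote_name):
    """Upload a single downloaded image to the FTP server in binary mode.

    A minimal sketch of the upload step; in the real script the host,
    username, and password live inside scrape_data_from_link().
    """
    ftp = ftplib.FTP(host)
    try:
        ftp.login(user, password)
        with open(local_path, "rb") as fh:
            # STOR transfers the file to the server under remote_name
            ftp.storbinary(f"STOR {remote_name}", fh)
    finally:
        ftp.quit()
```

Keeping the connection logic in one function makes it easy to swap in `ftputil.FTPHost` later without touching the scraping code.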
- Run the script:

  `python spiders.py -d <output_directory>`
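The `-d` flag above might be parsed with `argparse` along these lines. Only the flag itself appears in the run command; the `--directory` long form, the `output_dir` destination, and the help text are assumptions for illustration.

```python
import argparse

def build_parser():
    """Build a parser for the command line implied by:
    python spiders.py -d <output_directory>
    """
    parser = argparse.ArgumentParser(
        description="Scrape listing/detail pages and upload images."
    )
    # -d: where downloaded images (and any local output) are written
    parser.add_argument(
        "-d", "--directory", dest="output_dir", required=True,
        help="Directory where downloaded images are stored.",
    )
    return parser
```

With `required=True`, running the script without `-d` exits with a usage message instead of failing later on a missing directory.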