
Recommendations for returning status codes / redirect flow #173

Closed Answered by j-mendez
benjohnsonn asked this question in Q&A

Hi there! I've been experimenting with spider for a bit to crawl page content and various on-page elements.

From what I understand, the library is lower-level than something like Scrapy (which I used to use), and so seems designed to add scraping functionality to a more robust crawler project.

I was wondering if you have any recommendations for tools/libraries that would also return values like status codes, redirect URL chains, and more? I've used reqwest a little, but I'm not sure whether there is a better solution out there!
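To make "status codes and redirect URL chains" concrete, here is a rough sketch of the shape of data being asked about. It is not spider's API and not a recommendation from this thread. It assumes redirects are followed by hand (with reqwest, automatic following can be turned off via `reqwest::redirect::Policy::none()` on the client builder) and that each response is reduced to a status code plus an optional `Location` header. The names `Hop`, `redirect_chain`, and `demo_fetch` are hypothetical, and the canned responses stand in for real HTTP calls so the sketch needs only the standard library.

```rust
use std::collections::HashSet;

/// One hop in a redirect chain: the URL requested and the status it returned.
#[derive(Debug, Clone, PartialEq)]
pub struct Hop {
    pub url: String,
    pub status: u16,
}

/// Follows redirects by hand and records every hop.
/// `fetch` is a hypothetical stand-in for an HTTP request made with automatic
/// redirects disabled; it returns (status code, Location header if present).
pub fn redirect_chain<F>(start: &str, max_hops: usize, mut fetch: F) -> Vec<Hop>
where
    F: FnMut(&str) -> (u16, Option<String>),
{
    let mut chain = Vec::new();
    let mut seen = HashSet::new();
    let mut current = start.to_string();

    // Stop at the hop limit, or as soon as a URL repeats (a redirect loop).
    while chain.len() < max_hops && seen.insert(current.clone()) {
        let (status, location) = fetch(&current);
        chain.push(Hop { url: current.clone(), status });

        // Only 3xx responses that carry a Location header continue the chain.
        // Assumption: Location is already an absolute URL; relative values
        // would need resolving against `current` first.
        match location {
            Some(next) if (300..400).contains(&status) => current = next,
            _ => break,
        }
    }
    chain
}

/// Canned responses standing in for a real site (illustrative data only).
pub fn demo_fetch(url: &str) -> (u16, Option<String>) {
    match url {
        "http://example.com/old" => (301, Some("https://example.com/old".to_string())),
        "https://example.com/old" => (302, Some("https://example.com/new".to_string())),
        "https://example.com/new" => (200, None),
        _ => (404, None),
    }
}
```

Calling `redirect_chain("http://example.com/old", 10, demo_fetch)` yields one `Hop` per request, so the final status and every intermediate redirect are available together. In a real crawler the closure would wrap whichever HTTP client is in use.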

For context, one use case would be feeding URLs from Google Search Console, Google Analytics, the sitemap, and URLs discovered from a crawl, to build an …

Replies: 1 comment

Answer selected by j-mendez
2 participants