Angry Narwhal

A C# console app that will crawl all of the publically available pages within a website, capture each page URL, and store that in a CSV.

To set the domain that you want to crawl, change the baseUrl value in the Program.cs file, e.g.

using AngryNarwhal;

Console.WriteLine("AngryNarwhal v.0.1.0");

var baseUri = "https://examplesite.com"; // Replace with the website you want to crawl
var outputCsvPath = "sitemap.csv"; // Change this if you want to output the CSV to a location other than the bin folder

var crawler = new WebCrawler(baseUri);
var pages = await crawler.CrawlAsync();

CsvWriterHelper.WriteUrlsToCsv(pages, outputCsvPath);

Console.WriteLine($"Crawling complete. Sitemap saved to {outputCsvPath}");

By default, the CSV will be saved in the build folder, so look under bin or change the value of the outputCsvPath variable to point to a specific location.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.idea/.idea.AngryNarwhal/.idea		.idea/.idea.AngryNarwhal/.idea
AngryNarwhal		AngryNarwhal
.gitignore		.gitignore
AngryNarwhal.sln		AngryNarwhal.sln
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Angry Narwhal

About

Languages

License

davidisnotnull/csv-sitemap-generator

Folders and files

Latest commit

History

Repository files navigation

Angry Narwhal

About

Topics

Resources

License

Stars

Watchers

Forks

Languages