Skip to content

App for sorting a very large csv by using chunking. Just for fun.

Notifications You must be signed in to change notification settings

adrianh-za/netcore-process-large-csv

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This was done for fun!

The purpose of the app is to sort a large unsorted (ten million records - 500MB) csv file without loading the entire file into memory and sorting,

The app will chunk the large CSV file into smaller files. Each file is then sorted in parallel. Once all the chunked files are sorted, they are then merged into a single sorted CSV file.

The solution consists of two consoles apps, and a single class library

  • GenerateLargeCSV

    • This is for generating the large, unsorted CSV.
  • ProcessLargeCSV

    • This is for sorting the large, unsorted CSV.
  • LibLargeCSV

    • The lib containing the CSV utils and data type.

About

App for sorting a very large csv by using chunking. Just for fun.

Topics

Resources

Stars

Watchers

Forks

Languages