Skip to content

MapReduce in Python to show the top word count in a catalog feed

Notifications You must be signed in to change notification settings

vchandev/mapReduceVert

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

mapReduceVert

First, make mapper.py and reducer.py into executable files by entering the following in Terminal:

$ chmod +x mapper.py
$ chmod +x reducer.py

To run the program, enter the following into Terminal:

$ cat testfeed.tsv | ./mapper.py | ./reducer.py

This basically passes the output from testfeed.tsv to be taken as input for mapper.py, and the same for reducer.py

To print out the results of word count in a text file, use the following command:

$ cat testfeed.tsv | ./mapper.py | ./reducer.py > output.txt

About

MapReduce in Python to show the top word count in a catalog feed

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages