Skip to content

Latest commit

 

History

History
12 lines (11 loc) · 525 Bytes

tutorial-to-create-spark-cluster.md

File metadata and controls

12 lines (11 loc) · 525 Bytes

xxx=project name say

  1. Gen a key pair: Console > EC2 > Key Pairs > create pair, call it xxx-key-pair, move into ~/.ssh
  2. Console > EMR
  3. Click advanced options
  4. cluster name: xxx-spark-cluster
  5. suggest use memory optimised types: r3.xlarge
  6. Add zeppelin sandbox and spark to the application list
  7. use key pair xxx-key-pair
  8. click create
  9. wait (best to wait until it not provisioning cos it won't have everything installed yet)
  10. when the SSH hyperlink shows up it will give you the command to be able ssh in