Input Data File Location #113

himanshurajput2 · 2016-08-01T23:16:43Z

Hello,

I am working on spark on yarn setup and running k-means algorithm. I want to know the location of the input data file generated by spark-perf or it is in memory only?

Thanks

dcvan24 · 2017-02-20T17:55:03Z

Hi, I have the same question. It seems the data will be read from/written to the HDFS specified in config.py. But I didn't see any files created in HDFS during the test. Is the input dataset created on-the-fly, or do we need to populate the datasets in HDFS before running the test? If it is the latter, anyone knows where the test datasets are? Thx!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Input Data File Location #113

Input Data File Location #113

himanshurajput2 commented Aug 1, 2016

dcvan24 commented Feb 20, 2017

Input Data File Location #113

Input Data File Location #113

Comments

himanshurajput2 commented Aug 1, 2016

dcvan24 commented Feb 20, 2017