File download Problem in spark executor in cluster mode #47

Open
rabi112 opened this issue Feb 12, 2019 · 2 comments

rabi112 commented Feb 12, 2019

When the Spark driver and executor run on the same machine, the file is downloaded to /tmp/ and the executor can read it. When the executor runs on a different machine, the file is downloaded on the driver but not on the executor, and the executor fails with this exception:

FileScanRDD:54 - Reading File path: file:///tmp/1546428988_Monthly_salary_csv.csv, range: 0-125929, partition values: [empty row]
2019-02-12 07:32:18 ERROR Executor:91 - Exception in task 0.3 in stage 56.0 (TID 85)
java.io.FileNotFoundException: File file:/tmp/1546428988_Monthly_salary_csv.csv does not exist

@samuel-pt
Contributor

@bini0209
Can you provide the temp folder as a parameter?
You can use the "tempLocation" parameter to pass the temp folder location.

@mbidewell

I'm seeing this problem as well, even with tempLocation set. For some reason the file is downloaded only to the driver, so when the load attempts to create the DataFrame on the cluster, the executors cannot find the file.
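
A possible workaround, sketched here under assumptions and not taken from the library itself: if the downloaded file is small enough to fit in driver memory, it can be read on the driver and handed to Spark as rows, so no executor ever has to open a `file:` path that exists only on the driver's disk. The helper names `is_driver_local` and `read_rows_on_driver` are hypothetical, and the Spark call is shown only as a comment because it needs a running session.

```python
import csv
from typing import Dict, List
from urllib.parse import urlparse


def is_driver_local(path: str) -> bool:
    """Return True when the path refers to a node-local filesystem.

    A path with no scheme or with the `file` scheme is resolved on whichever
    machine opens it, which is why executors on other hosts cannot find it.
    """
    return urlparse(path).scheme in ("", "file")


def read_rows_on_driver(path: str) -> List[Dict[str, str]]:
    """Read a small CSV from the driver's local disk into a list of row dicts.

    All values are returned as strings; any type casting is left to the caller.
    """
    # Strip the `file:` scheme so the path can be opened with plain Python I/O.
    local_path = urlparse(path).path if path.startswith("file:") else path
    with open(local_path, newline="") as handle:
        return list(csv.DictReader(handle))


# On the driver, with an existing SparkSession named `spark` (assumed):
#   rows = read_rows_on_driver("file:///tmp/1546428988_Monthly_salary_csv.csv")
#   df = spark.createDataFrame(rows)  # rows are shipped to executors with the job
```

For larger files this approach would not scale; copying the file to storage that every node can reach (for example HDFS or a shared mount) and loading it from there is the usual alternative.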
