You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Modify the condition to evaluate the case we are NOT on local FS. Doing this way it's wasb(s) or s3 compatible if the cluster is based on this kind of distributed filesystem
Modify the condition to evaluate the case we are NOT on local FS. Doing this way it's wasb(s) or s3 compatible if the cluster is based on this kind of distributed filesystem
We identified that the problem is that code currently checks if the FileSystem.getScheme() is "hdfs" to know if it is running in local or distributed mode.
However, when running on cloud distributions (AWS EMR, Azure HDInsight, Google DataProc, and probably Databricks too) the scheme is not "hdfs" (but instead "s3", "wasbs", "gs", etc.).
The above pull requests fixes that. (We only tested it on WASB(s) but we believe it should fix the issue for other clouds as well).
It would be nice to have support for WASB(s) scheme.
The text was updated successfully, but these errors were encountered: