You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The same code works for ray.core with setting runtime_env to my working dir but it doesn't work for ray.data during ray.init()
Versions / Dependencies
2.36.1
Reproduction script
I am using ray.data modules and I have many ray objects and Im creating ray dataset dataset = ray.data.from_pandas_refs(ray_objs)
After that; Im using that dataset as:
for bs in datset_ref.iter_batches(prefetch_batches=0, batch_size=1000, batch_format="pandas", drop_last=False):
pass
When doing that it tries to calculate metrics of those dataset and I keep getting these error:
ModuleNotFoundError: No module named 'MY_WORKING_DIR'
(Actor pid=xxx) Unhandled error (suppress with 'RAY_IGNORE_UNHANDLED_ERRORS=1'): ray::_StatsActor.update_metrics()
At least one of the input arguments for this task could not be computed:
(Actor pid=xxx) ray.exceptions.RaySystemError: System error: No module named 'MY_WORKING_DIR'
I have set my working_dir as ray-runtime env during ray.init() by using this:
I don't get any error regarding No module named to my Project folder when using ray.core library but when I use ray.data library, it doesn't work and I keep getting that exception:
Is there a workaround to disable the metrics only from ray.data
Issue Severity
High: It blocks me from completing my task.
The text was updated successfully, but these errors were encountered:
pradipneupane
added
bug
Something that is supposed to be working; but isn't
triage
Needs triage (eg: priority, bug/not-bug, and owning component)
labels
Dec 19, 2024
What happened + What you expected to happen
The same code works for ray.core with setting runtime_env to my working dir but it doesn't work for ray.data during ray.init()
Versions / Dependencies
2.36.1
Reproduction script
I am using ray.data modules and I have many ray objects and Im creating ray dataset
dataset = ray.data.from_pandas_refs(ray_objs)
After that; Im using that dataset as:
When doing that it tries to calculate metrics of those dataset and I keep getting these error:
I have set my working_dir as ray-runtime env during ray.init() by using this:
I don't get any error regarding No module named to my Project folder when using ray.core library but when I use ray.data library, it doesn't work and I keep getting that exception:
Is there a workaround to disable the metrics only from ray.data
Issue Severity
High: It blocks me from completing my task.
The text was updated successfully, but these errors were encountered: