You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've just about explored every possible angle I can think of on this problem except for my hardware (but i am not sure where I'd begin with this?) and I am not making any progress. I have a feeling it may be a compatibility issue but I am not sure? Anyway, I have written the code in the example linked below:
more or less line for line. I ran it in VS code in a conda environment, having installed pip and then pip installed the appropriate packages within the environment. The code runs perfectly until it reaches the dataset in the reverb buffer and then it simply freezes and fails to progress any further without ever throwing an error or leaving any signs as to what might be happening. The exact same problem occurred when I ran my own version of this code in a project I am doing. Everything grinds to a halt at next(iterator)
(tf_tutorial) harry@harry-Aspire-A315-58:~/Documents/Reinforcement Learning/tf$ python intro.py
2024-12-17 14:43:38.622260: I tensorflow/core/util/port.cc:113] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable `TF_ENABLE_ONEDNN_OPTS=0`.
2024-12-17 14:43:38.624297: I external/local_tsl/tsl/cuda/cudart_stub.cc:31] Could not find cuda drivers on your machine, GPU will not be used.
2024-12-17 14:43:38.651939: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
2024-12-17 14:43:38.651970: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered
2024-12-17 14:43:38.652896: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered
2024-12-17 14:43:38.657333: I external/local_tsl/tsl/cuda/cudart_stub.cc:31] Could not find cuda drivers on your machine, GPU will not be used.
2024-12-17 14:43:38.657493: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 AVX512F AVX512_VNNI FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
2024-12-17 14:43:39.165679: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT
[reverb/cc/platform/tfrecord_checkpointer.cc:162] Initializing TFRecordCheckpointer in /tmp/tmpyir39eez.
[reverb/cc/platform/tfrecord_checkpointer.cc:565] Loading latest checkpoint from /tmp/tmpyir39eez
[reverb/cc/platform/default/server.cc:71] Started replay server on port 42883
iterator next
[reverb/cc/client.cc:165] Sampler and server are owned by the same process (21034) so Table uniform_table is accessed directly without gRPC.
[reverb/cc/client.cc:165] Sampler and server are owned by the same process (21034) so Table uniform_table is accessed directly without gRPC.
[reverb/cc/client.cc:165] Sampler and server are owned by the same process (21034) so Table uniform_table is accessed directly without gRPC.
[reverb/cc/client.cc:165] Sampler and server are owned by the same process (21034) so Table uniform_table is accessed directly without gRPC.
[reverb/cc/client.cc:165] Sampler and server are owned by the same process (21034) so Table uniform_table is accessed directly without gRPC.
[reverb/cc/client.cc:165] Sampler and server are owned by the same process (21034) so Table uniform_table is accessed directly without gRPC.
I guess that the issue is hardware related. Your computer's CPU seems to be missing capabilities AVX2 AVX512F AVX512_VNNI FMA, required by the pre-built version of Tensorflow 2.15.1. You can build TF for your specific CPU from source, however you will very likely have a degraded performance. Please note that the CPU features are unrelated to CUDA / GPU support, which you do not have either.
Hi, thanks for your response. I'm thinking it might be either hardware related or down to the fact I'm using Ubuntu perhaps? Tried running it from source and I'm now getting strange errors. Usually relating to gcc. Fixing one error makes another pop up in its place. Not sure what to do. What would you recommend?
I've just about explored every possible angle I can think of on this problem except for my hardware (but i am not sure where I'd begin with this?) and I am not making any progress. I have a feeling it may be a compatibility issue but I am not sure? Anyway, I have written the code in the example linked below:
https://github.com/tensorflow/agents/blob/528cef7c4aedf54158a0564fdca446fe9942aa2a/docs/tutorials/1_dqn_tutorial.ipynb
more or less line for line. I ran it in VS code in a conda environment, having installed pip and then pip installed the appropriate packages within the environment. The code runs perfectly until it reaches the dataset in the reverb buffer and then it simply freezes and fails to progress any further without ever throwing an error or leaving any signs as to what might be happening. The exact same problem occurred when I ran my own version of this code in a project I am doing. Everything grinds to a halt at next(iterator)
My code can be seen below:
the terminal output was as follows:
the conda list output:
The text was updated successfully, but these errors were encountered: