Hello everyone,
recently I tried Reverb with the AMD GPU build of TensorFlow (tensorflow-rocm).
Doing it the straightforward way, in a completely new venv:
pip3 install tensorflow-rocm
pip3 install dm-reverb
got me this error:
>>> import tensorflow
2023-01-21 14:18:31.516305: I tensorflow/core/platform/cpu_feature_guard.cc:193] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: SSE3 SSE4.1 SSE4.2 AVX AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
>>> import reverb
Traceback (most recent call last):
File "/tmp/venv/lib/python3.10/site-packages/reverb/pybind.py", line 4, in <module>
from .libpybind import *
ImportError: /tmp/venv/lib/python3.10/site-packages/reverb/libschema_cc_proto.so: undefined symbol: _ZNK6google8protobuf7Message25InitializationErrorStringB5cxx11Ev
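A note on reading that symbol: the `B5cxx11` fragment looks like the `[abi:cxx11]` tag that libstdc++ attaches when code is compiled with `_GLIBCXX_USE_CXX11_ABI=1`, so the missing symbol may point to an ABI mismatch between the prebuilt dm-reverb wheel and the tensorflow-rocm wheel. This is only a guess, not a confirmed diagnosis. Below is a minimal sketch for checking the two sides; the helper names `has_cxx11_abi_tag` and `abi_flag_from_compile_flags` are hypothetical and not part of Reverb or TensorFlow.

```python
import re

# Assumption: "B5cxx11" in an Itanium-mangled name is the [abi:cxx11] tag
# that libstdc++ adds when _GLIBCXX_USE_CXX11_ABI=1 is in effect.
CXX11_ABI_TAG = "B5cxx11"


def has_cxx11_abi_tag(mangled_symbol: str) -> bool:
    """Return True if the mangled C++ symbol carries the [abi:cxx11] tag."""
    return CXX11_ABI_TAG in mangled_symbol


def abi_flag_from_compile_flags(flags):
    """Extract the _GLIBCXX_USE_CXX11_ABI value (0 or 1) from compiler flags.

    Returns None when the define does not appear in the list.
    """
    for flag in flags:
        match = re.fullmatch(r"-D_GLIBCXX_USE_CXX11_ABI=([01])", flag)
        if match:
            return int(match.group(1))
    return None


# The symbol from the ImportError above.
missing = "_ZNK6google8protobuf7Message25InitializationErrorStringB5cxx11Ev"
print("symbol expects the cxx11 ABI:", has_cxx11_abi_tag(missing))
```

With TensorFlow installed, the list returned by `tf.sysconfig.get_compile_flags()` can be passed to `abi_flag_from_compile_flags`. If that yields 0 while the missing symbol carries the tag, the two wheels were probably built with different ABI settings, and a Reverb rebuild would need to match TensorFlow's flag.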
So I tried to recompile Reverb myself, basically using an AMD-provided Docker base image:
ARG cpu_base_image="rocm/tensorflow-build:latest-python3.10-rocm5.4.0"
With a little tinkering (I will provide the details if necessary) everything compiled. But now I'm stuck at linking with the message below.
It's a bit strange as the local compiler seems to have those symbols defined in its libraries.
These build errors are hard to figure out, and I doubt you will get much help. I'm not saying this to be mean; I just believe in transparency, as it would frustrate me if I were you. We don't use AMD internally (same with macOS), so we don't maintain or test it. Reverb is used heavily but is mostly in maintenance mode, given it is "feature complete" and performs well. Here are my only tips. I do the stable builds just because I kind of like doing it.
When I build the Reverb nightly and the stable releases, I use the same instructions and code.
For the latest stable ROCm, the source for Reverb would have been at this tag: https://github.com/deepmind/reverb/tree/v0.10.0. I don't think that will fix your issue, but you will have the Reverb source that was used to build against TensorFlow 2.11 in December.
Wishing you luck. I just fought a horrible build issue that ended up being silly but took me days and asking (begging) some experts to look.