Thanks very much for posting this example for distributed training; I've been looking all over for examples.
I was able to create a multi-node Ignite cluster and followed the instructions to load the CIFAR data into the cluster. However, when I ran the command
./ignite-tf.sh start TEST_DATA models python3 official/resnet/cifar10_main.py
it printed the following:
Apache Ignite and TensorFlow integration command line tool that allows to
start, maintain and stop distributed deep learning utilizing Apache Ignite
infrastructure and data.
-c, --config=<cfg> Apache Ignite client configuration.
-h, --help Show this help message and exit.
-V, --version Print version information and exit.
Commands:
start Starts a new TensorFlow cluster and attaches to user script process.
stop Stops a running TensorFlow cluster.
attach Attaches to running TensorFlow cluster (user script process).
ps Prints identifiers of all running TensorFlow clusters.
Also, is there a way to just create a TensorFlow cluster on Ignite and run the distributed code from a Jupyter notebook?
Thanks,