Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ignite-tf start issues #1

Open
ghost opened this issue Aug 25, 2020 · 0 comments
Open

ignite-tf start issues #1

ghost opened this issue Aug 25, 2020 · 0 comments

Comments

@ghost
Copy link

ghost commented Aug 25, 2020

Thanks very much for posting this example for distributed training, I've been looking all over for some examples.

I was able to create a multi-node ignite cluster and followed the instructions to load the Cifar data to the cluster however, when I followed the command ./ignite-tf.sh start TEST_DATA models python3 official/resnet/cifar10_main.py it gives me the following result:

Apache Ignite and TensorFlow integration command line tool that allows to
start, maintain and stop distributed deep learning utilizing Apache Ignite
infrastructure and data.
  -c, --config=<cfg>   Apache Ignite client configuration.
  -h, --help           Show this help message and exit.
  -V, --version        Print version information and exit.
Commands:
  start   Starts a new TensorFlow cluster and attaches to user script process.
  stop    Stops a running TensorFlow cluster.
  attach  Attaches to running TensorFlow cluster (user script process).
  ps      Prints identifiers of all running TensorFlow clusters.

also, is there a way to just create a tensorflow cluster on ignite and perform the distributed code on Jupyter notebook?

Thanks,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

0 participants