-
Notifications
You must be signed in to change notification settings - Fork 859
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Windows 10 Pro, CUDA 11.0 +PyTorch 1.7.1+ issue #866
Comments
@jeffxtang, can you now verify and close this issue |
you mean to test again and verify if the problem is gone? or just verify what @maaquib - thanks - posted for me and close it? i may not have time to test this again today.. |
Did another sanity test with the commit 2541292 and was stuck at 82% EXECUTING for about an hour. Full log is attached. Is this still caused by "OSError: [WinError 10013] An attempt was made to access a socket in a way forbidden by its access permissions"? |
About the error "OSError: [WinError 10013] An attempt was made to access a socket in a way forbidden by its access permissions", it was mentioned at #828 (comment) and the Windows Native TroubleShooting may help - "If you are building from source then you may have to change the port number for inference, management and metrics apis as specified in frontend/server/src/test/resources/config.properties"
|
@jeffxtang: For running on HTTPS, you will need to generate and provide the |
From the shared logs, I am still observing the same error at different places and a bunch of test cases have failed.
TorchServe Sanity suite executes a bunch of integration test cases which uses different ports to load the model workers. In your case it seems your user doesn't have access to some ports or the ports required are already in use by some other process and hence the errors. We haven't observed any such error in out build/test machines/environments, which has full permissions for accessing the resources. From the other threads, I understand that you were able to successfully install from source and run basic model serving through REST as well as gRPC APIs and its the frontend build which is failing for you? |
@harshbafna yes I'm able to run the basic model serving with REST and GRPC APIs but haven't made the sanity test work. I'll try to make https work to see it'll fix the problem. |
I followed the configuration Enable SSL's Examples' two options and retried the sanity test but for option 1 I was still stuck at 82% with the same errors and for option 2, the test finished quickly with the messages
|
@jeffxtang What was the failed test? Can you provide the logs? |
|
Could you also zip and attache the reports generated at following path:
From the shared logs, it says it ran into some FileNotFound exception while trying to start the test suite.
You should not make any changes in the config.properties file for running the test suites, it will mess up the the expected outputs. The reason for pointing you to that doc was that you were trying to use The sanity suite/regression suite or any test case should be executed without any changes in the congif, unless you have made corresponding changes in the code. To run the sanity suite all you need is to run |
On behalf of @jeffxtang
On my Windows 10 Pro, I had CUDA 11.0 and PyTorch 1.7.1+ (it was 1.7.0 but was upgraded to 1.7.1 when I ran
python .\ts_scripts\install_dependencies.py --environment=dev
, with == changed to >= in the requirements\torch.txt)The first time I ran
python .\torchserve_sanity.py
, 151 tests completed, 20 failed - result win_gpu_torch171_sanity_test.txtIn the TorchServe on Windows Troubleshooting, it says "you may have to change the port number for inference, management and metrics apis as specified in frontend/server/src/test/resources/config.properties, all files in frontend/server/src/test/resources/snapshot/* and frontend/server/src/main/java/org/pytorch/serve/util/ConfigManager.java, but it's unclear exactly how to make the changes. So I discarded the requirements/torch.txt changes, and installed torch 1.6.0 etc and reran the sanity test. The result is in win_gpu_torch160_sanity_test.txt, with the same number of failed tests, which may be caused by the same error
OSError: [WinError 10013] An attempt was made to access a socket
reported in issue #828.Originally posted by @jeffxtang in #851 (comment)
The text was updated successfully, but these errors were encountered: