Windows support for autotvm - Do not merge #4548

jmorrill · 2019-12-19T21:47:18Z

This PR is not meant to give anyone a heart attack. @soiferj encouraged me to submit this PR so he could take a peek. So please don't code review seriously for merge. Feel free to close if you don't want to look at it :)

Two discuss topics related:
https://discuss.tvm.ai/t/unofficial-autotvm-on-windows-guide/4711 (Google doc has notes on quirks)
https://discuss.tvm.ai/t/added-windows-support-to-c-rpc-server/5007

Currently, there is no support for autotvm in Windows out of the box. Most challenges are related to fork() not being supported, which autotvm code uses extensively for getting multi-core performance in python. Having no fork() means data sent to process pools must be able to be pickled. Also, having no fork() means python pools or subprocesses need to be reused for performance reasons. This was very apparent in the local_executor.py, where starting a new python subprocess w/ python entry point could take almost 1000ms.

To overcome these issues I have opted to use pathos library in some spots which uses dill to serialize. dill can serialize much more than pickle, notably functions.

I've tried to keep the linux behavior the same, but have not tested it. Most of the time I "ifdef"ed the python code with os.name == 'nt' so it was easy to spot.

Notable problems are:

Need to fix ipv6 in base.py get_addr_family
If IP_ANY (0.0.0.0) was having trouble, so i replaced IP_ANY with 127.0.0.1
local_executor.py, timeouts are not supported because a pool is used for perf reasons. Timeouts will work on RPC server side if using the C++ RPC server.
I'm new to Python, so things may be able to be expressed better
Took some liberties with C++ RPC and main CMakeLists.txt, which may not be appreciated
Python RPC server, I restart the python subprocess after n-trials as some cuda kernels cause big leaks and killing the proc is the only way to fix. I suggest the C++ RPC server as its much faster.
Possibly many more.

… windows_support

FrozenGene · 2020-01-19T02:48:59Z

I think that's a good idea @FrozenGene . Maybe starting with the CPP server PR first, as it is more contained and less risky?

I can cherry pick to a new PR once the CI verifies my latest changes (resolved some merge conflicts with my branch).

sounds good to me.

soiferj · 2020-02-11T00:12:27Z

@jmorrill have you gotten a chance to work on the CPP server PR?

jmorrill · 2020-02-11T01:48:59Z

@jmorrill have you gotten a chance to work on the CPP server PR?

So sorry @soiferj! It's the time of the year where kids bring home sickness.
Anyways, created a PR here.
#4857

… windows_support

…support

… windows_support

tqchen · 2020-10-11T18:30:32Z

This PR is superseded by another PR to add rpc server support (into the mainline) Thanks @jmorrill for very insightful investigations.

jmorrill and others added 30 commits November 9, 2019 21:50

Initial checkin for Windows support

f2285f2

Merge branch 'master' into windows_support

c674ff9

Work to support autotvm.LocalRunner in Windows

74038ef

Fix line endings

7af065e

Merge branch 'master' of https://github.com/apache/incubator-tvm into…

1bc300c

… windows_support

Fixed socket.h build error

3b0c75a

fixed rpc_tracker exec on Windows. Added code comments to tracker.py

73a3600

Merge branch 'master' into windows_support

329352d

Merge branch 'master' of https://github.com/apache/incubator-tvm into…

0ac2a19

… windows_support

Merge remote-tracking branch 'upstream/master' into windows_support

4a0e8e3

Merge python/tvm/autotvm/measure/measure_methods.py

9b36e05

Merge remote-tracking branch 'upstream/master' into windows_support

9b2d1be

Merge remote-tracking branch 'upstream/master' into windows_support

f8b1243

Merge branch 'master' of https://github.com/apache/incubator-tvm into…

b1d92f7

… windows_support

Merge branch 'master' of https://github.com/apache/incubator-tvm into…

fc0e4fa

… windows_support

Merge remote-tracking branch 'upstream/master' into windows_support

2bcfd45

Optimize process pool usage in xgboost

7202717

Removed timeouts from local executor on Windows

255d46a

Added Windows support to C++ RPC Server

49f5e87

Fix upstream compilation error on MSVC

9846d2c

Fix container.h

38e042e

Merge remote-tracking branch 'upstream/master' into windows_support

534dfac

Merge CMakeLists.txt

6890ff8

XGBoostCostModel crash if num_threads==None

029b5ce

CXX RPC Server fix windows only defs

3d4ed58

Removed unneeded SetThreadPriority for Win32

7ed3745

Changed windows clang compile command. Removed unneeded printf call

ce4111c

Merge remote-tracking branch 'upstream/master' into windows_support

80d899e

Merge remote-tracking branch 'upstream/master' into windows_support

c539d5b

Merge remote-tracking branch 'upstream/master' into windows_support

4fd4061

jmorrill added 2 commits January 28, 2020 16:12

Merge remote-tracking branch 'upstream/master' into windows_support

05548f5

Merge remote-tracking branch 'upstream/master' into windows_support

eee38cc

merge latest from remote. fix up cmakelists.txt

8be20b4

jmorrill mentioned this pull request Feb 11, 2020

Windows Support for cpp_rpc #4857

Merged

masahi mentioned this pull request Feb 17, 2020

[WINDOWS][AutoTVM] OSError: [WinError 10048] Only one usage of each socket address (protocol/network address/port) is normally permitted and OSError: [WinError 10049] The requested address is not valid in its context #4821

Closed

jmorrill and others added 20 commits February 17, 2020 14:19

Merge remote-tracking branch 'upstream/master' into windows_support

5e0b247

Merge remote-tracking branch 'upstream/master' into windows_support

6840029

Fixup task.py from merge

292c218

Merge remote-tracking branch 'upstream/master' into windows_support

65bbd9e

Change Windows untar to use python vs WSL

4a329df

Merge remote-tracking branch 'upstream/master' into windows_support

cc0b1e7

Remove export all symbols in main cmake

85796dd

Merge remote-tracking branch 'upstream/master' into windows_support

69d6955

Merge branch 'master' of https://github.com/apache/incubator-tvm into…

e53e224

… windows_support

Merge branch 'master' of https://github.com/apache/incubator-tvm into…

f71a01e

… windows_support

Merge commit '4683c3f55c51e3e79fa3b099ae7a764130be261e' into windows_…

7523514

…support

Merge commit '03ff0cd06051262bebedab7592729f2cf3ed87e8' into windows_…

1e0bbb9

…support

Merge commit '316ce055ce11ae5ecb2d02a1438df26a5ef4ef4a' into windows_…

53e48af

…support

Merge commit 'b796c13ccc7f24eadb2c61738060fe389c3e72c9' into windows_…

89e820d

…support

Merge commit '54975a3fd24fa45b815be39075f4614e53009444' into windows_…

c224e53

…support

Merge commit '6b840fa9672124bebccff5322a59ab1f159e74b8' into windows_…

85d7f42

…support

Merge commit 'e63e08febd682f40a536075998a6839bccccd3c6' into windows_…

7ba2253

…support

Merge branch 'master' of https://github.com/apache/incubator-tvm into…

72c2807

… windows_support

removed exporting for _tvm_main_ to get successful builds on Windows

1c786cf

Merge branch 'master' of https://github.com/apache/incubator-tvm into…

c727fb1

… windows_support

tqchen closed this Oct 11, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Windows support for autotvm - Do not merge #4548

Windows support for autotvm - Do not merge #4548

jmorrill commented Dec 19, 2019

FrozenGene commented Jan 19, 2020

soiferj commented Feb 11, 2020

jmorrill commented Feb 11, 2020

tqchen commented Oct 11, 2020 •

edited

Loading

Windows support for autotvm - Do not merge #4548

Windows support for autotvm - Do not merge #4548

Conversation

jmorrill commented Dec 19, 2019

FrozenGene commented Jan 19, 2020

soiferj commented Feb 11, 2020

jmorrill commented Feb 11, 2020

tqchen commented Oct 11, 2020 • edited Loading

tqchen commented Oct 11, 2020 •

edited

Loading