-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CONV] Asymmetric padding #4511
Conversation
For For For |
@Huyuwei It would be nice you could help to review too. |
@FrozenGene I have addressed your comments, please go on review. Thanks! |
Thanks. Could you help to resolve the conflict? cc @optima2005 @zhiics @yzhliu @Huyuwei anyone could help to review and handle it? |
25c3af9
to
b366641
Compare
rebase to master to fix conflict. |
@optima2005 could you rebase again? sorry for the delayed review |
b366641
to
421a290
Compare
@yzhliu I have rebased to master. Thanks! |
Cannot reproduce the ci failure in my local box. How to recheck? |
@optima2005 do you have gpu instance? if it's the environment difference, docker is probably the easiest way to reproduce. |
421a290
to
a492b9b
Compare
@yzhliu, I have GPU instance and using the docker environment. I am wondering if the failure is occasional and might be due to the CI environment. |
Thanks @optima2005 @FrozenGene |
NOTE: there has been two CI failures on the master since this PR get merged.
Both are on the CI instance aws.g4.n0.cuda0 , @yzhliu @zhiics it would be great if you can confirm if such error can be reproduced. It could either due to a real problem, which we need to address, or due to some caching issues(as i see this PR changes the memoize format), it can be confirmed by running docker from a clean folder. |
@tqchen Sure. Let me log into the machine and run the unit test from the master to see if it fails or not. |
@tqchen I just tested the lasted master branch locally and also tried to execute some of the failed tests in the the docker image. They both worked well. Let me clean the folder and enable the CI again to see if it solves the problem. |
Seems we still get some flaky error, see https://ci.tvm.ai/blue/rest/organizations/jenkins/pipelines/tvm/branches/PR-4635/runs/2/nodes/244/log/?start=0 @zhiics would be great if you can try to confirm again |
@tqchen Sure. I will double check tonight once I get home. |
Note, we might need to run the exact script as the error could due to interference between tests |
Yeah, I can confirm that it fails with the following error:
when I ran I can get many this type of errors when I just run
|
Not sure if it is because we don't have enough swap memory on it. I can create a swap space and see if it works. |
@zhiics The nvcc error might be irrelevant, we should focus on the error within the docker container given that error was pretty recent(could cause by this merge or a few one before it) can you first run a git history bisect to see if a particular commit caused the error? Also if the segfault on topi/tests/python/test_topi_depthwise_conv2d_back_weight.py is reproducible, it would be great to get a stack trace(you can install gdb on the docker and run python using gdb) |
also we need to update the tophub log files. workload does not match after the change as the padding dimension now becomes 4D |
opened a new thread #4646 to track this. |
* [CONV] Asymmetic padding * fix lint error * update for legalize, rocm and cudnn * add more test cases * change more symmetric padding * change conv2d winograd tests according orginal cases * remove 'alter_op_layout.h' header in bitserial.cc
* [CONV] Asymmetic padding * fix lint error * update for legalize, rocm and cudnn * add more test cases * change more symmetric padding * change conv2d winograd tests according orginal cases * remove 'alter_op_layout.h' header in bitserial.cc
* [CONV] Asymmetic padding * fix lint error * update for legalize, rocm and cudnn * add more test cases * change more symmetric padding * change conv2d winograd tests according orginal cases * remove 'alter_op_layout.h' header in bitserial.cc
The PR is to implement the 1st item in issue #2682.
I tried to find as more as possible the places where padding is handled as symmetric. But there may be some missing. Please comment is your find more.
The following module/file are kept using symmetic padding due to interface limitaion or I am not sure if changing to asymmetric would cause any trouble.
topi/python/topi/x86/conv2d_alter_op.py
topi/python/topi/rocm/conv2d.py
topi/include/topi/nn.h
convolution with 'cudnn' lib.