-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TVM RPC will fail when allocating large arrays on an Android phone #7758
Comments
The last error message was generated using the latest version and the following code.
I cannot allocate a single array with the above code. |
It seems that this is due to the Android memory limit for the TVM RPC APP. |
I've met some similar problem like this. This seems to be a system limitation? |
Seems due to the system limitation, please feel free to followup on https://discuss.tvm.apache.org/ |
Hi
I want to deploy the BERT-base model on an Android phone. One of its params has shape (30522, 768) with dtype float32, the RPC connection will be reset each time I allocate this array.
The error message:
The BERT model was imported from Torch
The
optimize_bert
function has the following passes:I also tried the commit #5516 for
ring_buffer.h
, but didn't work.It seems that it will fail when the allocated space is over about 400 MB.
The text was updated successfully, but these errors were encountered: