Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: bug in object store memory validation #332

Merged
merged 2 commits into from
Jul 5, 2022

Conversation

davidxia
Copy link
Contributor

@davidxia davidxia commented Jul 3, 2022

We are comparing memory to K8s Pod storage request
instead of memory request.

Add tests.

Expand error message with more information.
This error message is used in K8s events.
More information here might be helpful.

Update comments and log messages for clarity and smoother English usage.

Make repeated strings constants.

Checks

  • I've made sure the tests are passing.
  • Testing Strategy
    • Unit tests
    • Manual tests
    • This PR is not tested :(

@davidxia davidxia marked this pull request as ready for review July 3, 2022 20:46
@DmitriGekhtman
Copy link
Collaborator

Noting some things (which will be added to docs):

It's safest for Ray containers to set memory limits = memory request -- this is the surest way to avoid any inconsistencies in Ray's internal resource accounting. Ray and KubeRay generally use container resource limits to determine resource capacity. A large gap between requests and limits may introduce an inconsistency between what Ray thinks is available and what is actually available.

It's usually best not to configure object store memory size manually -- best to leave it to Ray to configure it. That's done here.

We are comparing memory to K8s Pod storage request
instead of memory request.

Add tests.

Expand error message with more information.
This error message is used in K8s events.
More information here might be helpful.

Update comments and log messages for clarity and smoother English usage.

Make repeated strings constants.
@Jeffwan Jeffwan merged commit 6a85afc into ray-project:master Jul 5, 2022
@davidxia davidxia deleted the dxia/patch4 branch July 6, 2022 11:55
lowang-bh pushed a commit to lowang-bh/kuberay that referenced this pull request Sep 24, 2023
* fix: bug in object store memory validation

We are comparing memory to K8s Pod storage request
instead of memory request.

Add tests.

Expand error message with more information.
This error message is used in K8s events.
More information here might be helpful.

Update comments and log messages for clarity and smoother English usage.

Make repeated strings constants.

* fixup: address PR feedback
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants