Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[data] Experimental strict schema mode #34336

Merged
merged 42 commits into from
Apr 18, 2023
Merged

Conversation

ericl
Copy link
Contributor

@ericl ericl commented Apr 12, 2023

Why are these changes needed?

Add an experimental strict schema mode for Datasets, which can be enabled with RAY_DATA_STRICT_MODE=1. In this mode,

  • All read_apis return structured data, never standalone Python objects.
  • Standalone Python objects are prohibited from being returned from map / map_batches.
  • Standalone Numpy arrays are prohibited from being returned from map / map_batches.
  • There is no more special interpretation of single-column schema containing just __value__ as a column.
  • The default batch format is "numpy".
  • schema() returns a unified Schema class.

Closes #34175

Signed-off-by: Eric Liang <[email protected]>
@ericl ericl added tests-ok The tagger certifies test failures are unrelated and assumes personal liability. and removed tests-ok The tagger certifies test failures are unrelated and assumes personal liability. labels Apr 12, 2023
Signed-off-by: Eric Liang <[email protected]>
@ericl ericl added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Apr 18, 2023
@ericl ericl merged commit 74c9fd6 into ray-project:master Apr 18, 2023
elliottower pushed a commit to elliottower/ray that referenced this pull request Apr 22, 2023
ProjectsByJackHe pushed a commit to ProjectsByJackHe/ray that referenced this pull request May 4, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
tests-ok The tagger certifies test failures are unrelated and assumes personal liability.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Data] UX improvements
5 participants