-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[data] Add LanceDB Datasource #44853
[data] Add LanceDB Datasource #44853
Commits on Apr 18, 2024
-
add: New LanceDB datasource for Ray Data
This PR adds a new datasource for Ray Data that reads from LanceDB. This datasource is a thin wrapper around the LanceDB Python client that allows users to read data from LanceDB into Ray Data. On branch anyscalebrent/lancedb_datasource Changes to be committed: modified: python/ray/data/__init__.py modified: python/ray/data/datasource/__init__.py new file: python/ray/data/datasource/lancedb_datasource.py modified: python/ray/data/read_api.py
Configuration menu - View commit details
-
Copy full SHA for be6051e - Browse repository at this point
Copy the full SHA be6051eView commit details -
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/__init__.py modified: python/ray/data/datasource/lancedb_datasource.py
Configuration menu - View commit details
-
Copy full SHA for 2db26aa - Browse repository at this point
Copy the full SHA 2db26aaView commit details -
upd: read_api.py - fix linting errors with line length
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/read_api.py
Configuration menu - View commit details
-
Copy full SHA for 50f2bef - Browse repository at this point
Copy the full SHA 50f2befView commit details
Commits on Apr 23, 2024
-
upd: rename lancedb resources to lance resources
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/__init__.py modified: python/ray/data/datasource/__init__.py renamed: python/ray/data/datasource/lancedb_datasource.py -> python/ray/data/datasource/lance_datasource.py modified: python/ray/data/read_api.py
Configuration menu - View commit details
-
Copy full SHA for 43f3dd2 - Browse repository at this point
Copy the full SHA 43f3dd2View commit details -
upd: Additional updates to remove DB from Lance resources
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/__init__.py modified: python/ray/data/datasource/lance_datasource.py
Configuration menu - View commit details
-
Copy full SHA for 4577925 - Browse repository at this point
Copy the full SHA 4577925View commit details -
upd: Additional updates to remove DB from lance name
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/__init__.py modified: python/ray/data/datasource/__init__.py
Configuration menu - View commit details
-
Copy full SHA for e2d7419 - Browse repository at this point
Copy the full SHA e2d7419View commit details -
Configuration menu - View commit details
-
Copy full SHA for 94ce5f0 - Browse repository at this point
Copy the full SHA 94ce5f0View commit details -
upd: Lance ReadAPI comment for AZ support
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/read_api.py Signed-off-by: Brent Bain <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5f4b253 - Browse repository at this point
Copy the full SHA 5f4b253View commit details -
upd: Include limk to LanceDB docs in read_api.py
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/read_api.py
Configuration menu - View commit details
-
Copy full SHA for 486b71e - Browse repository at this point
Copy the full SHA 486b71eView commit details -
upd: lance_datasource - remove header comment
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/lance_datasource.py
Configuration menu - View commit details
-
Copy full SHA for 55ce8e6 - Browse repository at this point
Copy the full SHA 55ce8e6View commit details -
upd: Change init params to Optional instead of Unions
Signed-off-by: Brent Bain <[email protected]> The __init__ method of the LanceDatasource class now uses Optional instead of Union for the parameters. Changes to be committed: modified: python/ray/data/datasource/lance_datasource.py
Configuration menu - View commit details
-
Copy full SHA for 2afef9a - Browse repository at this point
Copy the full SHA 2afef9aView commit details
Commits on Apr 24, 2024
-
upd: lance_datasource - change to use to_batches
This change updates lance_datasource to a simpler implementation of to_batches. Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/lance_datasource.py
Configuration menu - View commit details
-
Copy full SHA for 757113a - Browse repository at this point
Copy the full SHA 757113aView commit details -
upd: lance_datasource - set parallelism based on number of fragments
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/lance_datasource.py
Configuration menu - View commit details
-
Copy full SHA for 931c6fe - Browse repository at this point
Copy the full SHA 931c6feView commit details -
upd: lance_datasource - change from yield to return
Yield isn't working as expected. Changing back to return. Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/lance_datasource.py
Configuration menu - View commit details
-
Copy full SHA for 5de8caa - Browse repository at this point
Copy the full SHA 5de8caaView commit details -
upd: lance_dataset comment - changing for consistent naming
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/lance_datasource.py
Configuration menu - View commit details
-
Copy full SHA for db4c528 - Browse repository at this point
Copy the full SHA db4c528View commit details -
upd: lance_datasource - changed how fragment reading is performed
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/lance_datasource.py
Configuration menu - View commit details
-
Copy full SHA for f39b18e - Browse repository at this point
Copy the full SHA f39b18eView commit details -
upd: lance datasource - comments updated
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/lance_datasource.py
Configuration menu - View commit details
-
Copy full SHA for 9b07881 - Browse repository at this point
Copy the full SHA 9b07881View commit details -
upd: lance_datasource Add storage options to pass to Lance
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/lance_datasource.py
Configuration menu - View commit details
-
Copy full SHA for cf3e700 - Browse repository at this point
Copy the full SHA cf3e700View commit details
Commits on Apr 26, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 02b4835 - Browse repository at this point
Copy the full SHA 02b4835View commit details -
Changes to lance_datasource parallelism handling. Added initial test for lance_datasource. Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/lance_datasource.py modified: python/ray/data/read_api.py new file: python/ray/data/tests/test_lance.py
Configuration menu - View commit details
-
Copy full SHA for 072ca1c - Browse repository at this point
Copy the full SHA 072ca1cView commit details -
upd: lance tests linting updates
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/datasource/lance_datasource.py modified: python/ray/data/tests/test_lance.py
Configuration menu - View commit details
-
Copy full SHA for eb07726 - Browse repository at this point
Copy the full SHA eb07726View commit details -
upd: data-test-requirements - add lancedb
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/requirements/ml/data-test-requirements.txt
Configuration menu - View commit details
-
Copy full SHA for a26625a - Browse repository at this point
Copy the full SHA a26625aView commit details -
upd: data BUILD - and Lance test
Signed-off-by: Brent Bain <[email protected]> Changes to be committed: modified: python/ray/data/BUILD
Configuration menu - View commit details
-
Copy full SHA for 9b0d4d1 - Browse repository at this point
Copy the full SHA 9b0d4d1View commit details