Make datasets work with chunk wise data processing #1980

merelcht · 2022-10-26T13:34:12Z

Description

Kedro nodes can currently accept only data that is already loaded, not lazily loaded data (at least in the standard approach). We should investigate whether any changes are needed in the design of the datasets so that Kedro nodes can accept iterators as arguments and yield data, allowing data to be loaded, processed, and saved chunk-wise.
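To make the request concrete, here is a minimal sketch of the load-process-save pattern in plain Python generators. It does not use any Kedro API: `load_chunks`, `double_values`, and `save_chunks` are hypothetical names standing in for a lazily loading dataset, a generator node, and an incrementally saving dataset, and the CSV columns are made up for illustration.

```python
import csv
import io
from typing import Dict, Iterable, Iterator, List, TextIO

Row = Dict[str, str]
Chunk = List[Row]


def load_chunks(source: TextIO, chunksize: int) -> Iterator[Chunk]:
    """Hypothetical lazy load: yield lists of at most `chunksize` rows."""
    chunk: Chunk = []
    for row in csv.DictReader(source):
        chunk.append(row)
        if len(chunk) == chunksize:
            yield chunk
            chunk = []
    if chunk:  # emit the final, possibly shorter, chunk
        yield chunk


def double_values(chunks: Iterable[Chunk]) -> Iterator[Chunk]:
    """Hypothetical generator node: process one chunk at a time and yield it."""
    for chunk in chunks:
        yield [{"id": r["id"], "value": str(int(r["value"]) * 2)} for r in chunk]


def save_chunks(chunks: Iterable[Chunk], sink: TextIO) -> int:
    """Hypothetical incremental save: write each chunk as soon as it arrives."""
    writer = csv.DictWriter(sink, fieldnames=["id", "value"], lineterminator="\n")
    writer.writeheader()
    written = 0
    for chunk in chunks:
        writer.writerows(chunk)
        written += 1
    return written


# In-memory stand-ins for the input and output files.
source = io.StringIO("id,value\n1,10\n2,20\n3,30\n")
sink = io.StringIO()
n_chunks = save_chunks(double_values(load_chunks(source, chunksize=2)), sink)
```

Because every stage is a generator, only one chunk is held in memory at a time. The open design question in this issue is how datasets and the runner would need to change to support this end to end.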

merelcht · 2023-07-05T10:15:14Z

Completed in #2161

merelcht added the Issue: Feature Request and Component: IO labels Oct 26, 2022

merelcht added this to the Redesign Catalog and Datasets milestone Oct 26, 2022

merelcht mentioned this issue Oct 26, 2022

Re-design io.core and io.data_catalog #1778

Open

merelcht closed this as completed Jul 5, 2023

merelcht removed this from the Redesign the API for IO (catalog) milestone Feb 2, 2024
