
Understanding the limitations of MultiZarrToZarr #320

Closed Answered by martindurant
croth1 asked this question in Q&A
zarr itself does not seem to have the concept of virtual datasets

Correct, zarr is a strict hierarchy, so the only way to refer to other nodes in a tree, or to other trees, is via storage-layer tricks such as kerchunk. There are some ideas floating around for zarr V3 that could allow for it, but I am not aware of a concrete proposal. Currently kerchunk does not even follow HDF references within the same file, but only for lack of compelling need (https://github.com/fsspec/kerchunk/blob/main/kerchunk/hdf.py#L125 is a stub to eventually support this).
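
To make the "storage layer trick" concrete, here is a minimal sketch of what a reference set looks like, assuming the version-1 JSON layout that kerchunk writes (zarr keys mapped either to inline text or to a `[url, offset, length]` triple). The bucket, file names, byte ranges and the `resolve` helper are all hypothetical and only there for illustration; this is not kerchunk's own code.

```python
import json

# Hypothetical reference set: zarr keys point at byte ranges in other files.
refs = {
    "version": 1,
    "refs": {
        # Metadata keys are stored inline as JSON text.
        ".zgroup": json.dumps({"zarr_format": 2}),
        "temp/.zarray": json.dumps({
            "shape": [2, 1], "chunks": [1, 1], "dtype": "<f4",
            "compressor": None, "fill_value": None, "filters": None,
            "order": "C", "zarr_format": 2,
        }),
        # Chunk keys are [url, offset, length] into the original files.
        "temp/0.0": ["s3://example-bucket/a.nc", 4096, 1000],
        "temp/1.0": ["s3://example-bucket/b.nc", 8192, 1000],
    },
}


def resolve(refs, key):
    """Describe where the bytes for a zarr key live (illustrative helper)."""
    entry = refs["refs"][key]
    if isinstance(entry, str):
        return ("inline", entry)
    url, offset, length = entry
    return ("remote", url, offset, length)
```

zarr itself only ever sees flat keys such as `temp/1.0`; the indirection to another file happens entirely in the storage layer.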

with kerchunk's MultiZarrToZarr, one can only combine chunks over a new axis, or if the axis is chunked to 1

Yes, MultiZarrToZarr is geared to…
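
As a rough illustration of the constraint quoted above, here is a simplified sketch of concatenating chunk references along one axis by renumbering chunk keys. It is not kerchunk's actual implementation; `concat_chunk_keys` and its arguments are hypothetical. The point it shows: references can only be stitched together at chunk boundaries, which always holds for a new axis or an axis chunked to 1.

```python
def concat_chunk_keys(inputs, axis, chunk_len):
    """Merge chunk references along `axis` by shifting chunk indices.

    inputs: list of (length_along_axis, {chunk_key: reference}) pairs,
            with keys like "0.0" (one index per dimension).
    chunk_len: chunk size along `axis`, shared by all inputs.
    """
    combined = {}
    offset = 0  # number of chunks already placed along `axis`
    for i, (length, chunks) in enumerate(inputs):
        is_last = i == len(inputs) - 1
        # A partial chunk is only tolerable at the very end of the axis.
        if length % chunk_len and not is_last:
            raise ValueError("input does not end on a chunk boundary")
        for key, ref in chunks.items():
            idx = [int(part) for part in key.split(".")]
            idx[axis] += offset
            combined[".".join(str(n) for n in idx)] = ref
        offset += -(-length // chunk_len)  # ceiling division
    return combined


a = (2, {"0.0": ["a.nc", 0, 10]})
b = (2, {"0.0": ["b.nc", 0, 10]})
merged = concat_chunk_keys([a, b], axis=0, chunk_len=2)
```

With `chunk_len=1` every input length is a whole number of chunks, so the check never fails; with larger chunks, an input of length 3 followed by another input would be rejected.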

Replies: 3 comments 1 reply

Answer selected by croth1
@pixalytics
Category
Q&A
3 participants