Tracking issues of aligning storage support with iceberg-java #408

Xuanwo · 2024-06-19T10:08:15Z

iceberg-java now supports

Although OpenDAL supports more storage services than this, it still makes sense to at least support all existing storage services. This issue will track the progress.

Once this has been implemented, iceberg-rust will have the same level of storage support as iceberg-java. I'm willing to implement those features, but I'm also open to helping review related changes. Please comment if you want to join the development and pick up one of them.

Xuanwo · 2024-06-19T10:09:29Z

cc @liurenjie1024, I'm not sure how your 0.3 release plan is going. Maybe we can include this one in it?

Most changes should be easy and involve no API changes. It's also fine to include them in the following 0.3.x releases.

liurenjie1024 · 2024-06-19T13:21:18Z

Hi @Xuanwo, there are two places to track 0.3 features:

I'm OK with waiting to add this to the 0.3 release. I'm just curious how to test against these? Or maybe we can start with declaring these features as experimental.

Xuanwo · 2024-06-19T13:31:28Z

> I'm just curious how to test against these? Or maybe we can start with declaring these features as experimental.

AWS S3: currently tested with MinIO. We can add a real S3 bucket with a sponsor.
Aliyun OSS: needs an OSS bucket (preferably located near us-east-1).
Azure ADLSv2: can be tested with Azurite. I'm also willing to provide test infrastructure as a Microsoft MVP.
GCP GCS: needs a GCS bucket (preferably located near us-east-1).
Hadoop HDFS: can be set up in CI directly (thanks, open source!).

I agree that we can label these features as experimental. Setting up the CI infrastructure requires time, more so than implementing those features.
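As a rough illustration of the split described above, the backends could be grouped by whether their tests can run entirely inside CI or need a real bucket. This is only a sketch: `Backend`, `TestSetup`, and the function names are hypothetical and do not exist in iceberg-rust or OpenDAL; the mapping simply restates the list in this comment.

```rust
/// Storage backends named in this thread.
/// Hypothetical type, not part of iceberg-rust or OpenDAL.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Backend {
    S3,
    Oss,
    AdlsV2,
    Gcs,
    Hdfs,
}

/// How a backend could be exercised in CI (hypothetical type).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum TestSetup {
    /// A service that can be started inside the CI job itself.
    LocalService(&'static str),
    /// A real cloud bucket that somebody has to provide.
    RealBucket,
}

impl Backend {
    const ALL: [Backend; 5] = [
        Backend::S3,
        Backend::Oss,
        Backend::AdlsV2,
        Backend::Gcs,
        Backend::Hdfs,
    ];

    /// Restates the testing plan from the comment above.
    fn test_setup(self) -> TestSetup {
        match self {
            Backend::S3 => TestSetup::LocalService("MinIO"),
            Backend::Oss => TestSetup::RealBucket,
            Backend::AdlsV2 => TestSetup::LocalService("Azurite"),
            Backend::Gcs => TestSetup::RealBucket,
            Backend::Hdfs => TestSetup::LocalService("HDFS"),
        }
    }

    /// True when the tests need no externally provided bucket.
    fn runs_without_cloud_account(self) -> bool {
        matches!(self.test_setup(), TestSetup::LocalService(_))
    }
}

/// Backends whose integration tests are blocked on someone providing a bucket.
fn backends_needing_buckets() -> Vec<Backend> {
    Backend::ALL
        .iter()
        .copied()
        .filter(|b| !b.runs_without_cloud_account())
        .collect()
}
```

Calling `backends_needing_buckets()` yields the OSS and GCS entries, which are the two backends that cannot be covered until a bucket is available.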

sdd · 2024-06-19T13:37:33Z

I have a question on the current FileIO - @Xuanwo is probably the right person to ask here.

It would be useful to be able to customize the OpenDAL Operator by attaching layers. Could we expose this capability somewhere? I'm more than happy to work on this.

liurenjie1024 · 2024-06-19T13:41:21Z

> > I'm just curious how to test against these? Or maybe we can start with declaring these features as experimental.
>
> AWS S3: currently tested with MinIO. We can add a real S3 bucket with a sponsor.
>
> Aliyun OSS: needs an OSS bucket (preferably located near us-east-1).
>
> Azure ADLSv2: can be tested with Azurite. I'm also willing to provide test infrastructure as a Microsoft MVP.
>
> GCP GCS: needs a GCS bucket (preferably located near us-east-1).
>
> Hadoop HDFS: can be set up in CI directly (thanks, open source!).
>
> I agree that we can label these features as experimental. Setting up the CI infrastructure requires time, more so than implementing those features.

Cool, let's move!

Xuanwo · 2024-06-19T13:41:35Z

> It would be useful to be able to customize the OpenDAL Operator by attaching layers. Could we expose this capability somewhere? I'm more than happy to work on this.

Do you have any detailed ideas? Are you talking about enabling some existing OpenDAL layers, or allowing users to implement something new based on FileIO?

I can imagine that enabling logging and retry layers, either by default or through configuration, might be useful.
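To make the idea of layers concrete, here is a minimal, standard-library-only sketch of stacking a retry layer and a logging layer around a storage backend. It does not use OpenDAL's real API: the `Storage` trait, `FlakyStorage`, `RetryLayer`, and `LoggingLayer` below are hypothetical stand-ins that only illustrate the wrapping pattern, and the retry policy (a fixed number of immediate retries) is an assumption.

```rust
use std::cell::{Cell, RefCell};

/// Hypothetical minimal storage interface; not OpenDAL's real trait.
trait Storage {
    fn read(&self, path: &str) -> Result<Vec<u8>, String>;
}

/// Hypothetical backend that fails a fixed number of times, then succeeds.
struct FlakyStorage {
    failures_left: Cell<u32>,
}

impl Storage for FlakyStorage {
    fn read(&self, path: &str) -> Result<Vec<u8>, String> {
        let left = self.failures_left.get();
        if left > 0 {
            self.failures_left.set(left - 1);
            return Err(format!("transient error reading {}", path));
        }
        // Echo the path back as the "file content" to keep the sketch small.
        Ok(path.as_bytes().to_vec())
    }
}

/// Retry layer: wraps any `Storage` and retries failed reads immediately,
/// up to `max_retries` extra attempts.
struct RetryLayer<S> {
    inner: S,
    max_retries: u32,
}

impl<S: Storage> Storage for RetryLayer<S> {
    fn read(&self, path: &str) -> Result<Vec<u8>, String> {
        let mut attempt = 0;
        loop {
            match self.inner.read(path) {
                Ok(data) => return Ok(data),
                Err(e) if attempt >= self.max_retries => return Err(e),
                Err(_) => attempt += 1,
            }
        }
    }
}

/// Logging layer: wraps any `Storage` and records the outcome of each read.
struct LoggingLayer<S> {
    inner: S,
    log: RefCell<Vec<String>>,
}

impl<S: Storage> LoggingLayer<S> {
    fn new(inner: S) -> Self {
        LoggingLayer { inner, log: RefCell::new(Vec::new()) }
    }
}

impl<S: Storage> Storage for LoggingLayer<S> {
    fn read(&self, path: &str) -> Result<Vec<u8>, String> {
        let result = self.inner.read(path);
        let outcome = if result.is_ok() { "ok" } else { "error" };
        self.log.borrow_mut().push(format!("read {} -> {}", path, outcome));
        result
    }
}
```

Wrapping a `FlakyStorage` in `RetryLayer` and then in `LoggingLayer` gives a value that still implements `Storage`, so callers do not need to know which layers are attached. Because the logging layer sits outside the retry layer here, it records one entry per logical read rather than one per attempt; swapping the order would log every attempt. OpenDAL ships its own logging and retry layers, so exposing layers through FileIO would reuse those rather than anything like the types above.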

Xuanwo · 2024-06-19T13:44:19Z

> I agree that we can label these features as experimental. Setting up the CI infrastructure requires time, more so than implementing those features.

Split into a new issue: #410.

I plan to track them after the 0.3 release.

jsimbadev · 2024-07-04T19:32:45Z

@Xuanwo I can take the Azure Data Lake FileIO implementation plus the corresponding infrastructure setup. Does that sound OK?

Xuanwo · 2024-07-05T02:18:10Z

> @Xuanwo I can take the Azure Data Lake FileIO implementation plus the corresponding infrastructure setup. Does that sound OK?

Welcome, have fun!

liurenjie1024 · 2024-08-06T02:28:56Z

cc @Xuanwo Do you still plan to finish this before 0.3.0, or can we postpone it to the next release?

Xuanwo · 2024-08-06T02:40:11Z

> cc @Xuanwo Do you still plan to finish this before 0.3.0, or can we postpone it to the next release?

There is some more work to do on the OpenDAL side. I believe we can let 0.3.0 go first.

Xuanwo self-assigned this Jun 19, 2024

Xuanwo added the "good first issue" and "help wanted" labels Jun 19, 2024

liurenjie1024 added this to the 0.3.0 Release milestone Jun 19, 2024

liurenjie1024 mentioned this issue Jun 19, 2024: Tracking issues of iceberg-rust v0.3.0 #348 (Closed, 73 tasks)

Xuanwo mentioned this issue Jun 19, 2024: Tracking issues of storage backend integration tests #410 (Open, 5 tasks)

jdockerty mentioned this issue Aug 4, 2024: feat: support for gcs storage #520 (Merged)

liurenjie1024 removed this from the 0.3.0 Release milestone Aug 6, 2024
