Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] add Trino JDBC and\or Arrow Flight SQL JDBC or ADBC  🚀 #7695

Open
2 of 8 tasks
YuriyGavrilov opened this issue Sep 19, 2024 · 9 comments
Open
2 of 8 tasks

Comments

@YuriyGavrilov
Copy link

YuriyGavrilov commented Sep 19, 2024

Search before asking

  • I had searched in the feature and found no similar feature requirement.

Description

First thing is to add existing Trino JDBC Drives Support as source and sink connector.

  • Add Trino JDBC as source connector
  • Add Trino JDBC as sink connector
  • Think about Trino as engine (maybe)
  • Add Arrow Flight SQL as source connector ( as JDBC or ADBC ) @sunxiaojian
  • Add Arrow Flight SQL as sink connector ( as JDBC or ADBC ) @sunxiaojian

Just to leave this here:

Some good example of Flight SQL server as reference:

Usage Scenario

  • Data synchronization from Trino
  • Data synchronization to Trino
  • Using Trino as SQL engine for transformer ( including data enrichment from source to sink by existing data in Trino )
  • Fast data synchronization by Arrow Flight support

Related issues

About Trino Dialect - #4162

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Code of Conduct

@sanjaychitransh
Copy link

hi ,I have completed trino connector development, tested working
I will submit PR

@YuriyGavrilov
Copy link
Author

@sanjaychitransh great job thx!

@sunxiaojian
Copy link
Contributor

Is there a real need to synchronize Trino data ? Trino itself can connect everything.

@YuriyGavrilov
Copy link
Author

Good question @sunxiaojian
For me there could be cases take data from Trino to somewhere outside.

@sunxiaojian
Copy link
Contributor

Good question @sunxiaojian For me there could be cases take data from Trino to somewhere outside.

I understand that Trino itself does not store data; instead, its data sources are the connector components it supports. Synchronizing Trino data essentially involves synchronizing the data from these underlying plugins. Does this mean we can bypass Trino and directly synchronize data from the underlying storage systems?

@YuriyGavrilov
Copy link
Author

Good question @sunxiaojian For me there could be cases take data from Trino to somewhere outside.

I understand that Trino itself does not store data; instead, its data sources are the connector components it supports. Synchronizing Trino data essentially involves synchronizing the data from these underlying plugins. Does this mean we can bypass Trino and directly synchronize data from the underlying storage systems?

Hi @sunxiaojian Trino could join different source and make some views so it could be complicated to find exactly source attributes in one seatunnel source.

Additionally I make some tests with iceberg table and it is working solution with bypass strategy to sink data to Trino iceberg.

@sunxiaojian
Copy link
Contributor

Good question @sunxiaojian For me there could be cases take data from Trino to somewhere outside.

I understand that Trino itself does not store data; instead, its data sources are the connector components it supports. Synchronizing Trino data essentially involves synchronizing the data from these underlying plugins. Does this mean we can bypass Trino and directly synchronize data from the underlying storage systems?

Hi @sunxiaojian Trino could join different source and make some views so it could be complicated to find exactly source attributes in one seatunnel source.

Additionally I make some tests with iceberg table and it is working solution with bypass strategy to sink data to Trino iceberg.

ok, I understand. Combining data from different sources is a great scenario.

@sunxiaojian
Copy link
Contributor

sunxiaojian commented Sep 25, 2024

Let me try to support arrow flight sql. arrow flight sql

@YuriyGavrilov
Copy link
Author

Let me try to support arrow flight sql. arrow flight sql

@sunxiaojian wow it is great news, I'm happy to know that there are people who care about arrow :) thx it will be super useful maybe not today but "tomorrow" for sure.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants