[Airflow] Fixed run format returned by the `lineage_parent_id` macro #2489
Conversation
Thanks for opening your first OpenLineage pull request! We appreciate your contribution. If you haven't already, please make sure you've reviewed our guide for new contributors (https://github.com/OpenLineage/OpenLineage/blob/main/CONTRIBUTING.md). |
LGTM. If the CI goes through, I think we can merge that, @blacklight.
@blacklight still want to contribute that? I think this
|
@mobuchowski yes, I'm still planning to merge it. I just haven't had much bandwidth these days to look into the failing tests. I'll try to get the tests to pass today.
@blacklight yes. |
The relevant test logs are in the artifacts tab on circleCI: https://app.circleci.com/pipelines/github/OpenLineage/OpenLineage/9985/workflows/e3653e9e-2ffd-4f11-bf6d-673d80738427/jobs/199074/artifacts |
…ents. Some pieces of documentation and tests still referenced the previous arguments for the `lineage_run_id` and `lineage_parent_id` macros. Signed-off-by: Fabio Manganiello <[email protected]>
85f0017 to 46f0855
…arent_id`.
- Returned format: `<namespace>/<name>/<run_id>`.
- `name` should be `<dag_id>.<task_id>`, not a UUID.
- `run_id` should be a UUID, not `<run_timestamp>.<try_number>`.
Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: David Goss <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
- Both `lineage_run_id` and `lineage_parent_id` should expose the same interface - only a `TaskInstance` object is now required as argument.
- Import `_DAG_NAMESPACE` instead of inferring it again.
Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Fabio Manganiello <[email protected]>
…in listener.on_task_instance_running (OpenLineage#2492) Signed-off-by: Kacper Muda <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Kacper Muda <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Julien Le Dem <[email protected]> Signed-off-by: Julien Le Dem <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Zhenqiu Huang <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
…efactored the existing loadOpenLineageYaml(InputStream) method (OpenLineage#2490) * Added a loadOpenLineageJson(InputStream) method, and refactored the loadOpenLineageYaml(InputStream) method * Improved the error handling for loadOpenLineageYaml(ConfigPathProvider) * Explicitly state the exceptions, despite being unchecked exceptions Signed-off-by: Damien Hawes <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
…penLineage#2478) * Add the MSK IAM transport to support AWS MSK cluster instances without additional custom ones Signed-off-by: Mattia Bertorello <[email protected]> * Remove support to get the default from instance metadata Signed-off-by: Mattia Bertorello <[email protected]> * Remove test for instance metadata Signed-off-by: Mattia Bertorello <[email protected]> * Use only debug level logs for the transport Signed-off-by: Mattia Bertorello <[email protected]> * Remove redundant checks Signed-off-by: Mattia Bertorello <[email protected]> --------- Signed-off-by: Mattia Bertorello <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Zhenqiu Huang <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
…ge#2502) Bumps [com.amazonaws:amazon-kinesis-producer](https://github.com/awslabs/amazon-kinesis-producer) from 0.15.9 to 0.15.10. - [Release notes](https://github.com/awslabs/amazon-kinesis-producer/releases) - [Changelog](https://github.com/awslabs/amazon-kinesis-producer/blob/master/CHANGELOG.md) - [Commits](awslabs/amazon-kinesis-producer@v0.15.9...v0.15.10) --- updated-dependencies: - dependency-name: com.amazonaws:amazon-kinesis-producer dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Signed-off-by: Fabio Manganiello <[email protected]>
…OpenLineage#2501) Bumps plugin.serialization from 1.9.22 to 1.9.23. --- updated-dependencies: - dependency-name: plugin.serialization dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Signed-off-by: Fabio Manganiello <[email protected]>
…OpenLineage#2500) Bumps plugin.serialization from 1.9.22 to 1.9.23. --- updated-dependencies: - dependency-name: plugin.serialization dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Zhenqiu Huang <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Pawel Leszczynski <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Pawel Leszczynski <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Jakub Dardzinski <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
OpenLineage#2528) Signed-off-by: Maciej Obuchowski <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Martynov Maxim <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
…ents. Some pieces of documentation and tests still referenced the previous arguments for the `lineage_run_id` and `lineage_parent_id` macros. Signed-off-by: Fabio Manganiello <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Pawel Leszczynski <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: Kacper Muda <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
* Add missing changes to changelog. --------- Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
…in it's release job (OpenLineage#2575) Signed-off-by: Maciej Obuchowski <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
…t itself (OpenLineage#2576) Signed-off-by: Maciej Obuchowski <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
…terfaces (OpenLineage#2577) Signed-off-by: Maciej Obuchowski <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
Signed-off-by: merobi-hub <[email protected]> Signed-off-by: Fabio Manganiello <[email protected]>
b342029 to 4267b5f
Signed-off-by: Fabio Manganiello <[email protected]>
@mobuchowski I have opened a new PR here. I had conflicting commits pushed both with my work and personal account, then upon I'll close this one and wait for the tests to complete on the other side. |
Problem

The macro currently returns a string in the format `<namespace>/<name>/<run_id>`. However, in this string:

- `name` should be `<dag_id>.<task_id>`, not a UUID.
- `run_id` should be a UUID, not `<run_timestamp>.<try_number>`.

This is to comply with the OpenLineage conventions used everywhere else.

Closes: #2488
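
For illustration, the expected `<namespace>/<name>/<run_id>` shape can be sketched in a few lines of Python. This is a minimal sketch and not the code from this PR: `FakeTaskInstance` and `format_parent_id` are hypothetical names, the namespace is passed in by hand, and the run ID is a random UUID used only as a placeholder, since how the integration actually derives it is not shown here.

```python
import uuid
from dataclasses import dataclass


@dataclass
class FakeTaskInstance:
    """Hypothetical stand-in for an Airflow TaskInstance (only the fields used here)."""
    dag_id: str
    task_id: str


def format_parent_id(namespace: str, ti: FakeTaskInstance, run_id: str) -> str:
    """Build '<namespace>/<name>/<run_id>' with name = '<dag_id>.<task_id>'."""
    # Reject anything that does not parse as a UUID, such as the old
    # '<run_timestamp>.<try_number>' style value. Raises ValueError.
    uuid.UUID(run_id)
    name = f"{ti.dag_id}.{ti.task_id}"
    return "/".join((namespace, name, run_id))


ti = FakeTaskInstance(dag_id="example_dag", task_id="load_data")
run_id = str(uuid.uuid4())  # placeholder UUID, an assumption for this sketch
print(format_parent_id("default", ti, run_id))
```

Passing a timestamp-style value as `run_id` makes `uuid.UUID` raise `ValueError`, which is the kind of mismatch this PR is meant to remove.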
Solution

As stated above, `name` and `run_id` should be populated in the expected format.

The change may break backward compatibility in cases where `lineage_parent_id` is used without keyword arguments, since the signature has changed from:

To:

That makes the signature of `lineage_parent_id` consistent with that of `lineage_run_id`.

`run_id` is now an optional parameter - it defaults to the one calculated via `lineage_run_id` for the given instance, and it can optionally be overridden.

One-line summary:

Fixed run format returned by the `lineage_parent_id` Airflow macro.

Checklist
SPDX-License-Identifier: Apache-2.0
Copyright 2018-2023 contributors to the OpenLineage project