-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Generate Avro schema from AnVIL schema #6109
Comments
Assignee to consider next steps. |
The current implementation generates the schema dynamically, based on the replica documents in the manifest currently being generated. This means that the schema could be different between manifests, depending on which files match the filter and which parts of the schema are expressed by the replicas connected to those files. This could lead to missing tables and missing columns in the workspace the manifest is exported to. While that may be considered by some a cosmetic issue or even a feature, entity relations can only be reliably expressed in the AvroPFB schema when deriving it from the foreign-key relationships in the expressed AnVIL schema. IOW, this is blocking #6066. |
For demo, perform two small handovers (the fewer rows, the better) for two different datasets, into two distinct workspaces. Show that the table schema is identical in both workspaces. |
@hannes-ucsc: "This was successfully demoed, albeit with different instructions. We had to look at the PFB schema and contents because Terra appears to drop empty tables and empty columns. An empty column is a scalar nullable column that is |
Screencaps of two Terra workspaces showing inconsistent tables and columns: Manifests reproducible in catalog Creating manifests with the same filters via the Swagger UI and inspecting the resulting PFB files on the command line confirmed that their schemas were identical. The tables missing from the first workspace (
And the missing columns from the biosample table are present in both the schema and the biosample entities:
|
I've alerted the BI about this on Slack. From the Azul perspective, this feature is complete and the issue can be closed, after reassigning the implementor and moving it to Stable. |
Description: #6109 (comment)
The text was updated successfully, but these errors were encountered: