-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
perf(matadata-io): neo4j generateLineageStatement use shortestPath #7219
Conversation
use neo4j shortestPath function to avoid nested/circle path
@aditya-radhakrishnan hi, Could you help me review this patch? |
Hey @shidianshifen sorry for the delay here! Have you been able to test these changes? Looks good to me though :) |
yes, i tested with no problem. and i have are around 12000+ entities and 12 hops query results from neo4j decrease from 7 million to actual 1500+ using shortestPath |
add missing '' for urn in neo4j cypher template
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Thank you for making this change :)
Will merge once CI is passing. |
@aditya-radhakrishnan Thanks very much for your review. |
@aditya-radhakrishnan this patch not merge in to master, should i do some else to make it happen? and neo4j graph backend not support for time filter in new release, i am prepare to commit a patch to make it work. what could be done next? |
Hey @shidianshifen this is true, Neo4j time filtering is not supported with the new release. We can merge this one separately and then collaborate on updating for time filtering! I will message you on Slack! |
CI is running now (not sure why it didn't before). Will merge once green, apologies it didn't go through before! |
…atahub-project#7219) Co-authored-by: Aseem Bansal <[email protected]>
…atahub-project#7219) Co-authored-by: Aseem Bansal <[email protected]>
…7219) Co-authored-by: Aseem Bansal <[email protected]>
nested/circle relation or lineage between tables as show below would lead to expensive match cypher results get from neo4j database
find all table_A downstream tables using current cypher:
results:
using neo4j shortestPath cypher become:
results:
The results pass to gms decreased significantly from 14 to 5. this could help to avoid gms oom and gc problem
Checklist