[ETL-409] Create python comparison script (#42)
* update code to use datacompy as the main comparison tool
* add comparison report for edge scenarios such as no data types in common and duplicates
* add function for converting csv to text for s3.put_object; add tests for getting all row diffs; add linting; adjust function to return the datacompy.Compare object to allow for more logging
* use getResolvedOptions from aws glue; adjust all args and the args validation function; remove unused library
* add logging of memory usage for datasets
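
The csv-to-text conversion mentioned in the list above might look roughly like the following. This is a minimal sketch, not the function from this commit: the name `convert_dataframe_to_text`, the example data, and the bucket and key in the trailing comment are all hypothetical, and it assumes the comparison output is held in a pandas DataFrame.

```python
import io

import pandas as pd


def convert_dataframe_to_text(dataset: pd.DataFrame) -> str:
    """Serialize a DataFrame to CSV text (hypothetical helper name)."""
    buffer = io.StringIO()
    # index=False keeps the pandas row index out of the uploaded file
    dataset.to_csv(buffer, index=False)
    return buffer.getvalue()


# Hypothetical example data
example = pd.DataFrame({"id": [1, 2], "value": ["a", "b"]})
csv_text = convert_dataframe_to_text(example)

# With a boto3 S3 client the text could then be uploaded, for example
# (bucket and key are placeholders):
# s3_client.put_object(Bucket="example-bucket", Key="reports/diff.csv", Body=csv_text)
```

Building the text in memory avoids writing a temporary file inside the Glue job before the upload.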
---------

Co-authored-by: Rixing Xu <[email protected]>
rxu17 and Rixing Xu authored Apr 27, 2023
1 parent 754fe3d commit 0ab9b99
Showing 5 changed files with 1,897 additions and 234 deletions.
3 changes: 3 additions & 0 deletions Pipfile
@@ -13,3 +13,6 @@ pre-commit = "*"
 sceptre = ">=3.2.0"
 sceptre-sam-handler = "*"
 synapseclient = "~=2.7"
+pandas = "<1.5"
+moto = "~=4.1"
+datacompy = "~=0.8"