-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: add deterministic hash methods to all types #573
Conversation
test: add tests to all new hash implementations
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #573 +/- ##
=========================================
Coverage 100.00% 100.00%
=========================================
Files 50 50
Lines 2915 2956 +41
=========================================
+ Hits 2915 2956 +41 ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great.
## [0.20.0](v0.19.0...v0.20.0) (2024-04-03) ### Features * add deterministic hash methods to all types ([#573](#573)) ([f6a3ca7](f6a3ca7)) * add fnn functionality ([#529](#529)) ([ce53153](ce53153)), closes [#522](#522) * add suffixes to models to indicate their task ([#588](#588)) ([d490dee](d490dee)) * added lag_plot ([#548](#548)) ([0fb38d2](0fb38d2)), closes [#519](#519) * added normal plot for time series ([#550](#550)) ([dbdf11e](dbdf11e)), closes [#549](#549) * when using from table to time series feature must be given ([#572](#572)) ([ca23f0f](ca23f0f)), closes [#571](#571) ### Bug Fixes * incorrect type hint for `number_of_bins` parameter ([#567](#567)) ([b434e53](b434e53)) * mark various API elements as internal ([#587](#587)) ([ea176fc](ea176fc)), closes [#582](#582) [#585](#585)
🎉 This PR is included in version 0.20.0 🎉 The release is available on:
Your semantic-release bot 📦🚀 |
Summary of Changes
__hash__
methods to the all types__eq__
method toTaggedTable
andTimeSeries
The
__hash__
-Implementation is needed to more efficiently check whether data may be the same.A deterministic implementation is used (based on
xxhash
, a fast non-cryptographic hash algorithm) to allow these comparisons to be performed across different interpreters (processes).The implementation of the Schema was updated to be deterministic.