fix #690 -- blob packing/unpacking of native python bool, int, float, and complex. #709

dimitri-yatsenko · 2019-11-22T23:16:08Z

… and complex

…t, float, and complex

guzman-raphael · 2019-12-26T16:24:42Z

datajoint/blob.py

+
+    @staticmethod
+    def pack_int(v):
+        return b"\x0a" + np.array(v, dtype='int64').tobytes()


Is there a reason why we did not utilize decimal packing here? Python int are essentially boundless (memory-dependent). I believe decimal packing would be a closer representation as the length would be encoded.

modified to support unbounded int

guzman-raphael · 2019-12-26T16:28:39Z

datajoint/blob.py

+
+    @staticmethod
+    def pack_float(v):
+        return b"\x0d" + np.array(v, dtype='float64').tobytes()


Is there a reason why we did not utilize decimal packing here? Python float have a precision of 53 bits which means we would be storing unnecessary additional data.

guzman-raphael

Would like for us to consider utilizing decimal packing so that we may store all int bits and only the necessary bits to properly represent other new types. Also, we should be careful to add documentation that this upgrade might require to be conducted as system-wide/user-wide. Consider the following scenario:

If users are relying on DJ to infer the data types, then if a current query is inserting a list such as [1,2,3] then previously this would be inserted as list(np.int64(1),np.int64(2),np.int64(3)). Now with this update it would inserted as list(int(1),int(2),int(3)). Since the update is backward compatible, all new users would be good with fetching data, however, users utilizing the previous DJ version would receive errors on a fetch using their same query as blob data now contains mixed packing. Since the error is on a previous version of DJ, the error message is somewhat vague e.g.

Unknown data structure code "
"

guzman-raphael · 2019-12-26T16:30:47Z

datajoint/blob.py

+
+    @staticmethod
+    def pack_complex(v):
+        return b"\x0c" + np.array(v, dtype='complex128').tobytes()


We could utilize decimal packing here for the same reasons as float below. Python seems to capture the first 53 bits for each the real part and the complex part.

here Python is not doing anything special and just uses the standard IEEE 754 encoding.

guzman-raphael · 2020-01-07T19:13:33Z

@dimitri-yatsenko Can you update datajoint-python/docs-parts/intro/Releases_lang1.rst?

Update release details

fix datajoint#690 -- blob packing/unpacking of native python bool, in…

681fb97

…t, float, and complex

dimitri-yatsenko changed the title ~~fix #690 -- blob packing/unpacking of native python bool, int, float,…~~ fix #690 -- blob packing/unpacking of native python bool, int, float, and complex. Nov 22, 2019

dimitri-yatsenko added 4 commits November 22, 2019 17:17

minor

a4e5382

reduce encoding length for native python types in blobs

e348426

Merge branch 'master' of https://github.com/datajoint/datajoint-python

9c2e419

ensure that np.number is encoded as a numpy scalar

86a2c2c

guzman-raphael self-requested a review December 23, 2019 19:39

guzman-raphael reviewed Dec 26, 2019

View reviewed changes

guzman-raphael requested changes Dec 26, 2019

View reviewed changes

dimitri-yatsenko added 9 commits December 30, 2019 09:10

Merge branch 'master' of https://github.com/datajoint/datajoint-python

231efe2

add support for unbounded integers in blob serialization

106239c

add test for unbounded integer

eadde37

update CHANGELOG and version for release 0.12.4

f1e6da6

correct computation of number of bits for unbounded integers in blobs

392d56a

fix unbounded integer encoding in blobs

61362e7

fix bug in LNX-docker-compose.yml

4a56d42

improve tests for adapted attributes

876d62a

update comment to use general data types rather than python-focused

8a3c9a1

guzman-raphael and others added 2 commits January 14, 2020 12:42

Update release details.

92f56ab

Merge pull request #7 from guzman-raphael/pr709

9be1115

Update release details

guzman-raphael approved these changes Jan 14, 2020

View reviewed changes

guzman-raphael merged commit a9aad89 into datajoint:master Jan 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix #690 -- blob packing/unpacking of native python bool, int, float, and complex. #709

fix #690 -- blob packing/unpacking of native python bool, int, float, and complex. #709

dimitri-yatsenko commented Nov 22, 2019

guzman-raphael Dec 26, 2019

dimitri-yatsenko Dec 30, 2019

guzman-raphael Dec 26, 2019

guzman-raphael left a comment

guzman-raphael Dec 26, 2019

dimitri-yatsenko Dec 30, 2019

guzman-raphael commented Jan 7, 2020

fix #690 -- blob packing/unpacking of native python bool, int, float, and complex. #709

fix #690 -- blob packing/unpacking of native python bool, int, float, and complex. #709

Conversation

dimitri-yatsenko commented Nov 22, 2019

guzman-raphael Dec 26, 2019

Choose a reason for hiding this comment

dimitri-yatsenko Dec 30, 2019

Choose a reason for hiding this comment

guzman-raphael Dec 26, 2019

Choose a reason for hiding this comment

guzman-raphael left a comment

Choose a reason for hiding this comment

guzman-raphael Dec 26, 2019

Choose a reason for hiding this comment

dimitri-yatsenko Dec 30, 2019

Choose a reason for hiding this comment

guzman-raphael commented Jan 7, 2020