Removing proto functionality from REST /predict endpoint [#803] #806

axsaucedo · 2019-08-15T15:40:34Z

This is the implementation to fix #803.

Once this is landed there will still be work to replicate across remaining endpoints (as per #607)

lennon310 · 2019-08-16T00:57:02Z

The profiler result shows parsing time reduces for REST but not GRPC, is it expected?

axsaucedo · 2019-08-16T07:37:49Z

@lennon310 that is correct, I've replied in #803, we should be able to merge this soon

lennon310 · 2019-08-19T14:50:12Z

Also I was wondering whether this change applies to the prepackaged server as well?

axsaucedo · 2019-08-19T16:15:05Z

@lennon310 yes, although the model servers will also be bumped a version when they are recompiled.

axsaucedo · 2019-08-22T09:54:31Z

Added a couple more tests. PR should be ready for review. @lennon310 could you provide a review as well given that you have been currently going through these pieces?

lennon310 · 2019-08-22T13:23:16Z

python/seldon_core/utils.py

@@ -397,8 +463,12 @@ def construct_response(user_model: SeldonComponent, is_request: bool, client_req
        raise SeldonMicroserviceException("Unknown data type returned as payload:" + client_raw_response)


-def extract_request_parts_json(request_raw: Union[Dict, List]) -> Tuple[
-    Union[np.ndarray, str, bytes, dict], Dict, prediction_pb2.DefaultData, str]:
+def extract_request_parts_json(request: Union[Dict, List]


This PR looks good to me.
Probably not related to this PR, but I was wondering why gRPC path should not go this way.
From my profiler commented in issue it shows in gRPC the input was first converted to datadef in SeldonMessage:

100 0.240 0.002 1.281 0.013 utils.py:234(array_to_grpc_datadef) 20100/100 0.038 0.000 1.041 0.010 utils.py:280(array_to_list_value)

then in predict function it is converted back to Numpy array:

100 0.000 0.000 2.781 0.028 utils.py:505(extract_request_parts) 100 0.001 0.000 2.778 0.028 utils.py:120(get_data_from_proto) 100 0.001 0.000 2.778 0.028 utils.py:147(grpc_datadef_to_array)

The round-trip conversion seems double the latency I guess?

axsaucedo · 2019-08-22T16:18:17Z

@lennon310 the conversion is necessary as in GRPC the request is received as Proto, and needs to be converted into a numpy array before it's passed to the python wrapper. You can access the raw proto through the predict_raw function, but if you want to access the data in a readable format you will eventually have to convert it. Similarly when sending the response, it's necessary to convert the output (which is often a numpy array) into a Proto - which in turn requires that conversion.

We're happy to take suggestions but if you have a look at the functions, it's basically converting the Proto directly to numpy array. If you run some alternative implementations of methods to convert numpy arrays to/from proto objects that are more efficient, we would certainly be keen to introduce them.

Added initial implementation with tests

8c352a9

axsaucedo requested a review from ukclivecox August 15, 2019 15:40

seldondev added the size/L label Aug 15, 2019

axsaucedo added 2 commits August 15, 2019 16:43

removed print statements

47f2d8a

Added seldon core utils

d225ea8

Added tests

4918ff9

lennon310 reviewed Aug 22, 2019

View reviewed changes

Merged with master

1767430

axsaucedo merged commit 1baab0f into SeldonIO:master Aug 22, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Removing proto functionality from REST /predict endpoint [#803] #806

Removing proto functionality from REST /predict endpoint [#803] #806

axsaucedo commented Aug 15, 2019

lennon310 commented Aug 16, 2019

axsaucedo commented Aug 16, 2019

lennon310 commented Aug 19, 2019

axsaucedo commented Aug 19, 2019

axsaucedo commented Aug 22, 2019

lennon310 Aug 22, 2019

axsaucedo commented Aug 22, 2019

Removing proto functionality from REST /predict endpoint [#803] #806

Removing proto functionality from REST /predict endpoint [#803] #806

Conversation

axsaucedo commented Aug 15, 2019

lennon310 commented Aug 16, 2019

axsaucedo commented Aug 16, 2019

lennon310 commented Aug 19, 2019

axsaucedo commented Aug 19, 2019

axsaucedo commented Aug 22, 2019

lennon310 Aug 22, 2019

Choose a reason for hiding this comment

axsaucedo commented Aug 22, 2019