Deploy python model without java router #314

epa095 · 2018-11-27T13:34:52Z

We want to deploy a large number of models (modelling different things, not different versions of the same thing) using the python wrapper, and in many cases we just want the model, not any transformer or fancy AB routing. In our case we see that the Python model takes roughly 200Mb of memory, while the Java Spring application which is also deployed takes roughly 700 Mb. Is it possible to write a seldondeployment which does not deploy the java application for the cases when we just want to deploy a "naked" model? The relevant part of our seldondeployment is:

predictors: 
  - 
    annotations: 
      predictor_version: "v1"
    componentSpecs: 
      - spec: 
          containers: 
            - image: "..some-docker-image"
              name: "modelserver-{{auto-generated-name}}"
    graph: 
      children: []
      endpoint: 
        type: "REST"
      name: "modelserver-{{auto-generated-name}}"
      type: "MODEL"
    name: "modelserver-{{auto-generated-name}}"
    replicas: 1

ukclivecox · 2018-11-27T13:43:24Z

I think there are a couple of things here:

Allowing direct access via the API for single model scenarios like yours where a service orchestrator is not required. This is in our roadmap. We need to conform the APIs internal and external to be the same so they can be used in this way and then allow an annotation to wire things up for the case where no service orchestrator is used.
For the size of the running images, yes we want to ensure smaller image footprints. You can set the JAVA_OPTS for the engine to decrease the default JVM usage.

ukclivecox · 2019-08-21T08:36:58Z

Close for now. Please reopen if still relevant.

* small update to docs * add x-request-id * lint * review fixes

ukclivecox added this to the 0.3.x milestone Jan 27, 2019

ukclivecox added Operator Service Orchestrator and removed Service Orchestrator labels Jan 27, 2019

ukclivecox mentioned this issue Jan 27, 2019

Allow single model graphs without Orchestrator #207

Closed

ukclivecox changed the title ~~Deploy python model withouth java router~~ Deploy python model without java router Jan 28, 2019

ukclivecox closed this as completed Aug 21, 2019

agrski pushed a commit that referenced this issue Dec 2, 2022

Ensure X-Request-ID is returned and allow CLI inspect to use (#314)

91f59f1

* small update to docs * add x-request-id * lint * review fixes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deploy python model without java router #314

Deploy python model without java router #314

epa095 commented Nov 27, 2018

ukclivecox commented Nov 27, 2018

ukclivecox commented Aug 21, 2019

Deploy python model without java router #314

Deploy python model without java router #314

Comments

epa095 commented Nov 27, 2018

ukclivecox commented Nov 27, 2018

ukclivecox commented Aug 21, 2019