Deploy python model without java router #314

Closed
epa095 opened this issue Nov 27, 2018 · 2 comments

epa095 commented Nov 27, 2018

We want to deploy a large number of models (each modelling something different, not different versions of the same thing) using the Python wrapper, and in many cases we just want the model, without any transformer or fancy A/B routing. In our case the Python model takes roughly 200 MB of memory, while the Java Spring application that is deployed alongside it takes roughly 700 MB. Is it possible to write a SeldonDeployment that does not deploy the Java application for cases where we just want to deploy a "naked" model? The relevant part of our SeldonDeployment is:

predictors: 
  - 
    annotations: 
      predictor_version: "v1"
    componentSpecs: 
      - spec: 
          containers: 
            - image: "..some-docker-image"
              name: "modelserver-{{auto-generated-name}}"
    graph: 
      children: []
      endpoint: 
        type: "REST"
      name: "modelserver-{{auto-generated-name}}"
      type: "MODEL"
    name: "modelserver-{{auto-generated-name}}"
    replicas: 1

@ukclivecox (Contributor)

I think there are a couple of things here:

  • Allowing direct access via the API for single-model scenarios like yours, where a service orchestrator is not required. This is on our roadmap. We need to make the internal and external APIs the same so they can be used in this way, and then add an annotation to wire things up for the case where no service orchestrator is used.
  • For the size of the running images: yes, we want to ensure smaller image footprints. You can set JAVA_OPTS for the engine to reduce the default JVM memory usage.
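
As a rough illustration of the JAVA_OPTS suggestion: JVM heap size is normally capped with the standard `-Xms` (initial heap) and `-Xmx` (maximum heap) flags. The fragment below is only a sketch. It assumes the engine's Java options can be supplied through a `seldon.io/engine-java-opts` annotation placed on the predictor. Neither the annotation name, its placement, nor the heap values come from this thread, so verify them against the documentation for the Seldon Core version in use.

```yaml
predictors:
  -
    annotations:
      predictor_version: "v1"
      # Assumption: annotation name and placement; not confirmed in this thread.
      # -Xms / -Xmx are standard JVM flags for initial and maximum heap size.
      # The values are illustrative, not tuned recommendations.
      seldon.io/engine-java-opts: "-Xms128m -Xmx256m"
```

If the heap limit is set too low for the engine's actual working set, the JVM will fail with an out-of-memory error, so it is worth lowering the value gradually while watching the pod's memory usage.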

ukclivecox added this to the 0.3.x milestone on Jan 27, 2019
ukclivecox changed the title from "Deploy python model withouth java router" to "Deploy python model without java router" on Jan 28, 2019

@ukclivecox (Contributor)

Closing for now. Please reopen if this is still relevant.

agrski pushed a commit that referenced this issue Dec 2, 2022
* small update to docs

* add x-request-id

* lint

* review fixes