Skip to content

Latest commit

 

History

History
80 lines (51 loc) · 4.57 KB

README.md

File metadata and controls

80 lines (51 loc) · 4.57 KB

Contents of this Document

TorchServe Examples

The following are examples on how to create and serve model archives with TorchServe.

Creating mar file for eager mode model

Following are the steps to create a torch-model-archive (.mar) to execute an eager mode torch model in TorchServe :

  • Pre-requisites to create a torch model archive (.mar) :

    • serialized-file (.pt) : This file represents the state_dict in case of eager mode model.
    • model-file (.py) : This file contains model class extended from torch nn.modules representing the model architecture. This parameter is mandatory for eager mode models. This file must contain only one class definition extended from torch.nn.modules
    • index_to_name.json : This file contains the mapping of predicted index to class. The default TorchServe handles returns the predicted index and probability. This file can be passed to model archiver using --extra-files parameter.
    • version : Model's version.
    • handler : TorchServe default handler's name or path to custom inference handler(.py)
  • Syntax

    torch-model-archiver --model-name <model_name> --version <model_version_number> --model-file <path_to_model_architecture_file> --serialized-file <path_to_state_dict_file> --handler <path_to_custom_handler_or_default_handler_name> --extra-files <path_to_index_to_name_json_file>

Creating mar file for torchscript mode model

Following are the steps to create a torch-model-archive (.mar) to execute an eager mode torch model in TorchServe :

  • Pre-requisites to create a torch model archive (.mar) :

    • serialized-file (.pt) : This file represents the state_dict in case of eager mode model or an executable ScriptModule in case of TorchScript.
    • index_to_name.json : This file contains the mapping of predicted index to class. The default TorchServe handles returns the predicted index and probability. This file can be passed to model archiver using --extra-files parameter.
    • version : Model's version.
    • handler : TorchServe default handler's name or path to custom inference handler(.py)
  • Syntax

    torch-model-archiver --model-name <model_name> --version <model_version_number> --serialized-file <path_to_executable_script_module> --extra-files <path_to_index_to_name_json_file> --handler <path_to_custom_handler_or_default_handler_name>

Example torchvision image classification models in TorchServe (Eager mode)

The following example demonstrates how to create image classifier model archive, serve it on TorchServe and run image prediction using TorchServe's default image_classifier handler :

Example to serve a Custom Model with Custom Service Handler (Eager mode)

The following example demonstrates how to create and serve a custom NN model with custom handler archives in TorchServe :

Example to serve Text classification model (Eager mode)

The following example demonstrates how to create and serve a custom text_classification NN model with default text_classifer handler provided by TorchServe :

Example to serve Object Detection model (Eager mode)

The following example demonstrates how to create and serve a pretrained fast-rcnn NN model with default object_detector handler provided by TorchServe :

Example to serve Image Segmentation model (Eager mode)

The following example demonstrates how to create and serve a pretrained fcn NN model with default image_segmenter handler provided by TorchServe :

Example to serve Huggingface Transformers (Torchscripted)

The following example demonstrates how to create and serve a pretrained transformer models from Huggingface such as BERT, RoBERTA, XLM :