caikit.runtime.servicers.model_runtime_servicer =============================================== .. py:module:: caikit.runtime.servicers.model_runtime_servicer Attributes ---------- .. autoapisummary:: caikit.runtime.servicers.model_runtime_servicer.log Classes ------- .. autoapisummary:: caikit.runtime.servicers.model_runtime_servicer.ModelRuntimeServicerImpl Module Contents --------------- .. py:data:: log .. py:class:: ModelRuntimeServicerImpl(interrupter: caikit.runtime.work_management.abortable_context.ThreadInterrupter = None) Bases: :py:obj:`caikit.runtime.protobufs.model_runtime_pb2_grpc.ModelRuntimeServicer` This class contains the implementation of all of the RPCs that are required to run a service in Model Mesh as a Model-Runtime. .. py:attribute:: model_manager :value: None .. py:attribute:: interrupter :value: None .. py:method:: loadModel(request, context) Model loading . Args: request(model_runtime_pb2.LoadModelRequest): gRPC request, gen from model-runtime.proto context(grpc.ServicerContext): Context object (contains request metadata, etc) Returns: model_runtime_pb2.LoadModelResponse: Gen from model-runtime.proto .. py:method:: unloadModel(request, context) Model unloading. Args: request(model_runtime_pb2.UnloadModelRequest): gRPC request, gen from model-runtime.proto context(grpc.ServicerContext): Context object (contains request metadata, etc) Returns: model_runtime_pb2.UnloadModelResponse: Gen from model-runtime.proto .. py:method:: predictModelSize(request, context) Predict size of not-yet-loaded model Args: request(model_runtime_pb2.PredictModelSizeRequest): gRPC request, gen from model-runtime.proto context(grpc._server._Context): Context object (contains request metadata, etc) Returns: model_runtime_pb2.PredictModelSizeResponse: Gen from model-runtime.proto .. py:method:: modelSize(request, context) Compute size (memory consumption) of currently-loaded model Args: request(model_runtime_pb2.ModelSizeRequest): gRPC request, gen from model-runtime.proto context(grpc._server._Context): Context object (contains request metadata, etc) Returns: model_runtime_pb2.ModelSizeResponse: Gen from model-runtime.proto .. py:method:: runtimeStatus(request, context) Runtime status checking. Args: request(model_runtime_pb2.RuntimeStatusRequest): GRPC request, gen from model-runtime.proto context(grpc.ServicerContext): Context object (contains request metadata, etc) Returns: model_runtime_pb2.RuntimeStatusResponse: Gen from model-runtime.proto