In this guide, we illustrate the steps needed to wrap your own Java model in a docker image ready for deployment with Seldon Core using source-to-image app s2i.
If you are not familar with s2i you can read general instructions on using s2i and then follow the steps below.
- Prequisites for using s2i are:
- Docker
- Git (if building from a remote git repo)
To check everything is working you can run
s2i usage seldonio/seldon-core-s2i-java-build:0.1
To use our s2i builder image to package your Java model you will need:
- A Maven project that depends on
io.seldon.wrapper
library - A Spring Boot configuration class
- A class that implements
io.seldon.wrapper.SeldonPredictionService
for the type of component you are creating - .s2i/environment - model definitions used by the s2i builder to correctly wrap your model
We will go into detail for each of these steps:
Create a Spring Boot Maven project and include the dependency:
<dependency>
<groupId>io.seldon.wrapper</groupId>
<artifactId>seldon-core-wrapper</artifactId>
<version>0.1.1</version>
</dependency>
A full example can be found at wrappers/s2i/java/test/model-template-app/pom.xml
.
Create a main App class:
- Add @EnableAsync annotation (to allow the embedded gRPC server to start at Spring Boot startup)
- include the
io.seldon.wrapper
in the scan base packages list along with your App's package, in the example below the Apps's package isio.seldon.example
. - Import the config class at
io.seldon.wrapper.config.AppConfig.class
For example:
@EnableAsync
@SpringBootApplication(scanBasePackages = {"io.seldon.wrapper","io.seldon.example"})
@Import({ io.seldon.wrapper.config.AppConfig.class })
public class App {
public static void main(String[] args) throws Exception {
SpringApplication.run(App.class, args);
}
}
To handle requests to your model or other component you need to implement one or more of the methods in io.seldon.wrapper.SeldonPredictionService
, in particular:
default public SeldonMessage predict(SeldonMessage request);
default public SeldonMessage route(SeldonMessage request);
default public SeldonMessage sendFeedback(Feedback request);
default public SeldonMessage transformInput(SeldonMessage request);
default public SeldonMessage transformOutput(SeldonMessage request);
default public SeldonMessage aggregate(SeldonMessageList request);
Your implementing class should be created as a Spring Component so it will be managed by Spring. There is a full H2O example in examples/models/h2o_mojo/src/main/java/io/seldon/example/h2o/model
, whose implmentation is show below:
@Component
public class H2OModelHandler implements SeldonPredictionService {
private static Logger logger = LoggerFactory.getLogger(H2OModelHandler.class.getName());
EasyPredictModelWrapper model;
public H2OModelHandler() throws IOException {
MojoReaderBackend reader =
MojoReaderBackendFactory.createReaderBackend(
getClass().getClassLoader().getResourceAsStream(
"model.zip"),
MojoReaderBackendFactory.CachingStrategy.MEMORY);
MojoModel modelMojo = ModelMojoReader.readFrom(reader);
model = new EasyPredictModelWrapper(modelMojo);
logger.info("Loaded model");
}
@Override
public SeldonMessage predict(SeldonMessage payload) {
List<RowData> rows = H2OUtils.convertSeldonMessage(payload.getData());
List<AbstractPrediction> predictions = new ArrayList<>();
for(RowData row : rows)
{
try
{
BinomialModelPrediction p = model.predictBinomial(row);
predictions.add(p);
} catch (PredictException e) {
logger.info("Error in prediction ",e);
}
}
DefaultData res = H2OUtils.convertH2OPrediction(predictions, payload.getData());
return SeldonMessage.newBuilder().setData(res).build();
}
}
The above code:
- loads a model from the local resources folder on startup
- Converts the proto buffer message into H2O RowData using provided utility classes.
- Runs a BionomialModel prediction and converts the result back into a
SeldonMessage
for return
We provide H2O utility class io.seldon.wrapper.utils.H2OUtils
in seldon-core-wrapper to convert to and from the seldon-core proto buffer message types.
We provide a DL4J utility class io.seldon.wrapper.utils.DL4JUtils
in seldon-core-wrapper to convert to and from the seldon-core proto buffer message types.
Define the core parameters needed by our R builder image to wrap your model. An example is:
API_TYPE=REST
SERVICE_TYPE=MODEL
These values can also be provided or overriden on the command line when building the image.
Use s2i build
to create your Docker image from source code. You will need Docker installed on the machine and optionally git if your source code is in a public git repo.
Using s2i you can build directly from a git repo or from a local source folder. See the s2i docs for further details. The general format is:
s2i build <git-repo> seldonio/seldon-core-s2i-java-build:0.1 <my-image-name> --runtime-image seldonio/seldon-core-s2i-java-runtime:0.1
s2i build <src-folder> seldonio/seldon-core-s2i-java-build:0.1 <my-image-name> --runtime-image seldonio/seldon-core-s2i-java-runtime:0.1
An example invocation using the test template model inside seldon-core:
s2i build https://github.com/seldonio/seldon-core.git --context-dir=wrappers/s2i/python/test/model-template-app seldonio/seldon-core-s2i-java-build:0.1 h2o-test:0.1 --runtime-image seldonio/seldon-core-s2i-java-runtime:0.1
The above s2i build invocation:
- uses the GitHub repo: https://github.com/seldonio/seldon-core.git and the directory
wrappers/s2i/R/test/model-template-app
inside that repo. - uses the builder image
seldonio/seldon-core-s2i-java-build
- uses the runtime image
seldonio/seldon-core-s2i-java-runtime
- creates a docker image
seldon-core-template-model
For building from a local source folder, an example where we clone the seldon-core repo:
git clone https://github.com/seldonio/seldon-core.git
cd seldon-core
s2i build wrappers/s2i/R/test/model-template-app seldonio/seldon-core-s2i-java-build:0.1 h2o-test:0.1 --runtime-image seldonio/seldon-core-s2i-java-runtime:0.1
For more help see:
s2i usage seldonio/seldon-core-s2i-java-build:0.1
s2i build --help
The required environment variables understood by the builder image are explained below. You can provide them in the .s2i/enviroment
file or on the s2i build
command line.
API type to create. Can be REST or GRPC.
The service type being created. Available options are:
- MODEL
- ROUTER
- TRANSFORMER
- COMBINER