Skip to main content
Version: master πŸƒ

Submarine Server Implementation

Architecture Overview​

    +---------------Submarine Server ---+
| |
| +------------+ +------------+ |
| |Web Svc/Prxy| |Backend Svc | | +--Submarine Asset +
| +------------+ +------------+ | |Project/Notebook |
| ^ ^ | |Model/Metrics |
+---|---------|---------------------+ |Libraries/Dataset |
| | +------------------+
| |
| +--|-Compute Cluster 1---+ +--Image Registry--+
+ | | | | User's Images |
User / | + | | |
Admin | User Notebook Instance | +------------------+
| Experiment Runs |
+------------------------+ +-Data Storage-----+
| S3/HDFS, etc. |
+----Compute Cluster 2---+ | |
+------------------+
...

Here's a diagram to illustrate the Submarine's deployment.

  • Submarine Server consists of web service/proxy, and backend services. They're like "control planes" of Submarine, and users will interact with these services.
  • Submarine server could be a microservice architecture and can be deployed to one of the compute clusters. (see below, this will be useful when we only have one cluster).
  • There're multiple compute clusters that could be used by Submarine service. For user's running notebook instance, jobs, etc. they will be placed to one of the compute clusters by user's preference or defined policies.
  • Submarine's asset includes project/notebook(content)/models/metrics/dataset-meta, etc. can be stored inside Submarine's own database.
  • Datasets can be stored in various locations such as S3/HDFS.
  • Users can push container (such as Docker) images to a preconfigured registry in Submarine, so Submarine service can know how to pull required container images.
  • Image Registry/Data-Storage, etc. are outside of Submarine server's scope and should be managed by 3rd party applications.

Submarine Server and its APIs​

Submarine server is designed to allow data scientists to access notebooks, submit/manage jobs, manage models, create model training workflows, access datasets, etc.

Submarine Server exposed UI and REST API. Users can also use CLI / SDK to manage assets inside Submarine Server.

           +----------+
| CLI |+---+
+----------+ v +----------------+
+--------------+ | Submarine |
+----------+ | REST API | | |
| SDK |+>| |+> Server |
+----------+ +--------------+ | |
^ +----------------+
+----------+ |
| UI |+---+
+----------+

REST API will be used by the other 3 approaches. (CLI/SDK/UI)

The REST API Service handles HTTP requests and is responsible for authentication. It acts as the caller for the JobManager component.

The REST component defines the generic job spec which describes the detailed info about job. For more details, refer to here. (Please note that we're converting REST endpoint description from Java-based REST API to swagger definition, once that is done, we should replace the link with swagger definition spec).

Proposal​


+-----------+
| |
| workbench +---+ +----------------------------------+
| | | | +------+ +---------------------+ |
+-----------+ | | | | | +-------+ | | +---------------------+
| | | | | | K8s | | | | +--------+ +----+ |
+-----------+ | | | | | +-------+ | | | | +-->+job1| |
| | | | | | | submitter | | | | | +----+ |
| CLI +------>+ | REST | +---------------------+ +---->+ |operator| +----+ |
| | | | | | +---------------------+ | | | +-->+job2| |
+-----------+ | | | | | +-------+ +-------+ | | | +--------+ +----+ |
| | | | | |PlugMgr| |monitor| | | | K8s Cluster |
+-----------+ | | | | | +-------+ +-------+ | | +---------------------+
| | | | | | | JobManager | |
| SDK +---+ | +------+ +---------------------+ |
| | +----------------------------------+
+-----------+
client server

We propose to split the original core module in the old layout into two modules, CLI and server as shown in FIG. The submarine-client calls the REST APIs to submit and retrieve the job info. The submarine-server provides the REST service, job management, submitting the job to cluster, and running job in different clusters through the corresponding runtime.

Submarine Server Components​


+----------------------Submarine Server--------------------------------+
| +-----------------+ +------------------+ +--------------------+ |
| | Experiment | |Notebook Session | |Environment Mgr | |
| | Mgr | |Mgr | | | |
| +-----------------+ +------------------+ +--------------------+ |
| |
| +-----------------+ +------------------+ +--------------------+ |
| | Model Registry | |Model Serving Mgr | |Compute Cluster Mgr | |
| | | | | | | |
| +-----------------+ +------------------+ +--------------------+ |
| |
| +-----------------+ +------------------+ +--------------------+ |
| | DataSet Mgr | |User/Team | |Metadata Mgr | |
| | | |Permission Mgr | | | |
| +-----------------+ +------------------+ +--------------------+ |
+----------------------------------------------------------------------+

Experiment Manager​

TODO

Notebook Sessions Manager​

TODO

Environment Manager​

TODO

Model Registry​

TODO

Model Serving Manager​

TODO

Compute Cluster Manager​

TODO

Dataset Manager​

TODO

User/team permissions manager​

TODO

Metadata Manager​

TODO

Components/services outside of Submarine Server's scope​

TODO: Describe what are the out-of-scope components, which should be handled and managed outside of Submarine server. Candidates are: Identity management, data storage, metastore storage, etc.