Support and Certification Matrix
The platform comes pre-deployed with proprietary and third-party open-source tools and libraries, which are exposed as application services and managed using Kubernetes. Users can view and manage relevant services from the platform dashboard using a self-service model. (Note that some services that don't require user intervention aren't visible in the dashboard.) Users can also enhance their development experience by independently installing additional software and running it on top of the platform services. For more information, see The Platform's Application Services.
The platform has two types of managed application services:
- Default services
- Several service instances, such as Presto and the web APIs, are spawned automatically when the platform starts and have a tenant-wide scope (i.e., they're accessible to all tenant users with service permissions). Users can't delete the default services, but service administrators can disable or restart these services and modify some service configurations.
- User-defined services
- Service administrators can create a variety of new service instances for certified services, such as Spark and Jupyter Notebook. Except where otherwise specified, a user-defined service is assigned to a specific running user but can optionally also be shared with all other tenant users with service permissions.
Pre-deployed Application Services and Tools
The following software packages, services, and tools are pre-deployed as part of the default version 3.0.1 platform installation:
Conda | Dashboard | Docker Registry | Elasticsearch | Frames | Grafana | Hadoop | Hive | Horovod | Jupyter | Kubernetes | Log Forwarder | MLRun | Monitoring | MPI Operator | Nuclio Serverless Framework | Operating System | Pipelines | Presto | OAuth2 (OIDC) Authenticator | Prometheus | Spark | Spark Operator | TSDB CLI (V3IO) | TSDB Nuclio Functions | V3IO Daemon | Web APIs | Web Shell | Zeppelin
- Open-source tools and related services are subject to open-source restrictions.
- See Application Library Versions for programming application library versions that are used or certified for usage by relevant services, such as Nuclio and Jupyter Notebook.
|Dashboard||Default||The platform's graphical user interface.||3.0.1|
|Kubernetes||Default||The Kubernetes (k8s) container orchestration system for automating deployment, scaling, and management of containerized applications. Application services in the platform run on top of Kubernetes.||k8s 1.17.14
|Nuclio Serverless Framework
|Default||Iguazio's Nuclio Enterprise Edition serverless framework for development, deployment, and execution of serverless functions for real-time data processing and analysis.
The Nuclio dashboard is available as part of the
|MLRun||Default||Iguazio's MLRun (https://github.com/mlrun/mlrun/tree/release/v0.6.x-latest/) open-source machine-learning operations (MLOps) orchestration framework for automating and tracking data science tasks and full workflows, including integration with Kubeflow Pipelines and the Nuclio serverless framework. For more information, see Data Science Automation (MLOps) Services.||0.6|
|Pipelines||Default||The Google Kubeflow Pipelines open-source framework for building and deploying portable, scalable machine learning (ML) workflows based on Docker containers. For more information, see Data Science Automation (MLOps) Services.||1.0.1|
|Web APIs||Default||The platform's web-APIs (web-gateway) service, which provides access to its web APIs.||3.0.1|
|Jupyter||User-defined||The JupyterLab UI, including the Jupyter Notebook web application and shell terminals and the Conda binary package and environment manager. For more information, see The Jupyter Notebook Service. See also the Jupyter application-libraries compatibility matrix.||JupyterLab 2.2.0|
|V3IO TSDB CLI||Default||The Iguazio V3IO Time-Series Database (TSDB) command-line interface (CLI) tool (
|V3IO TSDB Nuclio Functions||Default||The Iguazio V3IO TSDB Nuclio functions — Nuclio serverless functions that simplify working with the V3IO TSDB library.||0.6
V3IO TSDB 0.11
|V3IO Prometheus||User-defined||The Iguazio V3IO Prometheus distribution, which provides a version of the Prometheus systems monitoring and alerting toolkit that is packaged with the V3IO TSDB library and can be used to query time-series databases in the platform.||
|V3IO Frames||User-defined||The platform's V3IO Frames service, which provides access to the Frames API: an open-source, unified, high-performance Python DataFrame API for accessing NoSQL, stream, and time-series data in the platform's data store.||0.8 (server)
0.8 (supported client)
The Apache Spark data-processing engine, including the following libraries:
|Spark Operator||Default||The spark-on-k8s-operator Kubernetes Operator for Spark ("Spark Operator"), which simplifies the submission and scheduling of Spark jobs. This service is designed to be used via the MLRun Spark Operator API.||2.4.5|
|Presto||Default||The Presto distributed SQL query engine for big data.||332|
|Hive Metastore||Internal||An internal Apache Hive Metastore service that can be enabled for the Presto service to allow saving views and using the Presto Hive connector.||2.3.2|
|Horovod / MPI Operator
|Default||Distributed training using Kubeflow MPI Operator and Uber's Horovod distributed deep-learning framework for creating machine-learning models that are trained simultaneously over multiple GPUs or CPUs. For more information, see The MPI-Operator Horovod Service and Running Applications over GPUs.||0.2.3|
|Log Forwarder||Default||A platform service that uses Filebeat to forward application-service logs to be stored and indexed in an instance of the Elasticsearch search and analytics engine.
Note that this default service is disabled by default because you need to configure the URL of an Elasticsearch service for storing and indexing the logs. For more information, see Logging, Monitoring, and Debugging.
Supports Elasticsearch 7.10
|Monitoring||Default||A platform service for monitoring application services and gathering performance statistics and additional data. The gathered data is visualized on Grafana dashboards using the platform's Grafana services. For more information, see Monitoring Platform Services.||3.5|
|Grafana||User-defined||The Grafana analytics and monitoring platform.
In cloud platform environments, Grafana is currently available as a shared single-instance tenant-wide service.
The platform also has a shared single-instance tenant-wide application-cluster Grafana service with monitoring dashboards for the entire Kubernetes application cluster, which isn't visible on the
|Docker Registry||Default and user-defined||A platform service for working with a Docker Registry, which is used by the Nuclio service to store the function images.
You can create a Docker Registry service and configure it to work with a remote off-cluster Docker Registry.
On the default tenant, the Nuclio service is configured by default to work with a pre-deployed default tenant-wide
|OAuth2 (OIDC) Authenticator
|Default||A federated OpenID Connect (OIDC) provider over OAuth2. This service is used for OAuth2 authentication of user access to Nuclio API gateways and shared Grafana services, including access by external (non-platform) users.||2.23.0|
|Web Shell||User-defined||A platform service that provides a web-based command-line shell ("web shell") for running application services — such as Spark jobs and Presto queries — and performing basic file-system operations.
Note that this isn't a fully functional Linux shell. For more information, see The Web-Shell Service.
|Zeppelin||User-defined||The Apache Zeppelin web-based notebook.||0.8.2|
|Hadoop||Default||The Apache Hadoop distributed data-processing library. For more information, see The Hadoop Service.||2.9.0|
|V3IO Daemon||Default (internal)||An internal service for integrating the platform with external applications by using the platform's V3IO library.||3.0.1|
|Operating System||Internal||The CentOS Linux operating system.||7.6|
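As a rough illustration of how the version column above might be consulted programmatically, the following Python sketch copies a few of the listed versions into a dictionary and checks a reported version against them. The dictionary layout and the helper names (`CERTIFIED_VERSIONS`, `parse_version`, `matches_certified`) are assumptions made for this example and are not part of any platform API.

```python
# Versions copied from the table above. The dictionary layout and the
# helper names are illustrative assumptions, not a platform API.
CERTIFIED_VERSIONS = {
    "Kubernetes": "1.17.14",
    "JupyterLab": "2.2.0",
    "Presto": "332",
    "Zeppelin": "0.8.2",
    "Hadoop": "2.9.0",
    "CentOS": "7.6",
}


def parse_version(text):
    """Turn a dotted numeric version such as '1.17.14' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def matches_certified(component, reported):
    """Return True if the reported version begins with the certified one.

    Only as many fields as the table lists are compared, so a reported
    '7.6.1810' matches a certified '7.6'.
    """
    certified = parse_version(CERTIFIED_VERSIONS[component])
    return parse_version(reported)[:len(certified)] == certified


if __name__ == "__main__":
    for name, reported in [("CentOS", "7.6.1810"), ("Kubernetes", "1.18.2")]:
        status = "matches" if matches_certified(name, reported) else "differs from"
        print(f"{name} {reported} {status} certified {CERTIFIED_VERSIONS[name]}")
```

The prefix comparison is a deliberate simplification: it treats any patch level beyond the listed fields as compatible, which may or may not reflect the actual certification policy.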
Application Library Versions
The following table provides information about the versions of application libraries (packages) that are used or certified for usage with different pre-deployed platform tools and services:
|Service||Pre-deployed and Certified Application Library Versions|
|Platform API Libraries||Java 8
|Nuclio||Python 3.6, 3.7, and 2.7|
Iguazio platform tutorials (v3io/tutorials) 3.0
Iguazio V3IO Python SDK (v3io/v3io-py) 0.5 (Python 3.5–3.8)
Nuclio Jupyter package (nuclio/nuclio-jupyter) 0.8
NVIDIA CUDA 11.0
NVIDIA RAPIDS 0.17
Frames client 0.8
|Web Shell||Python 3.7
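The Python versions listed above can be compared against the running interpreter before deploying code to a given service. The sketch below is a minimal example under stated assumptions: the mapping and function names are hypothetical, and only the versions that appear in the table (Nuclio, the web shell, and the Python range noted for the V3IO Python SDK 0.5) are encoded.

```python
import sys

# Python versions taken from the table above. The mapping keys and the
# helper names are hypothetical and exist only for this example.
CERTIFIED_PYTHON = {
    "Nuclio": {(3, 6), (3, 7), (2, 7)},
    "Web Shell": {(3, 7)},
}

# The V3IO Python SDK 0.5 entry lists an inclusive range, Python 3.5 to 3.8.
V3IO_PY_RANGE = ((3, 5), (3, 8))


def python_certified_for(service, version=None):
    """Check whether a (major, minor, ...) version is listed for the service.

    Defaults to the interpreter that is running this code.
    """
    major_minor = tuple((version or sys.version_info)[:2])
    return major_minor in CERTIFIED_PYTHON[service]


def v3io_py_supports(version=None):
    """Check whether a version falls inside the range listed for v3io-py 0.5."""
    major_minor = tuple((version or sys.version_info)[:2])
    low, high = V3IO_PY_RANGE
    return low <= major_minor <= high


if __name__ == "__main__":
    print("Nuclio:", python_certified_for("Nuclio"))
    print("Web Shell:", python_certified_for("Web Shell"))
    print("v3io-py 0.5:", v3io_py_supports())
```

Comparing only the major and minor fields mirrors how the table states the versions; patch releases are not distinguished.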
Integration with Additional Tools
You can independently install additional software tools — such as TensorFlow, PyTorch, or scikit-learn — and use them on top of the platform services. You can also configure remotely installed tools — such as Tableau or Looker — to analyze and visualize data in the platform. In addition, you can use Conda (which is available as part of the platform's Jupyter Notebook service) and pip (which is available as part of the Jupyter Notebook, Zeppelin, and web-shell services) to install Python packages. For more information, see The Platform's Application Services.
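For example, an additional package could be installed with pip from a Jupyter Notebook or web-shell session along the following lines. This is a sketch rather than a platform-provided utility: the helper names are hypothetical, and the install step is skipped unless `dry_run=False` is passed.

```python
import importlib.util
import subprocess
import sys


def pip_install_command(package, version=None):
    """Build a pip command that targets the interpreter running this code."""
    requirement = f"{package}=={version}" if version else package
    return [sys.executable, "-m", "pip", "install", requirement]


def is_importable(module_name):
    """Return True if a top-level module can be found without importing it."""
    return importlib.util.find_spec(module_name) is not None


def ensure_package(package, module_name, version=None, dry_run=True):
    """Install the package only if its module is missing.

    Returns the pip command that was (or would be) run, or None if the
    module is already importable. With dry_run=True nothing is installed.
    """
    if is_importable(module_name):
        return None
    command = pip_install_command(package, version)
    if not dry_run:
        subprocess.run(command, check=True)
    return command


if __name__ == "__main__":
    # scikit-learn is distributed as "scikit-learn" but imported as "sklearn".
    print(ensure_package("scikit-learn", "sklearn"))
```

Invoking pip as `python -m pip` through `sys.executable` ties the installation to the interpreter that runs the code, which helps when several Conda environments are present.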