Support and Certification Matrix
The platform comes pre-deployed with proprietary and third-party open-source tools and libraries that are exposed as application services that are managed using Kubernetes. Relevant services can be viewed and managed by users from the platform dashboard using a self-service model. (Note that some services that don’t require user intervention aren’t visible in the dashboard.) Users can also enhance their development experience by independently installing additional software and run it on top of the platform services. For more information, see Application Services and Tools.
The platform has two types of managed application services:
- Default services
- There are several service instances — such as Presto and the web APIs — that are spawn automatically when the platform starts and have a tenant-wide scope (i.e., they’re accessible to all tenant users with service permissions). The default services can’t be deleted by users, but service administrators can disable or restart these services and modify some service configurations.
- User-defined services
- Service administrators can create a wide variety of new service instances for certified services — such as Spark and Jupyter Notebook. Except where otherwise specified, user assigned services should be assigned to a specific running user but can optionally be shared also with all other tenant users with service permissions.
Pre-deployed Application Services and Tools
The following software packages, services, and tools are pre-deployed as part of the default version 2.10.0 platform installation:
Conda | Dashboard | Docker Registry | Frames | Grafana | Hadoop | Hive | Horovod | Jupyter | Kubernetes | Log Forwarder | MLRun | Monitoring | MPI Operator | Nuclio Serverless Framework | Operating System | Pipelines | Presto | Prometheus | Spark | TSDB CLI (V3IO) | TSDB Nuclio Functions | V3IO Daemon | Web APIs | Web Shell | Zeppelin
- Open-source tools and related services are subject to open-source restrictions.
- See Application Library Versions for programming application library versions that are used or certified for usage by relevant services, such as Nuclio and Jupyter Notebook.
|Dashboard||Default||The platform’s graphical user interface.||2.10.0|
|Kubernetes||Default||The Kubernetes (k8s) container orchestration system for automating deployment, scaling, and management of containerized applications. Application services in the platform run on top of Kubernetes.||k8s 1.15.5
|Nuclio Serverless Framework
|Default||Iguazio’s Nuclio Enterprise Edition serverless framework for development, deployment, and execution of serverless functions for real-time data processing and analysis.
The Nuclio dashboard is available as part of the
|Default||Iguazio’s MLrun open-source library for automating and tracking data science tasks and full workflows, including integration with Kubeflow Pipelines and the Nuclio serverless framework.||0.5.1|
|Default||The Google Kubeflow Pipelines open-source framework for building and deploying portable, scalable machine learning (ML) workflows based on Docker containers.||0.2.5|
|Default||The platform’s web-APIs (web-gateway) service, which provides access to its web APIs.||2.10.0|
|Jupyter||User-defined||The JupyterLab UI, including the Jupyter Notebook web application and shell terminals and the Conda binary package and environment manager. See also the Jupyter application-libraries compatibility matrix.||JupyterLab 1.0.10
Jupyter Notebook 6.0.3
|V3IO TSDB CLI||Default||The Iguazio V3IO Time-Series Database (TSDB) command-line interface (CLI) tool (
|V3IO TSDB Nuclio Functions||Default||The Iguazio V3IO TSDB Nuclio functions — Nuclio serverless functions that simplify working with the V3IO TSDB library.||0.5.11|
|Prometheus||User-defined||The Iguazio V3IO Prometheus distribution, which provides a version of the Prometheus systems monitoring and alerting toolkit that is packaged with the V3IO TSDB library and can be used to query time-series databases in the platform.||3.3.6 (Prometheus 2.8.0 and V3IO TSDB 0.9.6)|
|User-defined||The platform’s V3IO Frames service, which provides access to the Frames API — an open-source unified high-performance Python DataFrame API for accessing NoSQL, stream, and time-series data in the platform’s data store.
Frames is provided as a shared single-instance tenant-wide service. In cloud deployments, the service is pre-deployed and enabled by default.
0.7.36 (supported client)
The Apache Spark data-processing engine, including the following libraries:
|Default||The Presto distributed SQL query engine for big data.||323|
|Hive Metastore||Internal||An internal Apache Hive Metastore service that can be enabled for the Presto service to allow saving views and using the Presto Hive connector.||2.3.2|
|Horovod / MPI Operator
|Default||Distributed training using Kubeflow MPI Operator and Uber’s Horovod distributed deep-learning framework for creating machine-learning models that are trained simultaneously over multiple GPUs or CPUs.||0.2.3|
|Default||A platform service that uses Filebeat to forward application-service logs to be stored and indexed in an instance of the Elasticsearch search and analytics engine.
Note that this default service is disabled by default because you need to configure the URL of an Elasticsearch service for storing and indexing the logs. For more information, see Logging, Monitoring, and Debugging.
|Default||A platform service for monitoring Nuclio serverless functions and gathering performance statistics that are visualized in a Grafana dashboard. For more information, see Logging, Monitoring, and Debugging.||2.10.0|
|Grafana||User-defined||The Grafana analytics and monitoring platform.
In cloud platform environments, Grafana is currently available as a shared single-instance tenant-wide service.
|Docker Registry||Default and user-defined||A platform service for working with a Docker Registry, which is used by the Nuclio service to store the function images.
You can create a Docker Registry service and configure it to work with a remote off-cluster Docker Registry.
On the default tenant, the Nuclio service is configured by default to work with a pre-deployed default tenant-wide
|Web Shell||User-defined||A platform service that provides a web-based command-line shell (“web shell”) for running application services — such as Spark jobs and Presto queries — and performing basic file-system operations.
Note that this isn’t a fully functional Linux shell. For more information, see Application Services and Tools.
|Zeppelin||User-defined||The Apache Zeppelin web-based notebook.||0.8.2|
|Hadoop||Default||The Apache Hadoop distributed data-processing library.||2.9.0|
|Default (internal)||An internal service for integrating the platform with external applications by using the platform’s V3IO library.||2.10.0|
|Operating System||Internal||The CentOS Linux operating-system.||7.6|
Application Library Versions
The following table provides information about the versions of application libraries (packages) that are used or certified for usage with different pre-deployed platform tools and services:
|Service||Pre-deployed and Certified Application Library Versions|
|Platform API Libraries||Java 8
|Frames||Python 3.6 and 3.7|
|Nuclio||Python 3.6 and 2.7|
Iguazio platform tutorials (v3io/tutorials) 2.10.5
Iguazio V3IO Python SDK (v3io/v3io-py) 0.3.20 (Python 3.5–3.8 and 2.7)
Nuclio Jupyter package (nuclio/nuclio-jupyter) 0.8.5
NVIDIA CUDA 10.1
NVIDIA RAPIDS 0.14
Frames client 0.7.36
|Web Shell||Python 3.7
Integration with Additional Tools
You can independently install additional software tools — such as TensorFlow, PyTorch, or scikit-learn — and use them on top of the platform services. You can also configure remotely installed tools — such as Tableau or Looker — to analyze and visualize data in the platform. In addition, you can use Conda (which is available as part of the platform’s Jupyter Notebook service) and pip (which is available as part of the Jupyter Notebook, Zeppelin, and web-shell services) to install Python packages. For more information, see Application Services and Tools.