GCP Deployment Specifications
This document lists the hardware specifications for deploying version 3.0.3 of the Iguazio Data Science Platform ("the platform") on Google Cloud, also known as Google Cloud Platform (GCP). For details, refer to the Google Cloud Compute Engine documentation.
The data-node instances include Non-Volatile Memory Express (NVMe) SSD-based instance storage, which is optimized for low latency, very high random I/O performance, and high sequential read throughput. Data on the NVMe storage doesn't persist if the instance is stopped.
Don't shut down any of the data nodes: doing so erases the data stored on them.
The platform is available in two configurations, which differ in a variety of aspects, including the performance capacity, footprint, storage size, and scale capabilities:
- Development Kit
  - A single data-node and single application-node cluster implementation. This configuration is designed mainly for evaluation trials and doesn't include high availability (HA) or performance testing.
- Operational Cluster
  - A scalable cluster implementation that is composed of multiple data and application nodes. This configuration was designed to achieve superior performance that enables real-time execution of analytics, machine-learning (ML), and artificial-intelligence (AI) applications in a production pipeline. The minimum requirement for HA support is three data nodes and three application nodes.
Both configurations also support an additional backup node for backing up the platform instance.
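The HA minimum stated above can be expressed as a small pre-deployment check. The following is a minimal sketch, not part of the platform: `ClusterPlan` and `supports_ha` are hypothetical names introduced for illustration, and only the two thresholds come from this document.

```python
from dataclasses import dataclass

# Thresholds taken from the Operational Cluster description in this document.
MIN_HA_DATA_NODES = 3
MIN_HA_APP_NODES = 3


@dataclass(frozen=True)
class ClusterPlan:
    """Hypothetical description of a planned deployment (not a platform API)."""
    data_nodes: int
    app_nodes: int


def supports_ha(plan: ClusterPlan) -> bool:
    """Return True if the plan meets the documented HA minimum."""
    return (plan.data_nodes >= MIN_HA_DATA_NODES
            and plan.app_nodes >= MIN_HA_APP_NODES)


if __name__ == "__main__":
    # A Development Kit layout (one data node, one application node) is not HA.
    print(supports_ha(ClusterPlan(data_nodes=1, app_nodes=1)))
    # Three data nodes and three application nodes meet the minimum.
    print(supports_ha(ClusterPlan(data_nodes=3, app_nodes=3)))
```

A check like this only covers node counts; it says nothing about instance types or storage, which are specified in the sections that follow.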
GCP Data-Node Specifications
Data nodes in platform GCP deployments must meet the following hardware specifications:
|Data disks (local storage)|8 x 375 GB NVMe SSD|
|OS boot disk|Premium SSD; 400 GB (minimum)|
|Usable storage capacity|1 node (Development Kit): 2 TB; 3 nodes (Operational Cluster): 3.5 TB|
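The usable capacities listed above are lower than the raw capacity of the local disks. The arithmetic can be sketched as follows, assuming decimal units (1 TB = 1000 GB). The function names are hypothetical, and this document does not explain how the difference between raw and usable capacity arises, so the code only reports the ratio.

```python
# Values taken from the data-node specification in this document.
DISKS_PER_DATA_NODE = 8
DISK_SIZE_GB = 375
USABLE_TB_BY_NODE_COUNT = {1: 2.0, 3: 3.5}


def raw_capacity_tb(node_count: int) -> float:
    """Raw local NVMe capacity in TB, assuming 1 TB = 1000 GB."""
    return node_count * DISKS_PER_DATA_NODE * DISK_SIZE_GB / 1000


def usable_fraction(node_count: int) -> float:
    """Documented usable capacity divided by raw capacity."""
    return USABLE_TB_BY_NODE_COUNT[node_count] / raw_capacity_tb(node_count)


if __name__ == "__main__":
    for nodes in sorted(USABLE_TB_BY_NODE_COUNT):
        print(nodes, raw_capacity_tb(nodes), round(usable_fraction(nodes), 3))
```

Under that unit assumption, one data node has 3 TB of raw local storage and three data nodes have 9 TB.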
GCP Application-Node Specifications
Application nodes in platform GCP deployments are supported only on Google Kubernetes Engine (GKE) and must use one of the following instance types; choose the type that best fits your requirements. For specification details for each type, refer to the Google Cloud Compute Engine documentation.
- c2-standard-16 (default configuration)
GCP Backup-Node Specifications (Optional)
To back up your instance of the platform, you need an additional backup-node instance of type c2-standard-16.
- It's strongly recommended that you back up your data on a regular basis.
- The backup node is used only for backups and can be shut down between backups to save costs.
- The backup node must have at least 2 TB of network-attached storage (NAS) to be used only for backup purposes. The exact amount of required storage depends on the amount of data that's being used in the platform; consult Iguazio's support team.
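The backup-node requirements above can be checked before provisioning. The following is a minimal sketch with a hypothetical helper, `backup_node_issues`. It verifies only the instance type and the 2 TB minimum stated in this document; the storage actually needed depends on your data volume and should be confirmed with Iguazio's support team.

```python
from typing import List

# Requirements taken from the backup-node section of this document.
BACKUP_INSTANCE_TYPE = "c2-standard-16"
MIN_BACKUP_NAS_TB = 2.0


def backup_node_issues(instance_type: str, nas_tb: float) -> List[str]:
    """Return a list of problems with a planned backup node (empty if none)."""
    issues = []
    if instance_type != BACKUP_INSTANCE_TYPE:
        issues.append(
            f"instance type is {instance_type}, expected {BACKUP_INSTANCE_TYPE}"
        )
    if nas_tb < MIN_BACKUP_NAS_TB:
        issues.append(
            f"NAS capacity is {nas_tb} TB, minimum is {MIN_BACKUP_NAS_TB} TB"
        )
    return issues


if __name__ == "__main__":
    for problem in backup_node_issues("c2-standard-16", 1.5):
        print(problem)
```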