Bring data science into business applications with Iguazio’s managed and secure Data Science Platform
Automated Collection and Processing
With Iguazio’s Nuclio Serverless Functions, users easily collect data from various sources, merge streaming on the fly, transform data and store it in different formats. Nuclio provides fast and secure access to real-time and historical data at scale, including event-driven streaming, time series, NoSQL, SQL and files.
Preparation and Exploration
Iguazio provides a collaborative central platform. Data scientists easily explore and access data using a Jupyter notebook and work with popular frameworks such as Spark , Presto and Pandas. These analytics tools all run transparently from Jupyter notebook with seamless access to Iguaizo’s multi-model data layer. Users store and access data with different formats, such as NoSQL (“key/value”), time series, stream data and files (simple objects), while leveraging different tools and APIs to access and manipulate the data, all from a single development environment.
Building and Training at Scale
Iguazio’s open Python environment with built-in machine learning libraries like Scikit Learn, NumPy, Pytorch and TensorFlow, enables users to build and train models easily. When a model is ready, users validate it against a real-time production-like dataset on a distributed cluster, leveraging frameworks such as Dask, Spark, Horovod and Tensorflow with GPU support.
Users deploy models from Jupyter notebook to production with just a few clicks and in a reproducible way. The code is deployed as a function with all of its relevant configurations and is immediately ready to run in a serving layer either on Iguazio’s scalable platform or on a 3rd party cluster.
ML Pipeline Automation
Building machine learning applications is not just about creating models. In order to make them operational, a proper workflow is needed – a pipeline which streamlines the process of experiments and inferencing. Iguazio includes KubeFlow, the leading tool in the industry for running pipelines. KubeFlow is an open source Kubernetes-native platform for developing, orchestrating, deploying and running scalable and portable ML workloads.
On demand service running in the iguazio managed cloud or in alternative public multi-clouds
Co-sell ready on the Azure and Azure Stack marketplace for cloud and edge applications
A complete cloud experience running on-premises on iguazio’s scalable multi-node cluster
Edge appliance handling large data volumes with small footprint, aggregated from devices in real-time