Deployment and Usage Guide for Running AI Workloads on Red Hat OpenShift and NVIDIA DGX Systems with IBM Spectrum Scale


Simon Lorenz and others

Publisher Description

This IBM® Redpaper publication describes the architecture, installation procedure, and results for running a typical training application that works on an automotive data set in an orchestrated and secured environment that provides horizontal scalability of GPU resources across physical node boundaries for deep neural network (DNN) workloads.

This paper is mostly relevant for systems engineers, system administrators, and system architects who are responsible for data center infrastructure management and typical day-to-day operations such as system monitoring, operational control, asset management, and security audits.

This paper also describes IBM Spectrum® LSF® as a workload manager and IBM Spectrum Discover as a metadata search engine to find the right data for an inference job and automate the data science workflow. With this solution, the location of the data, which may be on different storage systems, and the time at which it becomes available for the AI job can be fully abstracted, which provides valuable information for data scientists.

Genre
Computers & Internet
Released
November 30, 2020
Language
EN
English
Length
60 pages
Publisher
IBM Redbooks
Seller
International Business Machines Corp
Size
585.4 KB
AI and Big Data on IBM Power Systems Servers
2019
IBM Reference Architecture for Genomics, Power Systems Edition
2016
Red Hat OpenShift V4.3 on IBM Power Systems Reference Guide
2020
IBM Data Engine for Hadoop and Spark
2016
IBM Platform Computing Solutions Reference Architectures and Best Practices
2014
Implementing an IBM High-Performance Computing Solution on IBM POWER8
2015
A Deployment Guide for IBM Spectrum Scale Unified File and Object Storage
2017
Data Accelerator for AI and Analytics
2021
Implementing OpenStack SwiftHLM with IBM Spectrum Archive EE or IBM Spectrum Protect for Space Management
2017
Analyse empirischer Befunde zur Einrichtung von Performance Measurement-Systemen unter Berücksichtigung von Ursache-Wirkungsbeziehungen
2011
IBM Storage for Red Hat OpenShift Blueprint
2020
Storage Multi-tenancy for Red Hat OpenShift Container Platform with IBM Storage
2021
Red Hat OpenShift on IBM Z Installation Guide
2020
Red Hat OpenShift V4.3 on IBM Power Systems Reference Guide
2020
Deploying SAP Software in Red Hat OpenShift on IBM Power Systems
2021
Using the IBM Block Storage CSI Driver in a Red Hat OpenShift Environment
2021