Skip to content

Overview

This is a reference design and implementation of an environment template to provision a self-service AI/ML environment for data scientists powered by Kubeflow on an Amazon EKS cluster.


Source Code for Template

The Terraform source code and related assets are available in a public Git repo as shown below

Git Repo


Overview

In this self-paced exercise, you will setup and use Environment Manager to build and manage resources in AWS. These resources will be controlled as a single environment through the controller.

Environment Manager enables a self-service model for Development and DevOps teams (Admins) to create environments while giving enough control to the central Ops, SRE and Platform teams (End Users) to enforce security, cost, governance guardrails and standardization.

In this exercise, the environment resources that will be created in AWS are defined in Terraform that are stored in a Git repository.

The following resources will be created as part of the environment:

  • VPC
  • EKS Cluster
  • Kubeflow

Environment Diagram

This exercise is expected to take ~60 minutes to complete.

A setup script will be used to assist in setting up the pre-requisites for this guide. The setup script will perform the following actions:

  • Create a repository in the controller
  • Create a Secret Sealer in the controller
  • Create a GitOps Agent that will run on the machine where the setup script is executed from
  • Create a GitOps System-Sync pipeline in the controller that will be used to sync resources between the controller and the forked Git repository

The sequence diagram below illustrates the steps you will experience at a high level.

Important

Either the Environment Template User or the Infra Admin can perform the deploy/destroy operations on an environment.

sequenceDiagram
    autonumber
    participant admin as Platform Team
    participant rafay as Rafay
    participant user as Developer


    rect rgb(191, 223, 255)
    Note over admin,rafay: Setup Environment Template
    admin->>admin: Clone Git Repo
    admin->>rafay: Setup Env Manager <br>Resources
    admin->>rafay: Input Cloud Credentials
    end

    rect rgb(191, 223, 255)
    Note over rafay,user: Provision & Use Environment
    user->>rafay: Create Environment
    user->>rafay: Use Environment
    user->>rafay: Destroy Environment
    end

What Will You Do by Part

Part What will you do?
1 Setup and Configure Environment Manager
2 Manage the lifecycle of a AWS environment

Assumptions

  • You have access to a AWS Account
  • You have access to a Mac or Linux machine
  • You have a Git client on your machine that is setup for push/pull
  • You have Docker installed on your machine