TechnologyMarch 13, 2019

Hands-on Tutorial to Install DataStax Enterprise on AWS Cloud in 3 Easy Steps

Hands-on Tutorial to Install DataStax Enterprise on AWS Cloud in 3 Easy Steps

DataStax powers the Right-Now Enterprise with the always-on, distributed cloud database built on Apache Cassandra™ and designed for hybrid cloud. So let's get started.

There are several options to install DSE which are documented here. In this tutorial, I will use the Lifecycle Manager (LCM) which allows you to easily provision, configure, and install DataStax Enterprise clusters.

Deployment Architecture

In this tutorial for simple functional testing, I have used t2.medium instance type on AWS which is sufficient.

Here are some guidelines to consider as you plan your Enterprise implementation https://docs.datastax.com/en/dse-planning/doc/planning/planningHardware.html

1.) Provision your Virtual Machine on AWS

       a.) Configure Security Rules

 

Navigate on AWS to Services → Compute →(Click on) EC2

On the left-hand side, you will see Network and Security →Security Groups →(Click on) Create Security Group

I have created a new security group as highlighted below. Note the name of this security group “sg-00365e9a79b7a4f3a”. It will be used as we add the inbound rules for internode communication.

Internode Communication

Edit the Inbound Security rules to look like below.

Reference

Inbound Security Rules

 

Edit Inbound Rules

You don’t need to make any changes to the Outbound Rule.

       b.) Create Virtual Machine on EC2

We will create 4 VM’s. One for OpsCenter and 3 for the distributed database Nodes

So navigate to Service → Compute → EC2 → Launch Instance → Search for “Ubuntu Server 16.04 LTS (HVM), SSD Volume Type”

Choose AMI

Change Instance Type to T2.Medium

Choose an Instance Type

Click: Next Configure Instance Details and change Number of instances to 4

Configure Instance Details

You can keep the storage default, which is 8GB, for this initial functional test. (You will need to consider adding additional storage “as needed” for your test case). Click on Add Tags.

Click on Next: Configure Security Group.

Use the existing security group which we created before.

Configure Security Group

Click Review and Launch. It will also ask you to create a new SSH key. Make sure to save it. Now you would have 4 VM’s ready.

Launch Instance

Note: In AWS the IP address changes once you bounce the VM. So navigate to Network & Security → Elastic IPs. Allocate 4 IP’s and associate them to the VM’s. This way we can preserve the IP address even if the VM’s gets rebooted.

Allocate New Address

2.) Install OpsCenter (Visual Management and Monitoring)

  a.) Login to the OpsCenter Server on AWS

ssh -i ".pem" ubuntu@

       b.) Install Python 2.7

sudo apt update sudo apt install python2.7 python-pip python -V

A successful python install should look

Python Install

       c.) Install Java 1.8

sudo apt-get update sudo apt-get install openjdk-8-jdk java -version

Java Version

       d.) Install OpsCenter

Add the datastax Repository File and a Key

echo "deb https://DSA_email_address:downloads_key@debian.datastax.com/enterprise stable main" | sudo tee -a /etc/apt/sources.list.d/datastax.sources.list curl -L https://debian.datastax.com/debian/repo_key | sudo apt-key add - sudo apt-get update

You can generate your " downloads_key " from https://academy.datastax.com/downloads

Downloads Key

sudo apt-get install libaio1 sudo apt-get install opscenter=6.7.0 sudo service opscenterd start ps -ef|grep opscenter

A successful startup of OpsCenter will show you a background process as below.

OpsCenter

OpsCenter runs on Port 8888.

Lets validate using and access it using

http://:8888/

DataStax OpsCenter

3.) Deploy a 3 node cluster using LCM (OpsCenter Life Cycle Management)

Access the OpsCenter as seen in the previous steps, select “create a new cluster” and click Get Started.

Lifecycle Manager

1. Add SSH Credential

OpsCenter LCM will use this key to access the VM’s created on AWS to deploy the cluster.

Enter Login User Name as: ubuntu

Copy paste the Private Key used to create the VM’s on AWS.

And Save.

Add Credential

 

2. Add Config Profile

The config profile will deploy the version of DSE you wish to install. I have selected the latest version which is 6.7.2 from the drop-down box. And click Save.

Config Profiles

3. Add Repository

OpsCenter LCM will connect to the repository and download the software.

Repositories

4. Add a Cluster

Let's name the cluster.

Select ssh credentials

Select the repository

Select the config profile

Enter the OLD password as cassandra

Enter the new password as per your password policy

Click Save.

Clusters

Click on the Cluster and a Datacenter tab will show up. Click the + sign on the datacenter tab and enter the below details.

Name for the DataCenter: DC1

Select the config profile from the drop-down box.

Select the ssh credentials.

Select the workload. In this example, I have selected a Cassandra WorkLoad.

Cassandra Workload

Add the Nodes

Click on the + sign next to the nodes tab.

Nodes

Enter node name as N1

Enter Rack as Rack1

Enter the Private IP address of the VM that we created. Remember we

Select the ssh credentials and config profile from the drop-down box.

Click Save

Add Node

Similarly, Add the other two nodes N2 & N3 as seen below.Add Nodes

Click on the 3 dots you see next to the DEV in the Clusters Tab and click on install.

Clusters Tab

Keep the defaults and click Submit.

Run Installation Job

In the next couple of minutes, OpsCenter LCM will download and install the software on all the 3 nodes and create a one single DSE Cluster ready to use.

As successful install would look like this.

Lifecycle Manager

Click on OpsCenter URL to monitor and manage your cluster.

OpsCenter

One-Stop Data API for Production GenAI

Astra DB gives developers a complete data API and out-of-the-box integrations that make it easier to build production RAG apps with high relevancy and low latency.