Spark Migration Tool for Astra DB

Migrate data from an existing Cassandra cluster to Astra DB using a Spark application.

40 minutesExpert

Updated September 14, 2021


Spark Migration Tool for Astra DB

40 minutes, Expert, Start Building

Migrate data from an existing Cassandra cluster to Astra DB using a Spark application.

image

Objectives

  • Leverage Spark to migrate data from a Cassandra cluster to Cassandra on Astra DB.

How this works

We're using Spark to migrate data from a Cassandra cluster to Cassandra on Astra DB.

Get Started

To build and play with this app, follow the build instructions that are located here: https://github.com/DataStax-Examples/astra-spark-migration

Running the Astra DB to Spark Migration Tool

Follow the instructions below to get started.

Prerequisites

Let's do some initial setup by creating a serverless(!) database.

DataStax Astra

  1. Create a DataStax Astra account if you don't already have one: image

  2. On the home page. Locate the button Create Database image

  3. Locate the Get Started button to continue image

  4. Define a database name, keyspace name and select a database region, then click create database. image

  5. Your Astra DB will be ready when the status will change from Pending to Active 💥💥💥 image

  6. After your database is provisioned, we need to generate an Application Token for our App. Go to the Settings tab in the database home screen. image

  7. Select Admin User for the role for this Sample App and then generate the token. Download the CSV so that we can use the credentials we need later. image

  8. After you have your Application Token, head to the database connect screen and select the driver connection that we need. Go ahead and download the Secure Bundle for the driver. image

  9. Make note of where to use the Client Id and Client Secret that is part of the Application Token that we generated earlier. image

Github

  1. Click Use this template at the top of the GitHub Repository: image

  2. Enter a repository name and click 'Create repository from template': image

  3. Clone the repository: image

Create your Account and get started!

Let's Go!
View on GitHubDeploy to NetlifyDeploy to Vercel