DS201: Cassandra Core Concepts

Description: A deeper dive into the core technology at the heart of DataStax Enterprise. Through lecture and numerous hands-on exercises, students learn the command-line tools required to manage a Cassandra instance, the Cassandra data model, the specifics of replication and fault tolerance, how to choose between strong eventual consistency, and Cassandra’s on-disk storage engine and compaction algorithms.
Length: 4 days online/3 days in person
Prerequisites: Prior software administration or development experience
Audience: Administrators and developers seeking to use Apache Cassandra 2.0. This course serves as a foundation and prerequisite for other DataStax Training courses.
Environment: Virtual Machine pre-configured with Cassandra, related tooling, and exercise files.

Learning Objectives

Introducing Big Data and Cassandra

  • Understand Big Data
  • Describe Apache Cassandra
  • Understand common use cases
  • Survey the Cassandra architecture

Installing, configuring and running Cassandra locally

  • Prepare the operating system
  • Select and install a Cassandra version
  • Configure Cassandra for a single node
  • Start and stop a Cassandra instance

Introducing the Cassandra Data Model and Cassandra Query Language

  • Describe the Cassandra data model
  • Introduce cqlsh
  • Describe the DDL subset of CQL
  • Introduce DevCenter
  • Describe the DML subset of CQL
  • Understand basics of data modeling for Cassandra

Introducing Cassandra tools

  • Configure nodes and clusters using CCM
  • Introduce and use cqlsh shell commands
  • Introduce and use nodetool
  • Populate and test nodes using cassandra-stress
  • Identify additional Cassandra tools

Understanding Cassandra’s internal architecture

  • Understand how requests are coordinated
  • Understand replication
  • Understand and tune consistency
  • Understand how nodes communicate
  • Understand the system keyspace

Working with the Cassandra write path

  • Understand how data is written to the storage engine
  • Understand the data directories

Working with the Cassandra read path

  • Understand how data is read from the storage engine
  • Introduce anti-entropy operations
  • Read data from Cassandra

Understanding Compaction

  • Understand Cassandra Compaction and its necessity
  • Choose and implement compaction strategies

Introducing hardware planning and troubleshooting

  • Introduce best practices in hardware implementation
  • Introduce troubleshooting resources

For more information, contact us.