At DataStax, we're committed to helping you acquire the knowledge and skills it takes to gain maximum value from Apache Cassandra and DataStax Enterprise. We offer free self-paced online courses, along with custom options for onsite training.
A half-day overview of DataStax Enterprise. The course covers the architecture, scaling and fault tolerance features, and data model of Apache Cassandra, the open-source technology at the heart of DataStax Enterprise. We also review full-text search with Apache Solr, analytics with Apache Hadoop and Spark, and enterprise management with OpsCenter.
A deeper dive into the core technology at the heart of DataStax Enterprise. Through lecture and numerous hands-on exercises, students learn the command-line tools required to manage a Cassandra instance, the Cassandra data model, the specifics of replication and fault tolerance, how to choose between strong and eventual consistency, and Cassandra’s on-disk storage engine and compaction algorithms.
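As a rough illustration of the strong versus eventual consistency choice mentioned above, the sketch below applies the commonly cited rule that reads see the latest write when the number of replicas consulted on read plus the number acknowledged on write exceeds the replication factor. The function names are hypothetical and exist only for this example; they are not part of any Cassandra or DataStax API.

```python
def quorum(replication_factor: int) -> int:
    """Smallest number of replicas that forms a majority."""
    return replication_factor // 2 + 1


def is_strongly_consistent(read_replicas: int, write_replicas: int,
                           replication_factor: int) -> bool:
    """True when every read set must overlap every write set (R + W > RF)."""
    return read_replicas + write_replicas > replication_factor


if __name__ == "__main__":
    rf = 3  # assumed replication factor, for illustration only
    q = quorum(rf)
    # QUORUM reads with QUORUM writes overlap on at least one replica.
    print(is_strongly_consistent(q, q, rf))
    # ONE read with ONE write may miss the latest value (eventual consistency).
    print(is_strongly_consistent(1, 1, rf))
```

Lower read and write counts trade that overlap guarantee for lower latency and higher availability.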
A thorough review of the tools, skills, and techniques needed to administer a Cassandra cluster, tune its performance, and diagnose and resolve common production problems. Students apply lesson content in exercises performed on real cloud-based clusters.
The Cassandra data model looks similar to the legacy relational model, but is different in important ways. This course contains a thorough treatment of the Cassandra data model, and presents the Chebotko method of translating a real-world domain model into a running Cassandra schema. The data modeling techniques presented in this course are essential to a successful DataStax Enterprise deployment.
A deep dive into Solr, the open-source engine behind DataStax Enterprise search capabilities. Through a mix of lecture and frequent hands-on labs, the course covers the integration of Solr and DataStax Enterprise. Topics include text tokenization and analysis, Solr schemas, the theory and operation of inverted indices, and the Solr query language.
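To give a flavor of the inverted-index idea named among the topics above, here is a minimal sketch in plain Python. It is an assumption-laden toy, not how Solr or Lucene is implemented: the tokenizer is a naive lowercase split, and the helper names `tokenize`, `build_index`, and `search` are invented for this example.

```python
import re
from collections import defaultdict


def tokenize(text: str) -> list[str]:
    """Naive analysis step: lowercase, then split on non-alphanumerics."""
    return [t for t in re.split(r"[^a-z0-9]+", text.lower()) if t]


def build_index(docs: dict[int, str]) -> dict[str, set[int]]:
    """Map each term to the set of document ids that contain it."""
    index: dict[str, set[int]] = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return dict(index)


def search(index: dict[str, set[int]], query: str) -> set[int]:
    """Return ids of documents containing every term in the query."""
    terms = tokenize(query)
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


if __name__ == "__main__":
    docs = {
        1: "Cassandra stores data",
        2: "Solr indexes Cassandra data",
        3: "Spark analytics",
    }
    index = build_index(docs)
    print(search(index, "cassandra data"))
```

The lookup cost depends on the number of query terms and the size of their posting sets rather than on scanning every document, which is the property that makes inverted indices suitable for full-text search.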
A deep dive into Spark, the open-source engine behind DataStax Enterprise analytics capabilities. Students learn the SparkContext API through numerous hands-on exercises in Scala. Pair RDDs are treated in detail, including their integration with CQL tables to move data in and out of operational Cassandra structures. Students use the Action and Transformation APIs, and learn how Spark Streaming enables real-time analysis and Spark SQL enables ad-hoc SQL queries over data stored in Cassandra.