Deep dive into Apache Cassandra internals

By Jake Luciani -  March 30, 2015 | 1 Comment

At the DataStax company kickoff this year, the internal Apache Cassandra dev team gave a great series of talks for developers in other teams who wanted to learn more about how the Apache Cassandra code is structured. In an effort to help all you external developers who would love to contribute to Apache Cassandra but feel you would need hours of brain dumps from the other knowledgeable developers... here is your chance! Once you have gone through these talks and feel ready to tackle a feature or bug check out instructions on the wiki.

First up, Sam Tunnicliffe (@beobal) discusses the general internals of Cassandra including the start-up sequence, internal messaging API, replication, etc...

Video:

link to slides

Next, Josh McKenzie (@jam14392) takes us through the low level of the read and write paths.

Video:

link to slides

Next, Marcus Eriksson (@krummas) takes us deep into the exiting world of compaction and their surrounding APIs

Video:

link to slides

Finally, Tyler Hobbs (@tylhobbs) explains the CQL protocol and APIs

Video:

After the talks we asked the developers to try a coding exercise to put their new Cassandra knowledge to use. The task was to write a new CompactionStrategy for Cassandra that compacts the sstables that contain the most overlapping partitions. You can read the exercise instructions and slides here.









DataStax has many ways for you to advance in your career and knowledge.

You can take free classes, get certified, or read one of our many white papers.



register for classes

get certified

DBA's Guide to NoSQL







Comments

  1. Kevin Burton says:

    You should upload these to youtube so they’re easier to share within social media. I was going to transcode these to podcasts but it doesn’t work on google docs.

Comments

Your email address will not be published. Required fields are marked *




Subscribe for newsletter:

Tel. +1 (408) 933-3120 sales@datastax.com Offices France GermanyJapan

DataStax Enterprise is powered by the best distribution of Apache Cassandra™.

© 2017 DataStax, All Rights Reserved. DataStax, Titan, and TitanDB are registered trademark of DataStax, Inc. and its subsidiaries in the United States and/or other countries.
Apache Cassandra, Apache, Tomcat, Lucene, Solr, Hadoop, Spark, TinkerPop, and Cassandra are trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.