Deep dive into Apache Cassandra internals
At this year's DataStax company kickoff, the internal Apache Cassandra dev team gave a great series of talks for developers on other teams who wanted to learn more about how the Apache Cassandra code is structured. If you are an external developer who would love to contribute to Apache Cassandra but feel you would need hours of brain dumps from more knowledgeable developers, here is your chance! Once you have gone through these talks and feel ready to tackle a feature or bug, check out the instructions on the wiki.
First up, Sam Tunnicliffe (@beobal) discusses the general internals of Cassandra, including the start-up sequence, the internal messaging API, replication, and more.
Next, Josh McKenzie (@jam14392) takes us through the low-level details of the read and write paths.
Next, Marcus Eriksson (@krummas) takes us deep into the exciting world of compaction and its surrounding APIs.
Finally, Tyler Hobbs (@tylhobbs) explains the CQL protocol and APIs.
After the talks, we asked the developers to try a coding exercise to put their new Cassandra knowledge to use. The task was to write a new CompactionStrategy for Cassandra that compacts the sstables containing the most overlapping partitions. You can read the exercise instructions and slides here.
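To give a feel for the selection step at the heart of that exercise, here is a minimal sketch in plain Java. It does not use any Cassandra classes: the class `OverlapSketch`, the method names, and the idea of modelling each sstable as a name mapped to an in-memory set of partition keys are all assumptions made for illustration. A real strategy would plug into Cassandra's compaction APIs and would need a cheaper way to estimate overlap than holding every key in memory.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class OverlapSketch {

    // Count the partition keys that two (hypothetical) sstables have in common.
    static int overlap(Set<String> a, Set<String> b) {
        // Iterate over the smaller set and probe the larger one.
        Set<String> smaller = a.size() <= b.size() ? a : b;
        Set<String> larger = (smaller == a) ? b : a;
        int count = 0;
        for (String key : smaller) {
            if (larger.contains(key)) {
                count++;
            }
        }
        return count;
    }

    // Return the names of the two sstables sharing the most partition keys,
    // or an empty list if no pair shares any key.
    // The map stands in for sstable metadata: name -> partition keys (an assumption).
    static List<String> mostOverlappingPair(Map<String, Set<String>> sstables) {
        List<Map.Entry<String, Set<String>>> entries = new ArrayList<>(sstables.entrySet());
        List<String> best = List.of();
        int bestOverlap = 0;
        // Brute-force comparison of every pair; fine for a sketch, quadratic in practice.
        for (int i = 0; i < entries.size(); i++) {
            for (int j = i + 1; j < entries.size(); j++) {
                int shared = overlap(entries.get(i).getValue(), entries.get(j).getValue());
                if (shared > bestOverlap) {
                    bestOverlap = shared;
                    best = List.of(entries.get(i).getKey(), entries.get(j).getKey());
                }
            }
        }
        return best;
    }
}
```

Compacting the pair that shares the most partitions tends to reclaim the most redundant data per compaction, which is the intuition behind the exercise; how candidates are grouped and how overlap is estimated are the parts left to the developer.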