Talks I’m looking forward to at the 2013 Cassandra Summit Europe
The 2013 Cassandra Summit Europe is in just over two weeks! This is the first European summit that DataStax has organized, and it’s shaping up to be fantastic, with four tracks on 17th and full-day training classes on the 16th and 18th.
Some of the talks I’m looking forward to seeing include:
- Getting Closer to Cassandra, by Oleg Anastasyev. Oleg works on Cassandra at Odnoklassniki, one of Russia’s largest social networks. They are long-time users of Cassandra, and currently run a customized version of Cassandra 0.6 on their largest cluster. Oleg has contributed several useful improvements to mainline Cassandra as he looks at upgrading to 2.0, and I’m excited to hear more from him.
- Time series data was one of the first use cases for Cassandra and remains a popular application. Rackspace Cloud Monitoring (née CloudKick) has been doing this longer than most; check out the Blueflood talk by Cassandra committer Gary Dusbabek for a good overview of the challenges and techniques involved in monitoring 30 million signals in production.
- Building a CQL Driver, by Theo Hultberg. Theo is the author of the main Ruby client for Cassandra, cql-rb, and a veteran Cassandra architect at Swedish big data startup Burt. Expect this to be one of the more hardcore technical talks.
- Sylvain Lebresne’s talk on The State of CQL should be more focused on CQL itself, although Sylvain also wrote the Java driver. Expect Sylvain to cover the advances made in CQL this entire year, as well as coming-in-2.1-features like user-defined types.
- Cassandra internals, by Cassandra committer and consultant Aaron Morton. Another deep-in-the-weeds technical talk, for people who want to know what’s going on under the hood — which, in my experience, always helps understanding user-level behavior. Aaron’s talk on internals at the SF summit this year was very well reviewed; check out his slides for a taste.
- Another “greatest hit” from the SF summit is Tim Moreton’s talk on Real-time Analytics with Cassandra. Stay for the demo, it’s worth it.
- From Simple CQL to Time-Series Event Tracking and Aggregation Using Cassandra and Hadoop, by Mick Semb Wever. Mick has been running Hadoop against Cassandra for almost as long as it’s been supported. (About 3.5 years, for those of you keeping track.) I’m looking forward to hearing how he’s evolved FINN’s architecture along the way.
- Speaking of Hadoop, Spark is a new technology that promises to solve Hadoop-like batch calculations faster and with less impact on your cluster. I’ve been excited about Spark/Cassandra integration since seeing Evan Chan’s talk on doing exactly this at Ooyala at the SF Summit. However, Ooyala would be the first to acknowledge that writing a custom InputFormat to deal with protobuf-encoded Cassandra data is a bit idiosyncratic, so I’m looking forward to a more mainstream take on the subject by Richard Low.
- Last but not least, Jonathan Halliday’s talk on real-time ranking queries. I’ve been following his work for a while, and he has some interesting ideas for a challenging problem domain. I’m also interested in seeing how he integrates the [Thrift-based] intravert with CQL3.
See you at the Summit!