Event Details

The C* Summit is the premier global conference for the Apache Cassandra community.

  • Join 1,000 other experts just like you; people with a passion for Apache Cassandra
  • Real customer use cases for big data platforms are the heart of this conference. Come learn best practices from leaders in the field.
  • Learn how to push development boundaries, deliver new solutions, and grow your career.
  • Get certified as a DataStax Certified Cassandra Developer during the Summit.
  • Have fun!  Meet the experts, make new friends, and discover beautiful San Francisco.

Meet the Experts

Tips, tricks, and training from the Apache Cassandra experts.  We all benefit from a little help at work.  Meet the experts with the answers to your toughest challenges, common solutions to everyday problems, and opportunities to try a new approach. There will be "Birds of a Feather" sessions moderated by Expert Developers and DataStax CTO, plus a potential "Stump the Experts" dev session.

Have Fun

All work and no play?  Not at Cassandra Summit.  We like to party before, during, and after the summit:

  • After party: Networking cocktail hours from 5:30 – 8PM will include food and music. Stick around to see the lights come on in the City.

Inspiration and fun served daily with food, drink, and socializing. Between the variety of sessions and interactive Q&A, we will be offering a closing session of "Lightning Rounds" which have always been one of the very best showcases of emerging talent and rockstar companies.

C*areer Fair

With more than 100 big data experts, cloud developers, and Apache Cassandra gurus at the 2013 Cassandra Summit, what better place to network and discuss career possibilities within the industry? Our sponsors’ Talent Acquisition teams will be on hand to answer questions about the growth and employment opportunities within their companies. Don’t forget to stop by and say hello!

The DataStax Experience

The DataStax Experience: cutting edge while comfortable, exciting while enterprising, demanding while delicious. DataStax has brought big data to the enterprise, which means we help you walk that line between doing the very latest and having the very best -- proven for your most critical online business applications. Trust us; you'll love it.

Schedule

Tracks: - Technical    - Use Case    - Executive

Day 1, June 11

Track 1
Festival Pavilion
Track 2
Golden Gate Room
Conference Center
Track 3
Fleet Room
Building D
Track 4
Marina Room Conference Center
Track 5
Firehouse Building
Meet the Experts
Building C370
7:00AM-9:00AM Open Registration
7:30AM-8:30AM
Press Relations Breakfast - Invite Only
Breakfast
8:45AM-9:30AM
Keynote Sessions
Billy Bosworth, DataStax CEO
Jonathan Ellis, Apache Cassandra Chair & DataStax CTO
9:30AM-10:20AM
Thomas J. Glazier, Accenture
10:20AM-10:50AM Break
10:50AM-11:20AM
Adrian Cockcroft, Netflix
Eric Lubow, SimpleReach
Jeff Smoley and Derek Bromenshenkel, NativeX (formerly W3i)
Darshan Rawal, Openwave Messaging
Isaac Rieksts, Health Market Science
CERT
11:20AM-11:50AM
Lara Shackelford, DataStax; Sean Knapp, Ooyala; Stefan Piesche, Constant Contact ; Terrell Deppe, HealthCare Anytime; Stephen O'Sullivan, Silicon Valley Data Science
Jesse Young, Zonar Systems
11:50AM-12:20PM
Mark Davis, Dell
Evan Chan, Ooyala
Mike Heffner, Librato
Mohit Anchlia, Intuit
12:30PM-1:30PM Lunch
1:30PM-2:00PM
Ken Krugler, Scale Unlimited
Christos Kalantzis, Netflix
Charles Lamanna and Ricardo Villalobos, Microsoft
Aaron Stannard, Marked Up
Billy Bosworth, DataStax, John Whittaker, Dell & Vincent Dell'Anno, Accenture
Meet the Experts
2:00PM-2:30PM
Sam Heywood, Gazzang
Matt Pfeil, DataStax, Rick Branson, Instagram, Adrian Cockcroft, Netflix
2:30PM-3:00PM
Matthias Broecheler Aurelius
Joe Maguire, Analyst/Author
Eric Evans, OpenNMS
Andy Cobley, University of Dundee
Sean Knapp, Ooyala
3:00PM-3:30PM Break
3:30PM-4:00PM
Terrell Deppe, HealthCare Anytime
Jason Brown, Netflix
Andrew Noonan, Gnip
Lewis John McGibbney and Renato Javier, Stanford
Godfrey Sullivan, Splunk
4:00PM-4:30PM
Tim Moreton, Acunu
5:15PM-6:15PM The Next Great Data Developer Contest
6:15PM-8:30PM Happy Hour


Day 2, June 12

Track 1
Festival Pavilion
Track 2
Golden Gate Room
Conference Center
Track 3
Fleet Room
Building D
Track 4
Marina Room Conference Center
Track 5
Firehouse Building
Meet the Experts
Building C370
7:00AM-9:00AM Open Registration
7:45AM-9:15AM Networking Breakfast
9:30AM-10:00AM
Michael Kjellman, Barracuda Networks
Matt Kennedy, Fusion I/O
Dave Gardner, Hailo
Matt Stump, KISSMetrics
Christos Kalantzis, Netflix and Billy Bosworth, DataStax
CERT
10:00AM-10:30AM
10:30AM-11:00AM Break
11:00AM-11:30AM
Patrick McFadin, DataStax
Les Hazelwood, StormPath
Al Tobey, Ooyala
Renat Khasanshyn, Altoros and Cornelia Davis, Pivotal
Zorawar Biri Singh and Gurvinder Singh Ahluwalia, IBM
CERT
11:30AM-12:00PM
Jay Patel, eBay
Manish Sood, Reltio
12:00PM-1:10PM Lunch
1:10PM-1:40PM
Ameet Chaubal and Fausto Inestroza, Accenture
Jason Rutherglen, DataStax
David Leimbrock, Riptide IO
Peter Goldmacher, Cowen and Company
Meet the Experts
1:40PM-2:10PM
Cassandra Internals
Aaron Morton, Apache Cassandra Committer
Rich Hammel, Brocade and Vivek Ganesan, Impetus Technologies
Colin Charles, Monty Program Ab
2:10PM-2:40PM
Eddie Satterly, Splunk
Jake Luciani and Carl Yeksigian, BlueMountain Capital
DeWayne Filppi, GigaSpaces
Stefan Piesche, Chief Technology Officer at Constant Contact
Guided Discussion
2:40PM-3:10PM Break
3:10PM-3:40PM
Aaron Morton, Apache Cassandra Committer
Boris Wolf, Comcast
Chris McEniry and Igor von Nyssen, Sony Network Entertainment
Sylvain Lebresne, DataStax
Brian Gentile, Jaspersoft
3:40PM-4:10PM
Rick Branson, Instagram
4:10PM-5:00PM
Feng Qu and Anurag Jambhekar, eBay
Sameer Farooqui
Axel Lijencrantz, Spotify
Michael Figuiere and Patrick McFadin, DataStax

Sessions

Adaptive Data Convergence for Life Sciences

Speaker: Manish Sood, CEO & Founder at Reltio
The Life Sciences industry is undergoing significant changes in how companies do business due to recent legislative changes. The evolving landscape is forcing the Pharmaceutical companies to change how their entire Sales model and move from Prescriber based sales model to an Account based sales model, which has a downstream impact on Sales team organization, Field sales alignment, Incentive compensation and Marketing. This changing reality also requires that the Pharmaceutical companies understand and drive the changes to business strategy on insights driven by data about Prescription Sales, Medication Adherence, Claims, etc. to name a few categories of data sources. The required insights are derived from the convergence of data from multiple sources that include numerous internal applications, 3rd party data sources and social media. In this session, learn how Reltio is helping various Pharmaceutical companies cope with the evolving business landscape with a data driven strategy by leveraging the Reltio data science engine that runs on Cassandra.

Aligning Technology Infrastructure With Horizontal Business Growth

Speaker: Stefan Piesche, Chief Technology Officer at Constant Contact
During this presentation Stefan Piesche, Chief Technology Officer at Constant Contact, will discuss how he and his team were able to grow and scale Constant Contact’s technology infrastructure by aligning technology with horizontal business growth to improve performance and reduce costs. He will share some of the lessons learned, best practices, and recommendations for other technology executives looking to transform their technology infrastructure to business.

Analytic Fabric, The Future of Analytic Computing

Speaker: Brian Gentile, Chairman and CEO at Jaspersoft
Once you figure out how to capture every bit of data possible, then what? When it comes to business decisions, the first step is to stop thinking about 'data' and 'big data' as two different things. The smartest companies are the ones who leverage all their data to make intelligent, timely decisions. At Jaspersoft, Brian gets to see the best, and worst, business intelligence practices of companies around the world. In this session he will share some industry-wide insights that you can apply to your own unique challenges.

Are You Flexible Enough? Thoughts on the Future of Big Data and the Cloud

Gurvinder Singh Ahluwalia, CTO for Cloud Computing, West US Region at IBM and Zorawar Biri Singh, Technology Thought Leader
Is there a "right" role for the cloud in your company's future? Are there too many risks and costs associated with public clouds? Is hybridization of on premises and cloud a viable strategy? In this session, we will explore the pros and cons of cloud architecture as it relates to your big data projects.

Big Data Analytics – Realize the Investment from Your Big Data Clusters

Speaker: Mark Davis, Principal Engineer at Dell
The term "big data" seems to be everywhere these days. With the ever growing number of attendees at big data and Hadoop events, it’s clear big data is here to stay. But what does that mean for the analytics market, and how does big data fit into the picture? This session, featuring Mark Davis, Sr. Product Architect at Dell, will explore what big data means in a practical sense to the IT department. It will also explore the many ways that big data affects an organization’s picture of performance. Plus, see how big data analytics, using technologies like Cassandra and Hadoop, will converge with traditional business intelligence to create a complete picture of the enterprise's information assets, thereby giving the business a complete and insightful view of its operational efficiency.

Big Architectures for Big Data

Speaker: Eric Lubow, CTO and Co-founder at SimpleReach
Having many different technologies within an organization can be problematic for developers and operations alike. Structuring those systems into discrete modules not only abstracts away a lot of the complexity of a heterogeneous architecture, it also allows the evolution of systems using common access and storage patterns. This session will discuss how to think about, architect, and maintain a service architecture for a big data system.

Buy It Now! Cassandra at eBay

Speaker: Jay Patel, Technical Architect at eBay
This session will cover various use cases for Cassandra at eBay. It’ll start with overview of eBay’s heterogeneous data platform comprised of SQL & NoSQL databases, and where Cassandra fits into that. For each use case, Jay will go into detail of system design, data model & multi-datacenter deployment. To conclude, Jay will summarize the best practices that guide Cassandra utilization at eBay.

Can't we all just get along? MariaDB and Cassandra

Speaker: Colin Charles, Chief Evangelist at Monty Program Ab
The Cassandra Storage Engine allows access to data in a Cassandra cluster from MariaDB. Learn what the Cassandra Storage Engine is and how to make use of it, how we implemented it using dynamic columns in MariaDB. Also, we'll look at CQL, data and command mapping, use cases and benchmarks.

Cassandra at Instagram

Speaker: Rick Branson, Infrastructure Engineer at Instagram
Cassandra is a critical part of Instagram's large scale site infrastructure that supports more than 100 million active users. This talk is a practical deep dive into data models, systems architecture, and challenges encountered during the implementation process.

Real World, Real Time Data Modeling

Speaker: Tim Moreton, CTO at Acunu Ltd
Data modeling for Cassandra presents a new set of challenges, especially for developers with a background in relational data modeling. And there are added complexities in modeling for analytic applications which need to enable statistical functions over the data, but a good data model, exploiting Cassandra's strengths, can make all the difference to a successful project. This tutorial will examine a number of real-world customer data modeling examples and draw out some hints and tips that will benefit hnot just the Cassandra newbie, but also the more experienced data modeler.

Cassandra on Cloud Foundry

Speakers: Renat Khasanshyn, Founder and CEO at Altoros and Cornelia Davis, Senior Technologist at Pivotal
Coupling Cassandra with a Platform as a Service may significantly simplify the process of deploying Cassandra and applications that utilize it, reduce the cost of managing Cassandra within the organization, and to allow infrastructure service providers a simple path to offering database as a service to their customers. Attendees will learn why and when use Cassandra atop of Cloud Foundry, the history of Cassandra service within Cloud Foundry, the State of Cassandra integration with Cloud Foundry, how to create and manage Cassandra nodes on Cloud Foundry and what to expect in the next 6 months.

Cassandra on Flash: Performance & Efficiency Lessons Learned

Speaker: Matt Kennedy, Big Data Solutions Architect at Fusion-IO
Flash Memory technology, deployed as server-side PCIe or solid state disks (SSDs), is emerging as a critical tool for performance and efficiency in data centers of all scales. This presentation will discuss how the use of Flash impacts Cassandra deployments in terms of configuration, DRAM requirements and performance expectations. Ideas on leveraging C*'s cutting-edge data-center awareness to blend flash and disk storage nodes for cost and workload efficiency will also be shared. Flash media itself will be examined from a physical perspective to understand endurance issues. Data on write amplification under bulk-load and operational workload conditions will be presented to explain the impact to Flash of C*'s Log Structured Merge Tree architecture and the associated compactions. Finally, we will examine strategies to make Cassandra more Flash-aware using both conventional techniques as well as emerging Non-volatile memory (NVM) programming capabilities. Lessons learned from real-world customer deployments will be shared to complete this presentation.

CMB: An Open Message Bus for the Cloud

Speaker: Boris Wolf, Lead Engineer CMB Project at the Comcast Silicon Valley Innovation Center
The Comcast Silicon Valley Innovation Center has developed a general purpose message bus for the cloud. The service is API compatible with Amazon's SQS/SNS and is built on Cassandra and Redis with the goal of linear horizontal scalability. This presentation offers and in-depth look at the architecture of the system and how they employ Cassandra as a central component to meet key requirements. Latest feature enhancements and performance data will also be covered.

Comparing Architectures: Cassandra vs the Field

Speaker: Sameer Farooqui, Freelance Big Data Consultant and Trainer
Have you wondered what actually happens when you submit a write to Cassandra? This vendor agnostic technical talk will cover the internals of the read and write paths of Cassandra and compare it to other NoSQL stores, especially HBase so you can pick the right database for your project. Some of the topics mentioned are consistency levels, memtables/memstores, SSTables/HFiles, bloom filters, block indexes, data distribution partitioners and optimal use cases.

Crossing the Chasm: SQL to NoSQL

Speaker: Isaac Rieksts, Software Development at Health Market Science
Over the past few years, Health Market Science has transitioned from traditional relational databases and enterprise systems to a massively scalable Big Data platform that combines Cassandra and Storm to ingest thousands of feeds of data from the health market industry to produce a single high-quality masterfile. Come hear the "Why?", "What for?" and "How?" of that evolution.

The Darwinism of Data Management Systems: the evolution required to support evolving data types

Speaker: Peter Goldmacher, Managing Director at Cowen and Co.
There is no denying that the world of data has changed dramatically. If your ecosystem does not evolve, you can end up in a barren wasteland. Conversely, making the right choices can lead to a future of bounty. Peter was one of the first analysts to see the potential impact of big data on traditional vendors. In this session, he will share insights that will help you evolve your business in fundamental and lasting ways.

Data as Competitive Advantage in Manufacturing

Speakers: Rich Hammel, Director of Advanced Manufacturing at Brocade and Vivek Ganesan, Principal Architect at Impetus Technologies
Almost 10 years ago in a hotel room in Asia , his first parser was born. That parser and its offspring have supported the development of world-class networking products at Brocade. This discussion will include how big data will change manufacturing, the essential ingredients for success in greenfield big data projects, and what it's like to be obsessed with quality.

Data Driven Retail: How one mega-retailer drove down energy costs across 7,000 stores

Speaker: David Leimbrock, CTO at Riptide IO
How do you keep up with the velocity and variety of data streaming in from all the smart devices that run the physical environments of 7,000+ stores? What about getting analytics that tell you exactly where energy waste is happening in real-time? In this talk, Riptide IO, describes their blueprint for collecting, organizing and deriving real-time operational intelligence from smart devices such as lighting, HVAC, sensors and more. Learn how this retailer gained a dramatic boost to their sustainability program, and solved some of the major bottlenecks in managing countless devices across thousands of stores.

Data Modelers Still Have Jobs: Adjusting For the NoSQL Environment

Speaker: Joe Maguire, Founder at Data Quality Strategies, LLC
Using concrete, real-world examples, the presenter will show the following: How abandoning modeling altogether is a recipe for disaster, even in—or especially in—NoSQL environments; How experienced relational modelers can leverage their skills for NoSQL projects; How the NoSQL context both simplifies and complicates the modeling endeavor.How lessons learned modeling for NoSQL projects can make you a more effective modeler for any kind of project

Deciding Dollars: It's Actually Actuarial

Speaker: Matt Pfeil, Vice President of Customer Solutions at DataStax; Rick Branson, Infrastructure Engineer at Instagram; Adrian Cockcroft, Cloud Architect at Netflix
In today's world, data is growing faster than ever. For online apps, two things matter more than anything else for the database: uptime and performance. The intersection between data growth and online requirements results in interesting technology choices. This panel will discuss the implications - and approaches - to maximize revenue via technology decisions.

Distributed Graph Computing with Titan and Faunus

Speaker: Matthias Broecheler, CTO at Aurelius
This presentation introduces Titan, Faunus, and scalable graph computing in general. We present a case study of how Pearson builds an education social network on top of Titan, Faunus, and Cassandra to support learning in the 21st century. Titan is an open source distributed graph database build on top of Cassandra that can power real-time applications with thousands of concurrent users over graphs with billions of edges. Faunus is an open source global graph processing engine build on top of Hadoop and compatible with Cassandra that can analyze graphs, compute graph statistics, and execute global traversals. Titan and Faunus are components of the Aurelius Graph Cluster which enables scalable graph computation and powers applications in social networking, recommendation engines, advertisement optimization, knowledge representation, health care, education, and security.

Does Big Data Drive Apps, or Do Apps Drive Big Data?

Speaker: Godfrey Sullivan, CEO at Splunk
Godfrey Sullivan has seen it all when it comes to the technology world. Especially when it comes to line of business applications, big data, and big ideas. From his early days at Apple, to CEO positions at Hyperion and Splunk, to board seats at Citrix and Informatica, Godfrey has developed an amazing understanding of the business value and potential in data. In this discussion, we will explore the drivers for big data. What comes first? The applications, the data, or the customers?

Dude, Where's My Tweet? Taming the Twitter Firehose

Speaker: Andrew Noonan, Developer at Gnip
Gnip ingests and must serve out hundreds of millions of social activities every day and social platforms are only growing. This makes the scalability of applications essential for Gnip. Enter Cassandra. Problem solved, right? Not exactly, Gnip's relationship with Cassandra was not all rainbows and unicorns. In this session we will walk you through why we began looking at Cassandra as a data store in the first place and the valuable lessons we with Cassandra that has made it an invaluable part of our infrastructure.

Eventual Consistency != Hopeful Consistency

Speaker: Christos Kalantzis, Engineering Manager of Cloud Persistence Engineering at Netflix
This session will address Cassandra's tunable consistency model and cover how developers and companies should adopt a more Optimistic Software Design model.

Ground Traffic Control - Logistics with Cassandra

Speaker: Jesse Young, Director of Research at Zonar Systems
Come learn about how Zonar Systems uses Cassandra for logistics use cases such as tracking fleets of school buses and other fleet management services. Zonar uses Cassandra because because of its ability to scale horizontally, its continuous availability and operational ease. This talk will cover details about the implementation and our 3 year journey that got us here, including the challenges along the way.

Hardware Agnostic: Cassandra on Raspberry Pi

Speaker: Andy Cobley, Lecturer at University of Dundee
The raspberry Pi is a credit-card sized $25 ARM based linux box designed to teach children the basics of programming. The machine comes with a 700MHz ARM and 512Mb of memory and boots off a SD card, not much power for running the likes of a Cassandra cluster. This presentation will discuss the problems of getting Cassandra up and running on the Pi and will answer the all important question: Why on Earth would you want to do this!?

How Not to Use Cassandra

Speaker: Axel Liljencrantz, Backend Developer at Spotify
At Spotify, we see failure as an opportunity to learn. During the two years we've used Cassandra in our production environment, we have learned a lot. This session touches on some of the exciting design anti-patterns, performance killers and other opportunities to lose a finger that are at your disposal with Cassandra.

High Throughput Analytics with Cassandra

Speakers: Aaron Stannard, Founder and at Marked Up Analytics
Building analytics systems is an increasingly common requirement for BI teams inside companies both big and small, and a feat made even more challenging when analytic results have to be produced in real-time. In this presentation the team from MarkedUp Analytics will show you techniques for leveraging Cassandra, Hadoop, and Hive to build a manageable and scalable analytics system capable of handling a wide range of business cases and needs.

Hindsight is 20/20. MySQL to Cassandra

Speaker: Michael Kjellman, Software Engineer at Barracuda Networks
Abstract A brief intro to how Barracuda Networks uses Cassandra and the ways in which they are replacing their MySQL infrastructure, with Cassandra. This presentation will include the lessons they've learned along the way during this migration.

In Case of Emergency, Break Glass

Speaker: Aaron Morton, Apache Cassandra Committer
The design of Apache Cassandra allows applications to provide constant uptime. Peer-to-Peer technology ensures there are no single points of failure, and the Consistency guarantees allow applications to function correctly while some nodes are down. There is also a wealth of information provided by the JMX API and the system log. All of this means that when things go wrong you have the time, information and platform to resolve them without downtime. This presentation will cover some of the common, and not so common, performance issues, failures and management tasks observed in running clusters. I'll discuss how to gather information and how to act on it. Operators, Developers and Managers will all benefit from this exposition of Cassandra in the wild.

It's Like Your Parents: Relational and NoSQL can co-exist

Speaker: Sean Knapp, Executive Vice President and Chief Product Officer at Ooyala
25% of the world's online video traffic is analyzed by Ooyala. So it's a good thing that when Sean founded the company, he didn't worry about pointless marketing battles between this or that technology. Instead, he set out to solve real problems with no pre-conceived biases. In this session, you will learn how he ended up with a blend of NoSQL, relational, and Hadoop; each playing their part in a new era of application design.

Java and .NET Client Drivers - Cassandra development on fire

Speakers: Michael Figuiere and Patrick McFadin, Principal Solutions Architect at DataStax
Cassandra 1.2 finalizes CQL3 and introduces a new binary protocol for client/server communication. These two components are the foundation of the new line of drivers developed by DataStax. Based on years of experience with Cassandra, these new drivers for Java, .Net and Python come with an asynchronous and lightweight architecture, a clean and simple API, a standardized way to discover nodes and to manage load balancing and fail over. This presentation will give an in depth look at these new drivers which will make your Cassandra-based applications even more robust, efficient and simple to write.

Large Queries in Real-Time for Enterprise

Speaker: Matt Stump, Senior Backend Engineer at KISSMetrics
The ability to manipulate and query very large datasets in realtime is a pressing need for most large data enterprises. Recently, we've seen an explosion of tools such as Impala or Druid, but all of these tools suffer from single points of failure or can't deliver the sub 1 second query times necessary for realtime results. Together we'll explore how to break down these seemingly intractable problems. We'll learn how to build horizontally scalable query engines with Cassandra, capable of sub-second query times across multi-billion row datasets.

Large Scale Data Ingestion, Processing and Analysis: Then, Now & Future

Speakers: Ameet Chaubal, Technologist and Fausto Inestroza, Architect at Accenture
The presentation aims to highlight the challenges posed by large scale and near real-time data processing problems. In past, such problems were solved using conventional technologies, primarily a database and JMS queue. However these solutions had their limits and presented serious problems in terms of scale and redundancy. The new breed of products - a la Cassandra & Kafka, being innately distributed in their design, aim to tackle such challenges in a very elegant manner. The presentation will showcase some of the use cases of this genre from the industry and describe the solutions which have been increasing in their sophistication.

Lightning Talks

John Wrobel, Director at SanDisk
Scaling Cassandra on SSDs

Yuki Morishita, Apache Cassandra Committer & Software Engineer at DataStax
How to Contribute to Cassandra

Nate McCall, Development Lead at Apigee
Adding Your Own Thrift Method in 5 Minutes

Yue Cathy Chang, Sr. Director of Business Development at Impetus
Impetus: Proven Practices in Leveraging Big Data's Competitive Advantage

Eyal Reuveni, Software Engineer at Eventbrite
Cassandra at Eventbrite

Joey Jablonski, Director of Product Management at Dell
Redefining Security for Big Data

Brian Hawkins, Senior Software Engineer at Proofpoint
KairosDB: Bob's Story

Joaquin Casares, Software Engineer at DataStax
Introduction to DataStax Enterprise

C. Scott Andreas, Engineer at Boundary

Praveen Kumar, Engineering Manager at Equinix
Equinix: Big Data Platform

Jeremy Hanna, Senior Support Engineer at DataStax
Troubleshooting Cassandra

Lock it Up: Securing Sensitive Data

Speaker: Sam Heywood, Sr. Director of Products at Gazzang
As adoption of NoSQL solutions like Apache Cassandra grows, so too does the likelihood that organizations will use it to capture and analyze sensitive data. Enterprises that don't take every precaution to protect this data leave themselves exposed to risk of a data breach, and depending on the regulatory nature of the data, fines for noncompliance. This session will discuss how transparent data encryption and advanced key management protect data at-rest and in-flight, so regardless of where the data resides — either on premises or in the cloud -- it remains garbled and unreadable to all people, processes and applications that don't require immediate access. The session will also cover DevOps automation tools that ensure rapid distributed deployment of big data security across thousands of nodes.

Cassandra at eBay Scale

Speakers: Feng Qu, Principal DBA and Anurag Jambhekar, Senior Manager of Database Infrastructure
We have seen rapid adoption of C* at eBay in past two years. We have made tremendous efforts to integrate C* into existing database platforms, including Oracle, MySQL, Postgres, MongoDB, XMP etc.. We also scale C* to meet business requirement and encountered technical challenges you only see at eBay scale, 100TB data on hundreds of nodes. We will share our experience of deployment automation, managing, monitoring, reporting for both Apache Cassandra and DataStax enterprise.

Netflix Open Source Tools and Benchmarks for Cassandra

Speaker: Adrian Cockcroft, Cloud Architect at Netflix
Netflix has updated and added new tools and benchmarks for Cassandra in the last year. In this talk we will cover the latest additions and recipes for the Astyanax Java client, updates to Priam to support Cassandra 1.2 Vnodes, plus newly released and upcoming tools that are all part of the NetflixOSS platform. Following on from the Cassandra on SSD on AWS benchmark that was run live during the 2012 Summit, we've been benchmarking a large write intensive multi-region cluster to see how far we can push it. Cassandra is the data storage and global replication foundation for the Cloud Native architecture that runs Netflix streaming for 36 Million users. Netflix is also offering a Cloud Prize for open source contributions to NetflixOSS, and there are ten categories including Best Datastore Integration and Best Contribution to Performance Improvements, with $10K cash and $5K of AWS credits for each winner. We'd like to pay you to use our free software!

No, Really. These Companies Transformed Their Businesses Using Cassandra

During this panel discussion, IT executives from Constant Contact, Ooyala and HealthCare Anytime will discuss how they are transforming their businesses using the power of big data. The panelists will discuss how the increased expectations of customers and the evolving role of IT caused them to each lead initiatives designed to change their business, the technical challenges they faced and how DataStax Enterprise helped them achieve their goals. Lara Shackelford, vice president of marketing at DataStax will lead the discussion.

No Whistling Required: Cabs, Cassandra, and Hailo

Speaker: Dave Gardner, Senior Engineer at Hailo
Hailo has leveraged Cassandra to build one of the most successful startups in European history. This presentations looks at how Hailo grew from a simple MySQL-backed infrastructure to a resilient Cassandra-backed system running in three data centres globally. Topics covered include: the process of migration, experience running multi-DC on AWS, common data modeling patterns and security implications for achieving PCI compliance.

Optimizing the Public Cloud for Cost and Scalability with Cassandra - The MetricsHub Story

Speaker: Charles Lamanna, MetricsHub Founder & Developer Lead and Ricardo Villalobos, Senior Cloud Architect at Microsoft
MetricsHub is a monitoring and scalability service for public clouds, allowing companies to continuously gather data from their systems and auto-scale their deployments to optimize service costs. Taking advantage of Cassandra rapid ingestion rates, reliable replication model, and easiness of deployment, Metrics Hub can handle billions of datapoints per day. During this session, you will learn about the architecture supporting this service, which combines the power of the PaaS + IaaS on the Windows Azure platform.

The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop

Speakers: Derek Bromenshenkel and Jeff Smoley, Infrastructure Architects at NativeX
NativeX (formerly W3i) recently transitioned a large portion of their backend infrastructure from Microsoft SQL Server to Apache Cassandra. Today, its Cassandra cluster backs its mobile advertising network supporting over 10 million daily active users that produce over 10,000 transactions per second with an average database request latency of under 2 milliseconds. Come hear our story about how we were successful at getting our .NET web apps to reliably connect to Cassandra. Come learn about FluentCassandra, Snowflake, Hector, and IKVM. It's a story of struggle and perseverance, where everyone lives happily ever after.

Practice Makes Perfect: Extreme Cassandra Optimization

Speaker: Albert P Tobey, Tech Lead, Compute and Data Services at Ooyala
Ooyala has been using Apache Cassandra since version 0.4. Our data ingest volume has exploded since 0.4 and Cassandra has scaled along with us. Al will cover many topics from an operational perspective on how to manage, tune, and scale Cassandra in a production environment.

Processing an Avalanche of Medical Records

Speaker: Terrell Deppe, CTO at HealthCare Anytime
HealthCare Anytime provides Web-based portal solutions that assist healthcare organizations in achieving meaningful use, optimized operations, and increased patient and staff satisfaction. During this speaking session, HealthCare Anytime CTO Terrell Deppe will discuss the challenges his company faced when processing an "avalanche" of patient records and how he utilized DataStax's Cassandra-based big data platform to improve their product's performance while reducing costs.

Real-Time Big Data with Storm, Cassandra, and In-Memory Computing

Speakers: DeWayne Filppi, Technical Account Manager at GigaSpaces
This session will describe how to resolve the processing limitations by placing the streaming and data store interfaces in-memory as well, through an in-memory computing platform, and also how to resolve the complexity challenge by implementing a DevOps approach that abstracts all the underlying infrastructure and provides single-click management of all the application tiers and services, on any environment (private/public cloud, bare metal…). And the best news is that all this optimization can be implemented seamlessly, with no code change to your apps.

Real-time Analytics using Cassandra, Spark and Shark

Speaker: Evan Chan, Ooyala
This session covers our experience with using the Spark and Shark frameworks for running real-time queries on top of Cassandra data.We will start by surveying the current Cassandra analytics landscape, including Hadoop and HIVE, and touch on the use of custom input formats to extract data from Cassandra. We will then dive into Spark and Shark, two memory-based cluster computing frameworks, and how they enable often dramatic improvements in query speed and productivity, over the standard solutions today.

(Re)-Building the Social Grid for Global Telcos @ 1/10th the Market Cost

Speaker: Darshan Rawal, VP of Engineering at Openwave Messaging
Darshan Rawal is leading the development of hybrid cloud based messaging products for global Tier 1 Telcos. Darshan has been working in Silicon valley since 2000, building nimble, cost effective products/services, handling millions of users and billions of transactions per day. Previous to Openwave Messaging, Darshan held engineering positions @ SS8 networks, Yahoo, DE Shaw, yp.com and has a M.S in Software Engineering from Carnegie Mellon University.

Remember Me! Session Clustering with Cassandra

Speaker: Les Hazlewood, Co-Founder & CTO of Stormpath and Apache Shiro PMC Chair
In this session Les Hazlewood, the Apache Shiro PMC Chair, will cover Shiro's enterprise session management capabilities, how it can be used across any application (not just web or JEE applications) and how to use Cassandra as Shiro's session store, enabling a distributed session cluster supporting hundreds of thousands or even millions of concurrent sessions. As a working example, Les will show how to set up a session cluster in under 10 minutes using Cassandra. If you need to scale user session load, you won't want to miss this!

Searching for a Needle in a Big Data Haystack

Speaker: Jason Rutherglen, Senior Big Data Engineer at DataStax
The presentation demonstrates how Solr may be used to create real-time analytics applications. In addition, Datastax Enterprise 3.0 will be showcased, which offers Solr version 4.0 with a number of improvements over the previous DSE release. A realtime financial application will run for the audience, and then a detailed look at how the application was built. An overview of Datastax Enterprise Solr features will be given, and how the many enhancements in DSE make it unique in the marketplace.

Splunk + Cassandra = New value to business

Speaker: Eddie Satterly, Chief Big Data Evangelist at Splunk
The session will demonstrate Splunk integration with Cassandra today and discuss more concepts for an integrations to come in the future.

The State of CQL

Speaker: Sylvain Lebresne, Apache Cassandra Committer and Engineer at DataStax
Abstract Since its inception, the Cassandra Query Language (CQL) has grown and matured, resulting in the 3rd version of the language (CQL3) being finalized in Cassandra 1.2. Compared to the legacy Thrift API, CQL3 aims at providing an API that is higher level and more user friendly but still fully assumes the distributed nature of Cassandra and it's storage engine. This presentation will present CQL3, describing the reasoning and goals behind the language as well as the language itself. CQL's relationship with Thrift will be touched on, along with the CQL binary protocol that has been introduced in Cassandra 1.2. This presentation will wrap up by discussing the future of CQL.

Stepping Through the Lifecycle of a Service Offering with Cassandra

Speaker: Chris "Mac" McEniry and Igor von Nyssen, Systems Architect at Sony Network Entertainment
It's a fine line to walk for incorporating new technologies in an organization with 15+ years of legacy software. In this presentation, we'll look at the lifecycle and adoption of Cassandra from a skunkworks project to a full fledged service in a legacy organization.

Stop Crippling Your Business: Fundamental Considerations Everyone Needs to Know

Speakers: Billy Bosworth, DataStax; Vincent Dell’Anno, Accenture; John Whittaker, Dell
All too often, the wrong standardization and/or corporate bureaucracy can absolutely undermine the success of a new project. In this panel discussion we will explore the necessary requirements and flexibility needed to efficiently handle all aspects of your big data infrastructure: from hardware, to software, to implementation strategy. Mistakes in any one of these categories can fundamentally limit your ability to execute, leading to tremendous cost and missed opportunity.

Suicide Risk Prediction Using Social Media and Cassandra

Speaker: Ken Krugler, Big Data Consulting at Scale Unlimited
In this presentation, Ken will describe a portion of an early-phase project that uses social media data (tweets, Facebook posts, etc.) from service personnel to predict suicide rates. There's a lot of motivation to provide better data for military psychologies, since more military wind up taking their own lives than are killed in the line of duty. By analyzing social media data that is voluntarily provided by personnel, plus a predictive analytics system, we can provide assessments that help mental health workers focus their time and energy on the most at-risk individuals. This project uses Cassandra as the scalable storage system for this social media data, which is then analyzed in a distributed environment using Hadoop. The project also uses the Solr search support from DataStax Enterprise to provide ways for users to dig into the underlying data, which is critical when understanding the assigned risk levels.

Taking Bytes from Cassandra Clients

Speaker: Lewis John McGibbney, Post Doctoral at Stanford University
Since early 2012 Gora has been proudly participating as an honorary Incubator post-grad within the ASF. This presentation provides Renato and Lewis' perspective on a phenomenon they refer to as the "big datastore client wars", which is a real life challenge they've discovered whilst attempting to integrate several big data backends (Accumulo, Cassandra, HBase, MySQL, HSQLDB, Amazon's DynamoDB, MongoDB) under one common persistence layer and, in the process, obtain optimal results over Gora operations. They emphasize their approach to addressing this problem by discussing a pluggable Cassandra client infrastructure (Hector-client, Datastax java driver, intravert-ug, etc) adapted specifically for the gora-cassandra module.

Taking Risks Without Risking Your Career

Speaker: Christos Kalantzis, Netflix and Billy Bosworth, DataStax
Let's face it, these key technology decisions are about way more than just technology. The stakes are high, and your career is on the line. Play it safe, and you could become irrelevant. But taking risks is risky business. These two choices are always in tension, and nobody understands this balance better than Netflix, a company with a history of bold, calculated risk taking.

The World's Next Top Data Model

Speaker: Patrick McFadin, Principal Solutions Architect at DataStax
You know you need Cassandra for it's uptime and scaling, but what about that data model? Let's bridge that gap and get you building your game changing app. We'll break down topics like storing objects and indexing for fast retrieval. You will see by understanding a few things about Cassandra internals, you can put your data model in the spotlight. The goal of this talk is to get you comfortable working with data in Cassandra throughout the application lifecycle. What are you waiting for? The cameras are waiting!

Time for a New Relationship - Intuit's Journey from RDBMS to Cassandra

Speaker: Mohit Anchlia, Architect at Intuit
This session talks about Intuit’s journey of our Consumer Financial Platform that is built to scale to petabytes of data. The original system used a major RDBMS and from there, we redesigned to use the distributed nature of Cassandra. This talk will go through our transition including the data model used for the final product. As with any large system transition, many hard lessons are learned and we will discuss those and share our experiences.

Time is Money

Speakers: Jake Luciani and Carl Yeksigian, Quantitative Strategists at BlueMountain Capital Management
This session will focus on our approach to building a scalable TimeSeries database for financial data using Cassandra 1.2 and CQL3. We will discuss how we deal with a heavy mix of reads and writes as well as how we monitor and track performance of the system.

Time-Series Metrics with Cassandra

Speaker: Mike Heffner, Engineer & Co-Founder at Librato
Librato's Metrics platform relies on Cassandra as its sole data storage platform for time-series data. This session will discuss how we have scaled from a single six node Cassandra ring two years ago to the multiple storage rings that handle over 150,000 writes/second today. We'll cover the steps we have taken to scale the platform including the evolution of our underlying schema, operational tricks, and client-library improvements. The session will finish with our suggestions on how we believe Cassandra as a project and its community can be improved.

Virtual Nodes: Rethinking Topology in Cassandra

Speaker: Eric Evans, Apache Cassandra Committer and Chief Architect at OpenNMS
A discussion of the recent work to transition Cassandra from its naive 1-partition-per-node distribution, to a proper virtual nodes implementation.

What were they thinking?

Thomas J. Glazier, Senior Big Data Architect at Accenture
Nothing is more frustrating than knowing you may have the right solution to a problem, only to have the rug pulled out from under you and your project gets derailed for unknown reasons or for unclear understanding of your solution. While you may not see your job as being a business champion, the fact is that you can be a powerful force to stop your company from making the wrong choice.

Understanding how others in the organization see Cassandra and similar technologies can help bridge the gap between engineering and management. In this plenary session, Accenture senior big data architect, TJ Glazier, shows you how managers see competing technologies and how you can help them cut through the noise.

When Bad Things Happen to Good Data: A Deep Dive Into How Cassandra Resolves Inconsistent Data

Speaker: Jason Brown, Senior Software Engineer at Netflix and Apache Cassandra Committer
This talk focuses Cassandra's anti-entrpoy mechanisms. Jason will discuss the details of read repair, hinted handoff, node repair, and more as they aide in reolving data that has become inconsistent across nodes. In addition, he'll provide insight into how those techniques are used to ensure data consistency at Netflix.

Speakers

Gurvinder Singh Ahluwalia, CTO for Cloud Computing, West US Region at IBM

Session: Are You Flexible Enough? Thoughts on the Future of Big Data and the Cloud
Gurvinder Singh Ahluwalia is the CTO for Cloud Computing across LoB’s at IBM covering the West US market. In this role, he serves strategy and architecture leadership in the field to customers, represents 60-70% of the cloud portfolio at IBM, and creates heterogeneous, open source, cloud based solutions and business models for clients. During Distributed Computing era, Guri immersed in design-deploy-run of the second largest routed network in the enterprise world at Motorola. During the Internet/Mobile era, Guri led various multi-party web commerce integrations, charted seminal RFID/M2M strategy, and led as CTO a decision analytics software startup. He compelled a switch in platform direction, landing Embedded Linux on a wireless handheld for the first time by a US OEM at TI, dev to market soft launch in 18 months. During current Cloud Computing era, Guri stewards customers, IBM, and industry through a structural shift to cloud platforms, cloud integration, and cloud formations. He is engaged in open source and standards related communities including OpenStack and CSCC. Guri is a frequent speaker at events including Interop, ARM TechCon, SIIA/AATC, Design West, and The Open Group. Gurvinder Ahluwalia has a MBA from University of Texas Austin, MS Computer Science from Jackson State, and BSc from St. Stephen’s College Delhi.

Billy Bosworth, CEO at DataStax

Billy is responsible for the strategy, explosive growth, and day-to-day operations of DataStax. He has 20 years of experience in the database industry in roles ranging from DBA to senior executive. Prior to DataStax, Billy spent 6 years at Quest Software, a provider of systems management software, where his most recent role was VP and GM of the database business unit. Under his leadership, the industry-leading Quest database business grew from supporting traditional relational databases to a portfolio that now includes tools for cloud, NoSQL, columnar, and Hadoop databases, as well as business intelligence offerings. Prior to Quest, Billy led product teams for Embarcadero Technologies' database productivity solutions. Billy holds a bachelor of science in computer science from the University of Louisville.

Rick Branson, Infrastructure Engineer at Instagram

Session: Cassandra at Instagram
Rick is an Infrastructure Engineer at Instagram where he spends his days building scalable systems to deal with their meteoric growth rates. Rick is a totally impatient dude, so he’s spent the last 10 years obsessed with making complex production systems faster and more reliable. Previously he was a Solutions Architect at DataStax, and has held various engineering roles at Coroutine, Mosaik Solutions, and FedEx.

Matthias Broecheler, CTO at Aurelius

Session: Distributed Graph Computing with Titan and Faunus
Matthias Broecheler is the lead developer of the distributed graph database Titan and has researched large scale graph database systems for more than 5 years. His award-winning research includes high performance index structures and query answering algorithms for graph structured data. In addition, he developed the Probabilistic Similarity Logic (PSL) machine learning framework to analyze and reason about multi-relational data. Matthias holds a Ph.D. in Computer Science from the University of Maryland.

Derek Bromenshenkel, Infrastructure Architect at NativeX

Session: The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop
Derek Bromenshenkel is an Infrastructure Architect at NativeX with over 6 years of software development experience. Recently he's helped NativeX's Mobile Business become more scalable and resilient by utilizing Cassandra for high volume data persistence.

Jason Brown, Senior Software Engineer at Netflix and Apache Cassandra Committer

Session: When Bad Things Happen to Good Data: A Deep Dive Into How Cassandra Resolves Inconsistent Data
Jason Brown is a Senior Software Engineer at Netflix, as well as being an Apache Cassandra Committer. He holds a Master’s degree in Music Composition is searching for time to write a second string quartet.

Evan Chan, Ooyala

Session: Real-time Analytics using Cassandra, Spark and Shark
This session covers our experience with using the Spark and Shark frameworks for running real-time queries on top of Cassandra data.We will start by surveying the current Cassandra analytics landscape, including Hadoop and HIVE, and touch on the use of custom input formats to extract data from Cassandra. We will then dive into Spark and Shark, two memory-based cluster computing frameworks, and how they enable often dramatic improvements in query speed and productivity, over the standard solutions today.

Colin Charles, Chief Evangelist at Monty Program Ab

Session: Can't we all just get along? MariaDB and Cassandra
Colin Charles works at Monty Program Ab, on MariaDB. He lives in Kuala Lumpur, Malaysia and had worked at MySQL since 2005. Before joining MySQL, he worked actively on the Fedora and OpenOffice.org projects. He's spoken at many conferences - linux.conf.au, The MySQL Conference & Expo, foss.in, to name a few.

Ameet Chaubal, Technologist at Accenture

Session: Large Scale Data Ingestion, Processing and Analysis: Then, Now & Future
Ameet Chaubal is a technologist within Accenture's Emerging Technology practice. He has over 15 years of experience in all aspects of computing and lately has been focusing on solving big data storage and computation problems. He is interested in identifying the limits of conventional technology and adapting the bleeding edge solutions to fill the gaps.

Andy Cobley, Lecturer at University of Dundee

Session: Hardware Agnostic: Cassandra on Raspberry Pi
Andy Cobley is a lecturer at the University of Dundee in Big Data, Business Intelligence and Internet programming.

Adrian Cockcroft, Cloud Architect at Netflix

Session: Netflix Open Source Tools and Benchmarks for Cassandra
Adrian Cockcroft is the director of architecture for the Cloud Systems team at Netflix. He is focused on availability, resilience, performance, and measurement of the Netflix cloud platform, and has presented at many conferences, including the Cassandra Summit, QCon/GOTO, Cloud Connect, Velocity, Gluecon and Structure. Adrian is also well known as the author of several books while a Distinguished Engineer at Sun Microsystems: Sun Performance and Tuning; Resource Management; and Capacity Planning for Web Services. From 2004-2007 he was a founding member of eBay Research Labs. He graduated with a BSc in Applied Physics from The City University, London.

Cornelia Davis, Senior Technologist at Pivotal

Session: Cassandra on Cloud Foundry
As a technologist within the Solutions R&D Group of Pivotal, Cornelia Davis is responsible for the architecture, design and construction of solutions that address customer needs through a combination of Pivotal products. A self-proclaimed propeller head, her areas of expertise include XML, Atom, mashups, and programming models and languages, and she is a frequent speaker and writer on RESTful SOA. Her main focus is on web-based, distributed, scale-out software systems with her most recent work centering on Platform as a Service (PaaS) and next generation NoSQL databases. Cornelia studied at California State University, Northridge and Indiana University and holds a B.S. and M.S. in Computer Science.

Mark Davis, Principal Engineer at Dell

Session: Big Data Analytics – Realize the Investment from Your Big Data Clusters
Mark Davis is founder of Kitenga Inc, acquired by Dell Software in 2012, and serves as its Chief Technology Officer. Mr. Davis has 19 years experience in computational linguistics, information retrieval, business development in text analytics, information access and related areas. He founded Inventix, was also a DoD/Intelligence Community researcher at Computing Research Lab in New Mexico for 7 years and was author or co-author on over 20 papers, book chapters and journal articles in cross-language text retrieval, evolutionary optimization, machine learning, user interfaces for information access, and cognitive science. He spent time at Santa Fe Institute studying complex systems theory. He holds a BSEE and MSEE from New Mexico State University, with specializations in information theory and pattern recognition. Mr. Davis holds BSEE and MSEE in information theory from New Mexico State University.

Terrell Deppe, CTO of HealthCare Anytime

Session: Processing an Avalanche of Medical Records
Terrell is the Chief Technology Officer of HealthCare Anytime. Terrell has nearly 30 years of professional software development experience. Nearly half of his career has been spent in healthcare where he has addressed the needs of patient, payers and providers.

Jonathan Ellis (@spyced), DataStax CTO and Project Chair of Cassandra

Session: The State of Cassandra
Jonathan is CTO and co-founder at DataStax. Prior to DataStax, Jonathan worked extensively with Apache Cassandra while employed at Racksace. Prior to Rackspace, Jonathan built a multi-petabyte, scalable storage system based on Reed-Solomon encoding for backup provider Mozy.

Eric Evans, Apache Cassandra Committer and Chief Architect at OpenNMS

Session: A New Order: Topology in Cassandra
Eric has more than a decade of experience in large-scale distributed systems, having held roles in both operations and engineering. An early employee of Rackspace, he implemented a global DNS infrastructure utilizing IP anycast (possibly the first), and a novel data-center-wide IDS for which a patent was awarded. An avid open source hacker, Eric is a developer with the Debian Project and a member of the Apache Cassandra PMC. He resides in Texas where he works on distributed network management for The OpenNMS Group.

Sameer Farooqui, Freelance Big Data Consultant and Trainer

Session: Comparing Architectures: Cassandra vs the Field
Sameer is Marakana's Big Data specialist, with deep industry expertise in the Hadoop domain. For the past five years, he has deployed various clustering software packages internationally to clients including fortune 500, governments, hospitals and banks.

Michael Figuiere, Developer Advocate at DataStax

Session: Java and .NET Client Drivers - Cassandra development on fire
Michael is an engineer and a developer advocate at DataStax where he actively works to improve Cassandra. At ease with both Enterprise Java and lower level technologies, he specializes in distributed architectures and topics such as NoSQL, search engines, and data processing. He often speaks about NoSQL in conferences and User Groups and loves to write about his favorite topics.

DeWayne Filppi, Technical Account Manager at GigaSpaces

Session: Real-Time Big Data with Storm, Cassandra, and In-Memory Computing

Dave Gardner, Senior Engineer at Hailo

Session: No Whistling Required: Cabs, Cassandra, and Hailo
Dave Gardner is part of the services team at Hailo - the taxi app company. He has helped build the Hailo platform that brings together taxi drivers and passengers in 6 cities across the globe and counting, focussing on availability and resilience. Dave has worked with Apache Cassandra since 2010 and is the founder of the Cassandra London user group.

Brian Gentile, Chairman and CEO at Jaspersoft

Session: Analytic Fabric, The Future of Analytic Computing
Brian Gentile brings a successful, 27-year technology track record to Jaspersoft, helping it to become the open source business intelligence market leader, measured by commercial size and growth, production deployments of its software, the size and vibrancy of its community, and product downloads. Brian joined Jaspersoft as its first independent Board member in 2005 and then as CEO in 2007.

Peter Goldmacher, Managing Director and Senior Research Analyst at Cowen and Company

Session: The Darwinism of Data Management Systems: the evolution required to support evolving data types
Peter Goldmacher is a Managing Director and Senior Research Analyst. He joined Cowen in 2004 and has experience working in the computer industry as well as on Wall Street. He started his career at Oracle, working for six years in variety of departments including sales ops, consulting, marketing, and finance, and he has also worked at BMC Software as Director, Corporate Planning and Strategy. Mr. Goldmacher had a brief stint at a start-up company called Callidus Software before joining Merrill Lynch as a research analyst in 1999, where he covered core technology as well as backed up the lead analyst on the large cap, infrastructure and apps software names. Just prior to joining Cowen, Mr. Goldmacher was senior software analyst at IRG Research, a subsidiary of TheStreet.com. He holds a B.A. in Political Science from the University of Colorado.

Rich Hammel, Director of Advanced Manufacturing at Brocade Communications

Session: Data as Competitive Advantage in Manufacturing
Rich has spent the last 18 years leading engineering teams in development of spacecraft, elevators, microfluidics, biotechnology and networking gear. Rich enjoys spending time teaching, coaching, rough housing and learning from his two young sons.

Brian Hawkins, Senior Software Engineer at Proofpoint

Brian Hawkins is a senior software engineer at Proofpoint. Brian has been developing software for over 13 years, has a Masters Degree in Computer Science and has worked with big data projects for the past 5 years. Brian is an avid fan of open source and has created several open source projects, most notably the tablesaw build tool.

Les Hazlewood, Co-Founder & CTO of Stormpath and Apache Shiro PMC Chair

Session: Remember Me! Session Clustering with Cassandra
Prior to forming Stormpath, Les held senior architectural positions at Bloomberg and Delta Airlines and he was former CTO of a software engineering firm supporting educational and government agencies. Les has been actively involved in Open Source for more than 10 years, committing or contributing to projects like the Spring Framework, JBoss, and Apache Shiro. Les has a BS in Computer Science from Georgia Tech, and practices Kendo and studies Japanese when he's not coding.

Mike Heffner, Engineer & Co-Founder at Librato

Session: Time-Series Metrics with Cassandra
Mike Heffner works at Librato ensuring their metrics platform is scalable and performant. At Librato he spends his time wrangling their Cassandra datastore, building distributed backend systems for measurement processing and assisting on development of various API components. Mike enjoys contributing to open source and has been active in the startup and academic environments for many years.

Sam Heywood, Sr. Director of Products at Gazzang

Session: Lock it Up: Securing Sensitive Data
Sam Heywood is senior director of products at Gazzang, responsible for direction of the company's data security and big data solutions. He is a technology industry veteran and strong proponent of the cloud as a platform for developing and delivering enterprise-business solutions. Prior to joining Gazzang, Sam was responsible for driving product roadmap, planning and delivery processes at UShip. He has also held product and marketing management roles at Convio and Tivoli. Sam earned a Bachelors in Computer Sciences from the University of Texas at Austin and an MBA from the McCombs School of Business, UT Austin.

Fausto Inestroza, Architect at Accenture

Session: Large Scale Data Ingestion, Processing and Analysis: Then, Now & Future
Fausto Inestroza is an architect at the Accenture Technology Labs, the dedicated R&D organization of Accenture. His focus is on data platforms, analytical processes and distributed systems.

Joey Jablonski, Product Management at Kitenga Analytics, Dell

Joey is Director of Product Management at Kitenga Analytics and is currently focused on the architecture and strategy for the deployment of complex analytic technologies including the Kitenga Analytic Suite. Joey leads Dell's strategy and product activities related to advanced analytics for unstructured data. Joey's technical interests include Big Data, high performance computing, low-latency networking technologies and information security. Joey has previously held technical and organizational leadership roles at Data Direct Networks, HP and Sun Microsystems.

Anurag Jambhekar, Senior Manager of Database Infrastructure at eBay

Session: Cassandra at eBay Scale
Anurag Jambhekar is Sr. Manager, Database Infrastructure at ebay where his team is responsible for 1000+ heterogeneous databases supporting the high volume and concurrency of ebay.com. He is obsessive about getting the best performance from application, databases and systems. He likes solving problems around scalability and performance and very excited about the innovation happening in the data and database arena in last decade .Currently, he is leading the evolution of next generation of database technologies and practices at ebay. Prior to ebay, he has held database operational roles in Fry's electronics, Wells Fargo and TCS.

Christos Kalantzis, Engineering Manager of Cloud Persistence Engineering at Netflix

Session: Eventual Consistency != Hopeful Consistency
A humanitarian at heart, I try to focus my efforts in creating technology that enhances our lives. I have built and lead teams at YouSendIt and Netflix to scale out their persistence layers, create a cloud file system and to adopt Apache Cassandra as a scalable and highly available data solution. I've worked as a DB2, SQL Server and MySQL DBA for over 10 years and through, sometimes painful, trial and error I have learned the advantages and limitations of RDBMS and when the modern NoSQL solutions make sense. I believe in knowledge sharing, that is why I am a huge advocate of Open Source software. I share my software experience through blogging, pod-casting and mentoring new start-ups. I sit on the tech advisory board of the OpenFund Project which is an Angel VC for European start-ups.

Matt Kennedy, Big Data Solutions Architect at Fusion-IO

Session: Cassandra on Flash: Performance & Efficiency Lessons Learned
Matt Kennedy is a Solutions Architect at Fusion-IO. He works with customers to apply the performance benefits of low-latency server-side Flash technology to NoSQL databases, Hadoop clusters and other Big Data technologies. Matt has been a Cassandra user and occasional contributor since version 0.7 and is a co-organizer of the Cassandra meetup group in the Washington DC area.

Renat Khasanshyn, Founder and CEO, Altoros

Session: Cassandra on Cloud Foundry
Renat Khasanshyn is CEO of Altoros Systems and Venture Partner at Runa Capital. Employing one of the largest pools of NoSQL/Hadoop/Cloud Foundry engineering expertise on the market, Altoros provides vendor-independent professional services, consulting and integration to service providers, software vendors and enterprises. Mr. Khasanshyn is also founder of Apatar, an open source data integration toolset, founder of Silicon Valley NewSQL User Group and co-founder of the Belarusian Java User Group. He studied Engineering at Belarusian National Technical University.

Michael Kjellman, Software Engineer at Barracuda Networks

Session: Hindsight is 20/20. MySQL to Cassandra
Michael Kjellman is a Software Engineer, from San Francisco, working at Barracuda Networks. Michael works across multiple products, technologies, and languages. He primarily works on Barracuda's spam infrastructure and web filter classification data.

Sean Knapp, Executive Vice President and Chief Product Officer at Ooyala

Session: It's like your parents: Relational and NoSQL can co-exist
Sean Knapp is a co-founder of Ooyala. As Executive Vice President and Chief Product Officer, he oversees all product and engineering and helps to define and execute Ooyala's product vision. In his previous role as Ooyala's CTO, Sean led the development of Ooyala’s award-winning scalable, efficient and powerful video solutions. As CTO, he shaped Ooyala's technical strategy and roadmap and led technical collaborations with partners, earning Ooyala recognition as the leading innovator in the video space and creating new business opportunities for Ooyala around the world.

Ken Krugler, Big Data Consulting at Scale Unlimited

Session: Suicide Prevention Using Social Media and Cassandra
Ken is the President of Scale Unlimited, a consulting and training company for big data processing and web mining problems, using Hadoop, Cascading, and Solr.

Praveen Kumar, Engineering Manager at Equinix

Engineering Manager at Equinix
Praveen developed Equinix's big data strategy and is currently leading architecture and the roll-out of Equinix – Big Data Platform. Praveen has 14 years of experience in building high quality distributed software systems requiring high performance, scalability and short response time.

Charles Lamanna, MetricsHub Founder and Developer Lead at Microsoft

Session: Optimizing the Public Cloud for Cost and Scalability with Cassandra - The MetricsHub Story
Charles currently works on the Windows Azure monitoring team to define the next generation of cloud monitoring and management. Charles was a Responsible for technical and business areas for MetricsHub. He was a member of founding team and developed the company from idea stage, to revenue and then to exit. MetricsHub was acquired by Microsoft on March 4th, 2013. The premium MetricsHub product was offered as a no charge service following the acquisition.

Sylvain Lebresne, Apache Cassandra Committer and Engineer at DataStax

Session: The State of CQL
Sylvain works at DataStax on the Apache Cassandra project for which he is a core contributor, committer and PMC member. He has, amongst other things, played a prominent role in the development of CQL3 and its binary protocol. In a former life, he obtained a Ph.D. in Computer Science from Paris 7's University.

David Leimbrock, CTO at Riptide IO

Session: Data Driven Retail
David is responsible for the technology vision and software engineering at Riptide IO. Dave is passionate about making software that customers love to use. He has spent the past 10 years leading engineering teams that are focused on communicating with and managing intelligent machines. Previously, Dave worked in the Emerging Technology Group at Cisco Systems - developing solutions for their Smart+Connected Buildings Group.

Axel Liljencrantz, Backend Developer at Spotify

Session: How Not to Use Cassandra
Axel Liljencrantz is a backend infrastructure engineer at Spotify, where he works with improving the scalability and stability of the Spotify backend. Axel enjoys holding long-winded monologues on almost any subjects except about himself.

Eric Lubow, CTO & Co-Founder at SimpleReach

Session: Big Architectures for Big Data
Eric Lubow, CTO of SimpleReach, began his career building secure systems at Guardian Digital before joining Conductor.com, where he developed and ran their ad systems. After graduating from Rutgers University with a degree in Information Systems, Eric did multiple tours with the U.S. Army including serving in Iraq as a Lieutenant and Albania as a Captain. He is a board advisor for the IronMatt.org charity for pediatric brain tumors, a mixed martial artist, motorcyclist, and seasoned skydiver.

Jake Luciani, Apache Cassandra Committer and Quantitative Strategist at BlueMountain Capital Management

Session: Time is Money
Jake Luciani is a committer on Apache Cassandra and Apache Thrift. He works at BlueMountain Capital in NYC building a next generation market data database on Cassandra.

Joe Maguire, Founder at Data Quality Strategies, LLC

Session: Data Modelers Still Have Jobs: Adjusting For the NoSQL Environment
Now in his fourth decade in the computer industry, Joe Maguire is an analyst and consultant specializing in data management and requirements analysis. His hard-won perspective is informed by broad experience including twelve years in product development for software vendors (Digital, Lotus, Microsoft, Bachman Information Systems); thirteen years as a consulting modeler, requirements analyst, and information architect for clients (ranging from small startups to Fortune-10 behemoths); and six years as an industry analyst (including stints at Burton Group, Gartner, and O’Kelly Associates), specializing in best practices in data management. He is influential in the Data Quality movement and currently serves as a co-chair of the MIT Chief Data Officer & Information Quality Symposium (July, 2013). He is a much-published author whose books have been praised by a wide range of media outlets including The Mathematica Journal, The Data Access Newsletter, The Boston Sunday Globe, and National Public Radio. He is co-author of Mastering Data Modeling: A User Driven Approach (Addison-Wesley, 2000), and is the invited guest author of a chapter called “What Every CDO Should Know About Big Data” in a book about the Chief Data Officer role, expected soon from MIT Press.

Chris "Mac" McEniry, Systems Architect, Software Architect at Sony Network Entertainment

Session: Stepping Through the Lifecycle of a Service Offering with Cassandra
Mac has been a Systems Architect inside of the operations team at Sony Network Entertainment for the last 3 years. He's spent 13 years building online presences, keeping them running, and helping others get their jobs done. In his spare time, he runs the DefCon network.

Patrick McFadin, Principal Solutions Architect at DataStax

Session: The World's Next Top Data Model
Patrick McFadin is a Principle Solutions Architect at DataStax, the commercial company behind Apache Cassandra. Prior to his work at DataStax he was the Chief Architect at Hobsons, an education services company. He obtained a BS in Computer Engineering from Cal Poly, San Luis Obispo and holds the distinction of being the only recipient of a medal (as anyone can find out) for hacking while serving in the US Navy.

Lewis John McGibbney, Post Doctoral at Stanford University

Session: Taking Bytes from Cassandra Clients
Legislative Informatics PhD. from Glasgow Caledonian University, Glasgow, Scotland. Currently a Post Doctoral Research Scholar within Civil and Environmental Engineering at Stanford University. Member of the Apache Software Foundation, V.P Apache Gora, PMC Apache Nutch, PMC Apache Any23. Previous mentor in Google Summer of Code 2012 program. OASIS Legal Document ML TC member. Lewis currently participates in several open source projects and standards initiatives. Approaching engineering and data problems from a construction view point has provided an interesting journey so far. Lewis has spoken at conferences and workshops on informatics, open/linked/big data and computing in civil engineering.

Renato Javier Marroquín Mogrovejo, Professor at Universidad Catolica San Pablo in Arequipa, Peru

Session: Taking Bytes from Cassandra Clients
Computer Science Master by the Pontifical University of Rio de Janeiro with the thesis titled "Experimental Statistical Analysis of MapReduce Jobs". Computer Science Professor at Universidad Catolica San Pablo in Arequipa, Peru. Google Summer of Code 2012 participant, Apache Gora PMC Memeber and Committer, Open Source and Big Data Enthusiast. Renato has spoken at both open source and academic conferences.

Aaron Morton, Apache Cassandra Committer

Session: In Case of Emergency, Break Glass
Aaron Morton is a Freelance Developer based in New Zealand, and a Committer on the Apache Cassandra project. In 2010 he gave up the RDBMS world for the scale and reliability of Cassandra. He now spends his time advancing the Cassandra project and helping others get the best out of it.

Tim Moreton, CTO at Acunu Ltd

Session: Real World, Real Time Data Modeling
As a co-founder and CTO of Acunu Tim was instrumental in delivering Acunu Analytics, the leading Cassandra-based operational intelligence platform. Tim was previously a senior member of the technical team at Tideway (now BMC), where he led the creation of solutions for managing data centers at Fortune 500 clients. Previously he was CEO of a consultancy delivering data solutions for the aviation sector. He holds a PhD in Computer Science from Cambridge University (UK).

Andrew Noonan, Software Engineer at Gnip

Session: Dude, Where's My Tweet? Taming the Twitter Firehose
Andrew is a software engineer with Gnip in Boulder, Colorado. He has helped the high powered engineering team at Gnip do real-time filtering and provide historical access to the entire twitter corpus for a little over a year now. Andrew was part of the team that first began stress testing cassandra with the large amounts of data that they see and experienced first hand the fact that it is not always rainbows and unicorns when it comes to shinny new technologies like Cassandra.

Jay Patel, Technical Architect at eBay

Session: Buy It Now! Cassandra at eBay
Jay is a hands-on architect within Data Infrastructure & Services at eBay. He architects data storage behind eBay.com, which can scale up to hundred billion database calls every day. He is currently leading the evaluation of NoSQL and adoption of Cassandra for various eBay projects, and is developing NoSQL knowledge bases and best practices. In the not too distant past, he built the first version of eBay’s cloud manager platform. He also developed various social apps as well as SOA and Hadoop prototypes. Before joining eBay, Jay worked on various projects at IBM, and built a Unified Messaging System at a telecom startup. Jay holds an M.S. in Computer Engineering from San Jose State University, and is currently learning cool machine learning techniques at Stanford.

Stefan Piesche, Chief Technology Officer at Constant Contact

Session: Aligning Technology Infrastructure With Horizontal Business Growth
As Chief Technology Officer, Piesche focuses on evaluating new technologies to determine how they can help expand the Constant Contact's products and service architecture to accommodate future growth and increased performance. To do this, he combines technology vision with practical delivery capabilities and engineering best practices. Prior to Constant Contact, Piesche spent eight years at the Cobalt Group, where he was executive director of enterprise architecture and chief architect. While there, he developed and drove corporate technology strategy and provided technical direction, design, and architecture for all product development.

Feng Qu, Principal DBA at eBay

Session: Cassandra at eBay Scale
Started with Oracle 5. Feng has worked with RDBMS in past 20 years at various high tech companies, including DoubleClick, Yahoo, Intuit and eBay. In recent years, he likes NoSQL technology and quickly became a huge Cassandra fan while working at eBay. Now he is responsible for companywide Cassandra projects from design to deployment.

Darshan Rawal, VP of Engineering at Openwave Messaging

Session: (Re)-Building the Social Grid for Global Telcos @ 1/10th the Market Cost with Cassandra
Darshan Rawal is leading the development of hybrid cloud based messaging products for global Tier 1 Telcos. Darshan has been working in Silicon valley since 2000, building nimble, cost effective products/services, handling millions of users and billions of transactions per day. Previous to Openwave Messaging, Darshan held engineering positions @ SS8 networks, Yahoo, DE Shaw, yp.com and has a M.S in Software Engineering from Carnegie Mellon University.

Jason Rutherglen, Senior Big Data Engineer at DataStax

Session: Searching for a Needle in a Big Data Haystack
Jason works at Datastax as a senior Big Data engineer architecting, developing and supporting the Datastax Enterprise product line which includes Solr integrated with Cassandra. His career has involved an array of technologies including search, Hadoop, Hive, mobile phones, cryptography, and natural language processing. Jason has been developing solutions with Lucene and Solr for more than 7 years and is a co-author of 'Programming Hive' and 'Lucene and Solr: The Definitive Guide' from O'Reilly. Jason frequently gives tutorials and speaks at conferences such as Strata, Cassandra Summit, ApacheCon and others.

Eddie Satterly, Chief Big Data Evangelist at Splunk

Session: Splunk + Cassandra = New value to business
Mr. Satterly has served in a variety of roles including developer, engineer, architect and CTO over his 23 year career for Fortune 500 companies. He has been a long time Splunk fan and user. More recently he was able to revolutionize the way Expedia delivers their core web applications that resulted in improved user experience. At Expedia, he created a highly scalable and flexible Splunk environment and as a result was able to retire 35 other systems. With Splunk, he saved $14M in the first 6 months Splunk was installed and avoided many more $MM in outages. He is a recognized expert in the field of big data and has presented at many conference globally on the topic. Eddie has a BS Comp Sci from Indiana University.

Zorawar Biri Singh

Session: Are You Flexible Enough? Thoughts on the Future of Big Data and the Cloud
Zorawar Biri Singh was recently SVP and GM, HP Converged Cloud and HP Cloud Services. In this role, he oversaw HP’s global cloud portfolio & strategy and held overall responsibility for HP public cloud infrastructure, platform services, cloud solutions and cloud ecosystem. Singh has 22 years of global leadership experience as an executive and serial entrepreneur in enterprise IT, online advertising, e-commerce and communications. He has an extensive products and operations background in cloud services & platforms, distributed computing, infrastructure and open source software like OpenStack, data management/analytics, SaaS applications, and data center virtualization/scale-out infrastructure.

Jeff Smoley, Infrastructure Architect at NativeX

Session: The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop
Jeff Smoley is an Infrastructure Architect at NativeX with over 13 years of software development experience ranging from VB6 desktop apps to high transaction ASP .Net/WCF web applications. His focus is on building resilient and maintainable applications. Most recently he's helped NativeX's Mobile Business become more scalable and resilient by utilizing Cassandra for high volume data persistence.

Manish Sood, CEO & Founder at Reltio

Session: Adaptive Data Convergence for Life Sciences
Manish is the founder and CEO at Reltio, a software startup that transforms related data into business opportunities by providing applications that are powered by the convergence of data from multiple data sources that include internal applications, 3rd party data feeds and social media. Reltio data convergence applications enable business users to understand markets, drive revenue and reduce risk through actionable, real-time and predictive insights. Manish Sood is responsible for the overall direction and management of Reltio. Prior to founding Reltio, Manish led Product Strategy and Management for the Master Data Management (MDM) product at Informatica and Siperian. Siperian, a company that defined the MDM space and was acquired by Informatica in January of 2010. Under Manish’s product leadership, Siperian (and later Informatica) became the industry-standard MDM platform across verticals including life sciences, financial services, retail, manufacturing, healthcare and government. In the past, Manish has successfully architected and delivered complex data management systems for various fortune 100 clients. Manish is a frequent speaker at data management conferences, webinars and industry events.

Matt Stump, Senior Backend Engineer at KISSMetrics

Session: Large Queries in Real-Time for Enterprise
Senior Backend Engineer for KISSMetrics, DataStax MVP 2012, and author of the forthcoming "Cassandra: A Beginner's Guide" published by Packt.

Godfrey Sullivan, CEO at Splunk

Session: Does Big Data Drive Apps, or Do Apps Drive Big Data?
Godfrey R. Sullivan has served as Splunk's President, Chief Executive Officer and a member of the board of directors since 2008, and as the Chairman since 2011. Prior to joining Splunk, Mr. Sullivan was with Hyperion Solutions Corporation, a performance management software company acquired by Oracle Corporation, from 2001 to 2007, where he served in various executive roles, most recently as President and Chief Executive Officer, and as a member of the board of directors from 2004 until 2007. Mr. Sullivan has served as a member of the board of directors of Citrix Systems, Inc., an enterprise software company, since 2005 and Informatica Corporation, a data integration software provider, since 2008. Mr. Sullivan holds a B.B.A. from Baylor University.

Albert Tobey, Tech Lead, Compute and Data Services at Ooyala

Session: Practice Makes Perfect. Extreme Cassandra Optimization.
Al Tobey is Tech Lead of the Compute and Data services team at Ooyala. His team develops and operates Ooyala's internal big data platform, consisting of Apache Cassandra, Hadoop, and internally developed tools. When not in front of a computer, Al is a father, husband, and trombonist.

Ricardo Villalobos, Senior Cloud Architect at Microsoft

Session: Optimizing the Public Cloud for Cost and Scalability with Cassandra - The MetricsHub Story
Ricardo Villalobos is a seasoned software architect with more than 15 years of experience designing and creating enterprise software applications, particularly those implementing patterns that support widely distributed scenarios. Holding several Microsoft certifications, as well as a master’s degree in business administration, Ricardo has been recently helping companies worldwide in the process of building, migrating and strategizing solutions that run on the Microsoft Windows Azure cloud. You can read his blog at blog.ricardovillalobos.com.

Igor Von Nyssen, Software Architect at Sony Entertainment

Session: Stepping Through the Lifecycle of a Service Offering with Cassandra
Igor is a senior architect at Sony Network Entertainment. He spent most of his career building eCommerce products and systems. In the last two years he has been focusing his projects and teams to deliver services that can support continuous availability. Since he started working with Cassandra his nightmares about system downtime are on the retreat.

Boris Wolf, Lead Engineer CMB Project at the Comcast Silicon Valley Innovation Center

Session: CMB: An Open Message Bus for the Cloud
Boris has more than 10 years of experience working for various technology startups in the Bay Area and is currently the engineering lead for the CMB project at the Comcast Silicon Valley Innovation Center.

John Wrobel, Director at SanDisk Enterprise Division

Session: Scaling Cassandra on SSDs
John Wrobel joined SanDisk when the company acquired Schooner Information Technology. As director for enterprise solutions, he is instrumental in sales operations and market development activities. Prior to joining Schooner, he served as a software sales manager at IBM, where he was responsible for meeting the sales goals of the business through effective planning and budgeting. Wrobel holds a B.S. in Computer Engineering from Lehigh University.

Carl Yeksigian, Quantitative Strategist at BlueMountain Capital Management

Session: Time is Money
Carl graduated from Carnegie Mellon University with a BS in Computer Science. He works as a Quantitative Strategist at BlueMountain Capital in New York City, where he has spent the past year working with Cassandra and comparing other big data solutions to Cassandra.

Jesse Young, VP of Software Development at Zonar Systems

Session: Ground Traffic Control - Logistics with Cassandra
Jesse Young is VP of Software Development at Zonar Systems, a leading provider of fleet telematics, remote engine diagnostics and mobile onboard computing. He has spent his career learning and growing with Zonar, helping scale systems, storage, software and teams of talented engineers.

Sponsors

PLATINUM SPONSOR

Accenture

Accenture is a global management consulting, technology services and outsourcing company, with approximately 261,000 people serving clients in more than 120 countries. Combining unparalleled experience, comprehensive capabilities across all industries and business functions, and extensive research on the world’s most successful companies, Accenture collaborates with clients to help them become high-performance businesses and governments. The company generated net revenues of US$27.9 billion for the fiscal year ended Aug. 31, 2012.

SILVER SPONSORS

FusionIO

Fusion-io delivers the world’s data faster. Our Fusion ioMemory platform accelerates databases, virtualization, cloud computing, big data, and the applications that drive our economy and our daily lives.

Integrating hardware and software to overcome the limitations of legacy architectures and specialized hardware, Fusion ioMemory accelerates businesses from the smallest e-tailers to the world’s largest data centers, social media leaders, and Fortune Global 500 businesses. Our persistent, high capacity ioMemory platform leverages flash memory to significantly increase datacenter efficiency, with enterprise grade performance, reliability, availability, and manageability.

Splunk

Splunk Inc. (NASDAQ: SPLK) provides the engine for machine data™. Splunk® software collects, indexes and harnesses the machine-generated big data coming from the websites, applications, servers, networks and mobile devices that power business. Splunk software enables organizations to monitor, search, analyze, visualize and act on massive streams of real-time and historical machine data. More than 5,200 enterprises, universities, government agencies and service providers in over 90 countries use Splunk Enterprise to gain Operational Intelligence that deepens business and customer understanding, improves service and uptime, reduces cost and mitigates cybersecurity risk. Splunk Storm®, a cloud-based subscription service, is used by organizations developing and running applications in the cloud.

SPONSORS

Acunu

Acunu tackles the complexity and efficiency challenges posed by Big Data applications. Acunu was founded in 2009 and is backed by some of Europe’s top VC funds. Started by a unique mix of researchers and engineers from Cambridge and Oxford Universities, Acunu has offices in London (near Old Street, London’s “Silicon Roundabout”) and California.

Altoros

Altoros Systems is a global software delivery acceleration specialist that provides focused product engineering to technology companies and start-ups. Areas of expertise include NoSQL / distributed computing solutions, Microsoft .NET, Java, Ruby on Rails, Adobe Flex/Flash, HTML5, and mobile technologies (Windows Phone, Apple iOS and Android).

Altoros played a key role in helping its customers create over $500M of enterprise value and raise $120M+ in venture funding from Accel Partners, Andreessen Horowitz, Matrix Partners, North Bridge Venture Partners, Facebook Fund, Khosla Ventures, Greycroft Partners, BV Capital, Split Rock Partners, Sutter Hill Ventures, and First Round Capital.

Boundary

Boundary provides a new kind of application monitoring for new IT architectures: one-second app visualization, cloud-compatible, and only a few minutes from setup to results. Boundary is a privately-held company based in San Francisco, California with venture funding from Lightspeed Venture Partners and Scale Venture Partners. For more information on Boundary visit us on the web at www.boundary.com or on www.twitter.com/boundary.

Compuware

Compuware Corporation, the technology performance company, makes technology make a difference by providing software, experts and best practices to ensure technology works well and delivers value. Compuware solutions make the world's most important technologies perform at their best for leading organizations worldwide, including 46 of the top 50 Fortune 500 companies and 12 of the top 20 most visited U.S. web sites. Learn more at: http://www.compuware.com.

Dell

Dell Inc. (NASDAQ: DELL) listens to customers and delivers innovative technology and services that give them the power to do more. For more information, visit www.dell.com and www.quest.com.

Gazzang

Gazzang provides data security solutions and expertise to help enterprises protect sensitive information and maintain performance in big data and cloud environments. Our technology enables SaaS vendors, health care organizations, financial institutions, public sector agencies and more to meet regulatory compliance initiatives, secure personally identifiable information and prevent unauthorized access to sensitive data and systems. The company is headquartered in Austin, Texas and backed by Austin Ventures and Silver Creek Ventures.

Gigaspaces

GigaSpaces Technologies is the pioneer of a new generation of application virtualization platforms and a leading provider of end-to-end scaling solutions for distributed, mission-critical application environments, and cloud enabling technologies.

Hundreds of organizations worldwide are leveraging GigaSpaces’ technology to enhance IT efficiency and performance, including Global Fortune 500 companies, from top financial service enterprises, e-commerce companies, online gaming providers, and telecom carriers.

Impetus

Impetus provides Big Data thought leadership, consulting and services to the Enterprise market. Our experience extends across the Big Data ecosystem including Hadoop, Cassandra, NoSQL and MPP systems, Predictive Analytics, Machine learning and Visualization.

Jaspersoft

Jaspersoft empowers millions of people every day to make better decisions faster by bringing them timely, actionable data inside their apps and business processes. Its embeddable, cost-effective reporting and analytics platform allows anyone to quickly self serve and get the answers they need and scales architecturally and economically to reach everyone. Thanks to a community that is hundreds-of-thousands strong, Jaspersoft’s commercial open source software has been downloaded millions of times and is used to create the intelligence inside hundreds of thousands of apps and business processes. Jaspersoft is a privately held company with offices around the world.

LucidWorks

LucidWorks transforms the way people access information to enable data-driven decisions. LucidWorks is the only company that delivers enterprise-grade search development platforms built on the power of Apache Lucene/Solr open source search. Employing one quarter of the Core Committers to the Apache Lucene/Solr project, LucidWorks is the largest supporter of open source search in the industry. LucidWorks Search delivers unmatched scalability to billions of documents, with sub-second query and faceting response time. LucidWorks Big Data tightly integrates key Apache projects needed to build and deploy applications requiring access to multi-structured data. Customers include AT&T, ADP, Sears, Ford, Verizon, Cisco, Zappos, Raytheon, The Guardian, The Smithsonian Institution, The Motley Fool, Qualcomm, Taser, eHarmony and many other household names around the world. LucidWorks' investors include Shasta Ventures, Granite Ventures, Walden International and In-Q-Tel. Learn more about the company at http://www.lucidworks.com/.

Microsoft

Founded in 1975, Microsoft (Nasdaq “MSFT”) is the worldwide leader in software, services and solutions that help people and businesses realize their full potential.

SanDisk

SanDisk Corporation (NASDAQ: SNDK) is a global leader in flash memory storage solutions, from research and development, product design and manufacturing to branding and distribution for commercial and retail channels. Since 1988, SanDisk's innovations in flash memory and storage system technologies have provided customers with new and transformational digital experiences. SanDisk's diverse product portfolio includes flash memory cards and embedded solutions used in smart phones, tablets, digital cameras, camcorders, digital media players and other consumer electronic devices, as well as USB flash drives and solid-state drives (SSD) for the computing market. SanDisk's products are used by consumers and enterprise customers around the world.

MEDIA SPONSORS

O'Reilly

O'Reilly Media spreads the knowledge of innovators through its books, online services, magazines, and conferences. Since 1978, O'Reilly Media has been a chronicler and catalyst of cutting-edge development, homing in on the technology trends that really matter and spurring their adoption by amplifying "faint signals" from the alpha geeks who are creating the future. An active participant in the technology community, the company has a long history of advocacy, meme-making, and evangelism.

Planet Cassandra

Certification

Big Data Certification

During the Summit DataStax Ac*ademy certification will be available to all attendees for free. Become a DataStax Certified Cassandra Developer and differentiate your career.

DataStax Ac*ademy certification opens doors for you in the big data world and marks you as a valuable expert to anyone looking for an Apache Cassandra specialist.

Whether you are sharpening your professional big data skills or contributing as the most proficient Cassandra engineer on your team, DataStax Ac*ademy certification provides the credentials you need when implementing mission critical enterprise software utilizing Apache Cassandra.

As the company behind Cassandra, DataStax is the only organization capable of properly administering this certification. Based on open source Cassandra, we will validate you as a trusted trailblazer in this leading technology.

DataStax certifications contain 60 questions timed to 90 minutes maximum, delivered on-line by Knowledge Advisors.

Introducing our first DataStax Ac*ademy Certification:
DataStax Certified Cassandra Developer (DSCCD)

Our initial certification test is for Cassandra 1.1 only. Shortly after we will release the DataStax Certified Cassandra 1.2 Developer test, followed by the DataStax Certified Cassandra 1.2 Administrator test. During the C* Summit and for 24 hours after this, our Cassandra 1.1 exam is free of charge, after which all tests are included in the cost of training.

Cassandra Summit 2013 – Inaugural Executive Track

From Billy Bosworth, Chief Executive Officer, DataStax

Executives have repeatedly told me that they need more business-level information on how Cassandra, the leading enterprise NoSQL database [1] [2], is changing the market. I listened, and in response, we have created the first-ever Executive Track at our annual Cassandra Summit. While our summits have always been incredible learning venues, they were focused exclusively on a technical audience. But with Cassandra assuming a more prominent role in the enterprise, it’s time to take an in-depth look at both the business opportunities, and the personal career impact, that such a change can bring.

You will hear from an impressive roster of speakers who bring real-world experience to the discussions, such as: Godfrey Sullivan, CEO of Splunk; Vincent Dell’Anno, big data lead at Accenture; and Christos Kalantzis of Netflix.

We will cover today’s hottest topics ranging from Cassandra's co-existence with relational databases like Oracle and MySQL, to taking a look at the personal opportunities and risks associated with betting your business on a disruptive technology. You can see the full session lineup below.

In addition to a great schedule, Accenture, as the Platinum Sponsor of the Summit, will be hosting a VIP luncheon that will provide an excellent time for free-flowing conversations. You will also be invited to attend the keynote and Accenture plenary session as well as topic-centric discussions.

So this year, I hope you choose to join your technical team at the Summit and be my guest for these great discussions. Feel free to email me with any questions or topics that you would like to make sure we cover.

I look forward to seeing you at the summit!

Billy Bosworth
Chief Executive Officer, DataStax
bbosworth@datastax.com
@billy_bosworth

Summit Executive Track Overview

  • Keynote Address
    Jonathan Ellis, Apache Cassandra Chairman, and DataStax Co-Founder
  • Getting to The Right Overall Data Architecture
    Accenture
  • Does Big Data Drive Applications, or Do Applications Drive Big Data?
    Godfrey Sullivan, CEO of Splunk
  • Taking Risks Without Risking Your Career
    Christos Kalantzis, Netflix
  • It's No Longer an "Either/Or" World: Relational & NoSQL Databases Will Co-Exist
    Sean Knapp, CTO and Co-Founder of Ooyala
  • Stop Crippling Your Business: Fundamental Considerations Everyone Needs to Know
    Vincent Dell’Anno, Accenture; John Whittaker, Dell
  • The Darwinism of Data Management Systems: The Evolution of Data
    Peter Goldmacher, Managing Director at Cowen and Co.
  • Analytic Fabric, The Future of Analytic Computing
    Brian Gentile, CEO of Jaspersoft
  • Are You Flexible Enough? Thoughts on the Future of Big Data and the Cloud
    Zorawar Biri Singh; Gurvinder Singh Ahluwalia, CTO for Cloud Computing, West US Region at IBM
  • VIP Lunch
    Sponsored by Accenture

Accommodation

Nearby hotels:

Sheraton Fisherman's Wharf - $199

2500 Mason Street,
San Francisco, CA 94133
(415) 362-5500
Room Reservations
Google Map

Hyatt Fisherman's Wharf - $219

555 North Point Street
San Francisco, CA 94133
(415) 486-4412
Room Reservations
Google Map

Hilton SF Union Square - $289

333 O'Farrell Street
San Francisco, CA 94102
(415) 771-1400
Room Reservations
Google Map

Quick Information

View the map of Fort Mason

View the mobile version of the schedule here.

Lost? Have questions? Visit the onsite Hospitality Desk located in Festival pavilion next to registration.

WHEN
Tuesday & Wednesday
June 11-12, 2013

WHERE
SAN FRANCISCO, CA

Ft. Mason Festival Pavilion
San Francisco, California

Maps:
Google Maps   Yahoo Maps   Map Quest   Bing
SPONSORS

PLATINUM SPONSOR

Accenture

SILVER SPONSORS

FusionIO Splunk

SPONSORS

Acunu Altoros Boundary Compuware Dell Gazzang Gigaspaces Impetus Jaspersoft LucidWorks Microsoft SanDisk

MEDIA SPONSORS

O'Reilly Planet Cassandra

Apache Cassandra, Cassandra, Apache Hadoop, Hadoop, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission as of 2011. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by DataStax..