Schedule and Sessions

DataStax Accelerate 2020 Agenda

Filter Sessions by Day:

Filter Sessions by Role:

Filter Sessions by Tags:

No Result Found

Tuesday, May 12, 2020

DAY 1: Opening Keynote

Tuesday, May 12, 2020 | | 9:00 AM - 10:45 AM

ArchitectDeveloperOps and DevOps
Keynote Cloud

Chet Kapoor | CEO, DataStax

Catch the NoSQL and Cassandra Wave

The most successful enterprises are using NoSQL and Cassandra to create transformational, real-world outcomes that are unconstrained by scale. Learn how enterprises like FedEx are winning in today’s fiercely competitive markets and see why now is the time for the technology that is powering both the best of the internet and the world’s most and critical modern applications.

Chet Kapoor

Chet Kapoor | CEO, DataStax

Chet Kapoor is Chairman and CEO of DataStax. He is a proven leader and innovator in the tech industry with  more than 20 years in leadership at innovative software and cloud companies, including Google, IBM, BEA Systems, WebMethods, and NeXT. As Chairman and CEO of Apigee, he led company-wide initiatives to build Apigee into a leading technology provider for digital business. Google (Apigee) is the cross-cloud API management platform that operates in a multi- and hybrid-cloud world. Chet successfully took Apigee public before the company was acquired by Google in 2016. Chet earned his B.S. in engineering from Arizona State University.

1 Billion Requests Per Day - Building and Running a Mission-Critical Platform in the Cloud

Tuesday, May 12, 2020 | | 10:55 AM - 11:35 AM

ArchitectOps and DevOps
Apache Cassandra™ Cloud Use Cases

Cristopher Blanco | Senior Manager Software Engineering, Capital One

My team in Capital One runs the mission-critical customer platform on top of Cassandra in AWS. This system is deployed in Active/Active mode in multiple regions in AWS. In this presentation, we will explain how we created the platform from scratch, how we went from zero to hundreds of millions of customers, the performance challenges we faced, and the learnings we got from this journey.

We'll provide details on how we keep the system and OS updated with AMI changes every two months and how we have automated the DevOps process using Ansible, Cloud Formation, and Terraform to make all the infrastructure changes easy.

Cristopher Blanco

Cristopher Blanco | Senior Manager Software Engineering, Capital One

Senior Manager, Software Engineering at Capital One. Leads a team building the next-generation mission-critical Customer Platforms for Capital One using Cassandra, Microservices, and Cloud technologies.

Truth Hurts: How to Migrate your RDBMS Data Model to Apache Cassandra

Tuesday, May 12, 2020 | | 10:55 AM - 11:35 AM

Ops and DevOps
Apache Cassandra™ Cloud

Amanda Moran | Software Engineer, Apple

I just took a DNA test, and it turns out my data model is 100% wrong. This session will focus on how to correctly data model for Apache Cassandra and NoSQL databases. Topics will include:

  • A brief comparison of relational databases and NoSQL databases
  • The benefits of Apache Cassandra
  • Transitioning a relational data model to a Cassandra data model
  • Common issues that can be solved with a good data model

This session is intended for folks new to Cassandra/NoSQL or folks transitioning from operations to a more data engineering and cloud-focused role.

Amanda Moran

Amanda Moran | Software Engineer, Apple

Amanda Moran is a Bay Area-based Software Engineer. Her passion is helping customers, users, and the community be successful. Previously, she worked for HP, Teradata, DataStax, and Apache Trafodion startup Esgyn. Amanda’s an Apache Committer and member of the PMC for Apache Trafodion. She’s worked on customer POCs, executive demos, distributed database cloud deployments, Python coding, data science workshops, has spoken at many conferences, Linux/Hadoop administration, and scripting—a little bit of everything. She has a master’s degree in computer science from Santa Clara University and a BS in biology from the University of Washington. In her spare time, she loves running, hanging out with her dog, and finding reasons to go to Disneyland.

A DIY Guide For Upgrading DataStax Enterprise Flawlessly

Tuesday, May 12, 2020 | | 10:55 AM - 11:35 AM

Ops and DevOps
Cloud DataStax Monitoring

Vijay Arthanari | Senior Engineering Manager, Capital One

Smit Davey | Software Engineer, Capital One

Ganesha Duggirala | Senior Software Engineer, Capital One

When you are managing large Cassandra clusters, you recognize the complexity of performing major version upgrades. For the Rewards team at Capital One, the job was more daunting with the Cassandra database holding a massive volume of critical financial data and servicing customer-facing applications with strict availability requirements. Yet, with meticulous planning and flawless execution, they pulled off a successful upgrade to DataStax Enterprise 6.0 with zero downtime and zero data loss while maintaining the high availability for their customers.

In this talk, Capital One engineers will take you through their upgrade journey, highlighting the challenges they encountered, lessons they learned and the solutions they developed to accomplish the mission. Come learn how to automate your upgrade process, find valuable tips and tricks, and walk away with a do-it-yourself guide for the upgrade!

Vijay Arthanari

Vijay Arthanari | Senior Engineering Manager, Capital One

Vijay Arthanari is a seasoned technology leader experienced in working with containers, big data, distributed databases, and cloud technologies. As a Senior Engineering Manager at Capital One, he currently leads full-stack engineering teams building solutions for their Rewards One loyalty platform. He is a DataStax Certified Professional on Apache Cassandra.

Smit Davey

Smit Davey | Software Engineer, Capital One

Smit Davey is passionate about distributed databases, cloud technologies, automation, and systems architecture. He is a Software Engineer at Capital One experienced in managing DevOps for Cassandra for their Rewards One loyalty platform. His experience with Cassandra is quite unique as the cluster needs to hold credit card transaction data and guarantee fast response times at a bank that is an industry leader in the movement to the cloud. This includes automation and giving back time to the engineers, cost optimization based on cluster behavior, and adopting a Cassandra mindset in the organization.

Ganesha Duggirala

Ganesha Duggirala | Senior Software Engineer, Capital One

Ganesha Duggirala has been a professional software engineer for over 15 years. He loves building awesome applications and solving problems.

Stress Testing Cassandra in Kubernetes with tlp-stress

Tuesday, May 12, 2020 | | 10:55 AM - 11:35 AM

Ops and DevOps
Apache Cassandra™ Kubernetes Monitoring

John Sanda | Consultant, The Last Pickle

tlp-stress is an open source, workload-centric tool that can be used for benchmarking and performance profiling. As users continue to adopt Kubernetes as a platform for running stateful workloads, it is essential to have updated tools for testing Cassandra.

In this talk John Sanda, from The Last Pickle, will introduce a Kubernetes operator for tlp-stress. He will provide a brief overview of what operators are. He will discuss how tlp-stress is configured and the different ways in which it can be run. He will also discuss how the operator provisions a cluster and integrates with Prometheus and Grafana for monitoring. Lastly, he will explore how some Kubernetes features like scheduling can impact performance.

Note that this talk does assume some basic understanding of Kubernetes. The focus of the talk, however, is on the features of the operator and as such does not require an advanced understanding of Kubernetes.

John Sanda

John Sanda | Consultant, The Last Pickle

As a Consultant at The Last Pickle John helps clients understand and solve problems with Apache Cassandra.

Prior to TLP John worked as an engineer at Red Hat leading projects to build management and monitoring systems using Apache Cassandra. He also has experience operating, maintaining, and deploying monitoring systems in Kubernetes.

John enjoys building and extending Cassandra tools for Kubernetes.

Building Apache Cassandra 4.0: Behind the scenes

Tuesday, May 12, 2020 | | 11:00 AM - 11:40 AM

Ops and DevOps
Apache Cassandra™ 4.0 Apache Cassandra™ Microservices

Dinesh Joshi | Apache Committer / Senior Software Engineer, Apache Software Foundation

Building a database is hard. Building a distributed database is harder. Building a distributed database that the industry relies on is even harder. Our goal to build Apache Cassandra 4.0 is to make it rock solid. In this talk, we go behind the scenes to show you how the Apache Cassandra community is building and testing Apache Cassandra 4.0 so that it is the most stable release ever!

Dinesh Joshi

Dinesh Joshi | Apache Committer / Senior Software Engineer, Apache Software Foundation

Dinesh A. Joshi has been a professional Software Engineer for over a decade building highly scalable realtime Web Services and Distributed Streaming Data Processing Architectures serving over 1 billion devices. Dinesh is an active contributor to the Apache Cassandra codebase. He has a Masters degree in Computer Science (Distributed Systems & Databases) from Georgia Tech, Atlanta, USA.

Networking Break and Lunch

Tuesday, May 12, 2020 | 11:40 AM - 1:00 PM

DataStax Enterprise 6.8 - Cloud-Native Apache Cassandra for Data at AI-Scale

Tuesday, May 12, 2020 | | 1:00 PM - 1:40 PM

ArchitectDeveloperOps and DevOps
Apache Cassandra™ Analytics Graph Kubernetes

Jonathan Lacefield | Product Manager, DataStax

Marc Selwan | Sr. Product Manager - Server Team, DataStax

This session highlights all of the recent product enhancements in the DataStax Enterprise product while also giving the audience some insight into what's coming next.

Jonathan Lacefield headshot

Jonathan Lacefield | Product Manager, DataStax

Jonathan is a Product Manager with DataStax focused on the strategy and vision for the DataStax Enterprise Server platform which includes Cassandra, Graph, Analytics, and Search. A 6-year veteran of DataStax, Jonathan has helped dozens of companies ideate, design, develop, and deploy global-scale applications built on DataStax Enterprise.

Marc Selwan headshot

Marc Selwan | Sr. Product Manager - Server Team, DataStax

Marc has been working at DataStax for over 5.5 years doing everything from helping customers build awesome applications with Apache Cassandra and DataStax Enterprise to working with our amazing engineering teams to build some great new features for our database.

Making Cassandra More Resilient

Tuesday, May 12, 2020 | | 1:00 PM - 1:40 PM

ArchitectOps and DevOps
Apache Cassandra™

Sumanth Pasupuleti | Senior Software Engineer, Netflix

Cassandra is well known for its ability to handle scale. Through years of published benchmarking results as well as experience from a number of production deployments across several companies, it is clearly evident that Cassandra is the leader among NoSQL databases in terms of throughput it can offer.

While Cassandra is known for its excellent performance, it is also important to ensure a Cassandra cluster is scaled appropriately for the expected load, without which, it is very easy to tip over a Cassandra cluster. Capacity planning is an important exercise in setting up a Cassandra cluster. Anyone with exposure to production support, however, knows that we can only plan for known workloads, and workloads can spike due to unforeseen circumstances - be it a bug or an unexpected load on the client - which can then cascade as a heavy load to the Cassandra cluster. A common consequence of such a heavy load would be Cassandra latency degrading or worse yet Cassandra servers failing due to running out of resources like memory, CPU, network bandwidth, etc.

In this session, I will talk about an improvement to Cassandra wherein the server can put back-pressure to the clients whenever Cassandra is in an overloaded state. This has recently been committed to trunk, tracked as part of the CASSANDRA-15013 Jira. In particular, this improvement would allow Cassandra to shed load rather than allowing it to cascade, expiring messages that cannot possibly be fulfilled and ensuring a bounded memory expenditure even under heavy load. This work also fixes the issue where, in a high load situation, Cassandra ends up blocking threads that are responsible for shipping results back to the client, thereby eventually running out of memory due to piling up of the results, pending shipping.

Sumanth Pasupuleti | Senior Software Engineer, Netflix

Sumanth Pasupuleti is a Senior Software Engineer at Netflix, working on the core data platform team, offering persistence and caching solutions within Netflix. He is passionate about distributed systems and is an active contributor to Apache Cassandra.

Lessons Learned: Building Cassandra DBaaS on Alibaba Cloud

Tuesday, May 12, 2020 | | 1:00 PM - 1:40 PM

Architect
Cloud

Guo Chao | NoSQL Database Engineer, Alibaba

Zhang Youdong | NoSQL Database Engineer, Alibaba

During this session, we will share the lessons we learned when we built the Cassandra as a Service on Alibaba Cloud. We will discuss how we boosted Cassandra performance through soft raid on cloud disk, why we do a continuous full incremental backup, and how to apply automatic data repairs. We will also talk about how we optimize a Cassandra service for different use cases.

Guo chao headshot

Guo Chao | NoSQL Database Engineer, Alibaba

Zhang youdong headshot

Zhang Youdong | NoSQL Database Engineer, Alibaba

Connecting Spark and Cassandra in the Cloud

Tuesday, May 12, 2020 | | 1:00 PM - 1:40 PM

Architect
Apache Cassandra™ Analytics Cloud Graph

Russell Spitzer | Software Engineer, DataStax

Sure you can connect Spark to Cassandra on-premises but have you ever done it… in the cloud? With services like DataStax Astra, all of your data can live in the cloud and with providers like Databricks, Amazon EMR and Google DataProc all of your Analytics can live in the cloud as well! But did you know that you can connect these services together?

With the new cloud-enabled DataStax Spark Connector, it's possible to connect all these services with a minimal amount of configuration. Come and learn all the best tips and tricks for getting your Cloud Cassandra Analytics game up to speed!

Russell Spitzer Headshot

Russell Spitzer | Software Engineer, DataStax

After completing his PhD work at University of California, San Francisco, Russell joined DataStax to fulfill his deep longing to work with distributed systems. Since then, he has worked with Cassandra, Spark, Tinkerpop, Hadoop, as well as a myriad of other big data technologies. His favorite hobby is finding new ways of bringing these technologies together so that everyone can benefit from the new information age.

Architecting Agile Instances of DataStax Enterprise Using Docker

Tuesday, May 12, 2020 | | 1:00 PM - 1:40 PM

Architect
DataStax

Kerry Cox | Curriculum Developer, DataStax

Docker solves many of the problems of spinning up and testing DataStax Enterprise (DSE) and Apache Cassandra in a short time and with minimal investment. Docker provides myriad images for users to download and implement on local desktops and systems with limited hardware or knowledge. This demonstration will walk attendees through the basics of configuring Docker on a local system as well as deploying a standard image to a desktop environment. Users should walk out with a functioning DSE image on their laptops and be able to leverage this knowledge in their workplace.

This presentation will cover the following topics:

  • The basics of how Docker works
  • Installing Docker
  • Downloading images
  • Generating and distributing custom images
  • Networking Docker
  • Docker security steps 
  • Additional uses
Kerry Cox

Kerry Cox | Curriculum Developer, DataStax

Kerry Cox is a Curriculum Developer for DataStax and has been doing Cloud computing since 2014. He has taught computer science courses for the past 10 years, has a Ph.D. in German, and speaks several languages, and is the author of several technical manuals covering Linux, security, networking, and instructional training.

Cassandra 4.0: Faster, Cheaper, Stronger

Tuesday, May 12, 2020 | | 1:55 PM - 2:35 PM

Ops and DevOps
Apache Cassandra™ 4.0

Jon Haddad | Cassandra Committer

Cassandra has long since been the best choice in distributed databases for systems that have to be online 24x7. Companies of all sizes trust Cassandra to be available when they need fast answers from both large and small datasets.

All this power has come at a high cost. Despite the advances in functionality over the last several years, we still have a reputation for being difficult and expensive to run. Popular opinion in the community is to run no more than a few TB per node to ensure the time it takes to add, remove, or replace nodes is kept to a minimum. Typically Cassandra is CPU bound rather than at the network or due to disk I/O if you're using SSDs. This means we need to add more node and scale horizontally rather than vertically.

Cassandra 4.0 aims to improve that significantly. Fewer, denser nodes can translate to enormous cost savings. To achieve this, we need to improve our CPU efficiency. In this talk, we'll explore how the performance improvements in Cassandra 4.0 translates to decreased costs in your data center. We'll look at how the following are affected:

  • Bootstrap, decommission
  • Compaction
  • Repair
  • Garbage collection

After this talk, you'll walk away knowing what to expect after you upgrade, and how to get the most out of your money!

Jon Haddad

Jon Haddad | Cassandra Committer

Jon Haddad is an Apache Cassandra committer and PMC member with almost twenty years of experience as a software engineer and has been using Cassandra since version 1.1. Jon is an alum of DataStax and more recently was the Principal Consultant at The Last Pickle. He has handled performance tuning on over a hundred clusters and is the primary author of several open source performance tools.

Building a SOX-compliant Apache Cassandra Pattern

Tuesday, May 12, 2020 | | 1:55 PM - 2:35 PM

Architect
Apache Cassandra™ Cloud Security

Aaron Ploetz | NoSQL Engineering Lead, Target

Jeff Blanchard | NoSQL Tech Lead, Target

One aspect of leading an enterprise database team today, is that sooner or later you may be asked to supply deployment and configuration patterns which meet requirements for Sarbanes-Oxley (SOX) compliance.  

As a publicly-traded company, Target Corporation is subject to external audits and must be able to demonstrate compliance with certain laws and regulations. However, most open source data tools were built to solve specific problems, and complying with security or auditing requirements is usually a bit of an afterthought.  This talk describes our technical journey from a freely-downloadable open source database to a hardened, SOX-compliant datastore.

Aaron Ploetz headshot

Aaron Ploetz | NoSQL Engineering Lead, Target

Aaron Ploetz has been a professional software developer since 1997, and has several years of experience working on and leading DevOps teams for startups and Fortune 50 enterprises. He has been named a DataStax MVP for Apache Cassandra three times, and has presented at multiple events; including the DataStax Summit, Cassandra Day Chicago, and Data Day Texas. Aaron continues to contribute to the software community by answering StackOverflow questions from other developers (mostly in the Cassandra tag), and has worked as an author on the books Seven NoSQL Databases in a Week and Mastering Apache Cassandra 3.x (3rd edition).

Aaron earned a B.S. in Management/Computer Systems from the University of Wisconsin-Whitewater, and a M.S. in Software Engineering (Database Technology emphasis) from Regis University. When not writing or coding, he enjoys astronomy, retro-video gaming, and skijoring. Aaron and his wife, Coriene, live with their three children in the Twin Cities area.

Jeff Blanchard headshot

Jeff Blanchard | NoSQL Tech Lead, Target

Jeff Blanchard has been a Database Administrator since 2000 and has several years of experience working with multiple database technologies. Jeff has worked on with Cassandra, MongoDB, Redis, MS SQL as well as Oracle.

Jeff earned a B.S. in Management from Kennesaw State University. When not working, he enjoys karate and volunteering in the community. Jeff and his wife, Kate, live with their three children in the Twin Cities area.

Optimizing Cassandra Ingestion with Precomputed Views

Tuesday, May 12, 2020 | | 1:55 PM - 2:35 PM

Architect
Apache Cassandra™ Apache Kafka® Event Streaming Microservices

Cliff Gilmore | Principal - Advanced Technology Group, Confluent

Learn how by combining the power of the Apache Kafka ecosystem with Apache Cassandra you can build efficiently precomputed views and enhance user experiences. This session will show common architectures utilized in the real world along with use case examples of how computing views in the stream allows for Cassandra to serve information across a wide variety of tables on the same data and adapt as the underlying questions evolve.

Cliff Gilmore headshot

Cliff Gilmore | Principal - Advanced Technology Group, Confluent

Cliff and Mitch both work at Confluent, by way of Datastax, where they help customers with Data Architecture and operational best practices every day. They have deep hands-on experience in the data ecosystem from ETL, stream, NoSQL, Hadoop, and other technology stacks and have worked with 100s of customers in the field implementing distributed systems.

How to Keep 144 Nodes Healthy and Happy

Tuesday, May 12, 2020 | | 2:50 PM - 3:30 PM

DeveloperOps and DevOps
Apache Cassandra™ Monitoring

Denson Pokta | Staff Software Engineer, Intuit

At Intuit Data Platform Persistent, we are running 8 clusters supporting popular online applications like TurboTax and multiple business units. Our largest cluster has 144 nodes running in AWS A/A in 2 regions. I am here to share our experience of creating and maintaining a persistent platform.

Denson Pokta | Staff Software Engineer, Intuit

I've worked with Cassandra for the last seven years. I've made my share of mistakes to get to where we are today as a platform at Intuit.

2020 Vision for Your Data Model

Tuesday, May 12, 2020 | | 2:50 PM - 3:35 PM

Developer
Apache Cassandra™

Patrick McFadin | VP Developer Relations, DataStax

As the state of application infrastructure constantly changes, Cassandra needs to keep pace with changes. Over the past year, there have been some interesting changes proposed that will keep Cassandra the best database for building applications.

This includes:

  • changes to the CQL protocol for more flexible queries
  • transactions with better guarantees
  • indexing choices that fit your use case 

These are just a few ideas that have been discussed. I’ll talk about the pros and cons of each and any late-breaking news on changes. Hopefully, you’ll clearly see the future is bright for our favorite database!

 Patrick McFadin headshot

Patrick McFadin | VP Developer Relations, DataStax

Patrick McFadin has been working with Apache Cassandra and data modeling techniques for almost 10 years. He has also worked as Chief Evangelist for Apache Cassandra and consultant for DataStax, where he helped build some of the largest and exciting deployments in production. Previous to DataStax, he was Chief Architect at Hobsons and an Oracle DBA/Developer for over 15 years.

DataStax Astra: Achieving a No-Ops Future for Cassandra

Tuesday, May 12, 2020 | | 2:50 PM - 3:30 PM

ArchitectDeveloper
Cloud DataStax

Matt Kennedy | Senior Director of Product, DataStax

Apache Cassandra is justifiably famous as the database of choice for many digital-native companies who deploy their apps exclusively in public clouds. Many large enterprises have followed that example and found success there as well, but not without some banged shins along the way. As powerful as it is, Cassandra can be a difficult bit of technology to implement and manage properly.

This talk will update the audience on the launch and future of Astra, our DBaaS aimed at achieving a No-Ops future for Cassandra.

Matt Kennedy

Matt Kennedy | Senior Director of Product, DataStax

Matt has more than 20 years of experience in distributed systems as a product manager, field architect, and software developer. He is currently the Product Manager for DataStax's Apollo DBaaS product.

The Stars in Spring, a Bootiful Combination

Tuesday, May 12, 2020 | | 2:50 PM - 3:30 PM

Developer
Cloud

Frank Moley | Engineering Manager, DataStax

If you are a Java Developer today, you know the power of Spring and Spring Boot for delivering applications rapidly to your users. See how the power and distributed nature of DataStax Astra combined with the legendary speed and agility of Spring Boot is a perfect match for delivering world-class distributed applications.

In this talk, we will walk through integrating the Java driver for Apache Cassandra with a Spring Boot Starter and use it to access data from a DataStax Astra database. We will also talk about some patterns of distribution, and why Apache Cassandra, and ultimately DataStax Astra provide the much-needed data backbone for globally distributing applications to bring the data closer to your customer.

Frank Moley

Frank Moley | Engineering Manager, DataStax

Frank Moley is a Lead Engineer for DataStax working on distributed cloud platforms. His focus areas of software development include cloud-native architectures, microservices architectures, DevOps and DevSecOps workflows, and general system design and development using Java, Python and Go. He is an open source software enthusiast and contributor. Much of time these days is spent build multi-cloud systems leveraging Kubernetes as a platform and all the automation and tooling that comes with it.

Frank is also a content author for LinkedIn Learning and has many offerings on the Spring Framework, Java Development, and Architecture. Finally, he is a co-founder of the KC Spring User Group as well as the KC Java User Group.

Using Cassandra in Neural Network Model Design

Tuesday, May 12, 2020 | | 3:40 PM - 4:20 PM

Architect
Apache Cassandra™ Apache Kafka® Machine Learning/AI Monitoring

Rich Lee | Professor, Department of Statistics and Information Science, Fu Jen Catholic University

This empirical case study is about using a Cassandra cluster within the neural network model design for image classification application.

The Cassandra cluster receives the metadata from an existing RDBMS, stores the source images, and the intermediate process data during the neural network analysis.

The objectives of this case study were:

  • Benchmarking the time-complexity of each neural network model.
  • Exploring the best practice and the solution architecture for high volume stressed neural network analysis.

 The challenges of the case study were: 

  • The Data Model design patterns for the metadata and the image contents;
  • The high volume stressed distributed workloads design within the cluster;
  • The design of neural network analysis processes using Cassandra;
  • The long-lifespan batch process scheme; 
  • The scheme of the issue tracking and the analytic phases report.
Rich Lee

Rich Lee | Professor, Department of Statistics and Information Science, Fu Jen Catholic University

Rich Lee, an Enterprise Architect, is a Ph.D. professor in the Department of Statistics and Information Science at Fu Jen Catholic University. He has served IBM as a senior technical consultant for a decade. Before IBM, he worked for HPE/DEC as well as a consultant in implementing the Manufacturing Execution System in the semiconductor industry for many years.

Cassandra Event Driven Automation with StackStorm

Tuesday, May 12, 2020 | | 3:45 PM - 4:25 PM

Ops and DevOps
Apache Cassandra™ Cloud Kubernetes

Serban Teodorescu | SRE, Adobe

Stefan Gusa | SRE, Adobe

We’re managing more than 30 Cassandra clusters in 8 AWS regions, 6-700 instances in total, and so far we've never run out of problems to automate. 

One persistent recurring problem is replacing Cassandra nodes when AWS instance event notifications are sent. This happens when AWS detects degraded hardware, or on maintenance events. We wanted a fully automated system to handle this, and which has the potential to handle more than that. We decided to use StackStorm on Kubernetes, due to the richness of its ecosystem and the potential of using workflows to decide on the chain of actions performed through other systems.

In this session we will cover:

  • The main problem, the need for a fully automated system that replaces Cassandra instances when AWS instance event notifications are sent, or in other situations.
  • StackStorm deployment with Kubernetes
  • AWS CloudWatch setup to capture these events.
  • Notification system
  • How to decide that a replacement is safe to do
  • Cassandra replacement
  • Triggering actions manually
  • Slack integration
  • Other usages of StackStorm, not all related with Cassandra (e.g., recover from reboot, improved alerting, dynamically altering parameters)
Serban Teodorescu

Serban Teodorescu | SRE, Adobe

Serban Teodorescu is an SRE at Adobe, where he’s part of a small team that manages 30+ Cassandra clusters for Adobe Audience Manager. Previously, he was a Python programmer, and he’s still trying to find out how a developer who preferred SQL databases ended up as an SRE for a Cassandra team. Apart from Cassandra and Python, he’s interested in automating his tasks with terraform, AWX, and StackStorm.

Stefan Gusa

Stefan Gusa | SRE, Adobe

Stefan Gusa has been a Site Reliability Engineer at Adobe since June 2019. He is part of a three-member team that manages Cassandra clusters, holding over 600 nodes in 8 AWS regions. Formerly he has been an intern in another team at Adobe, since June 2018, during his final Bachelor's degree year. He studied Computer Science at "Politehnica" University of Bucharest and is now pursuing a Master's degree in Parallel and Distributed Systems at the same university. He is a passionate programmer with skills in Python and Java, also interested in innovative engineering technologies such as Kubernetes.

On-Prem Automation at Scale

Tuesday, May 12, 2020 | | 3:45 PM - 4:25 PM

ArchitectOps and DevOps
Apache Cassandra™ Use Cases

Grant Cohoe | Senior Systems Engineer, Constant Contact

At Constant Contact, a team of one-and-a-half manage 420 nodes in 30 rings and still gets sleep at night. This session shares how we leverage configuration management tools (e.g., Puppet, Ansible) to automate cluster operations and provide self-service database changes to developers. We'll also share some experiences, tips, and tricks we've acquired over the last nine years. And maybe a ghost story or two.

Grant Cohoe

Grant Cohoe | Senior Systems Engineer, Constant Contact

Grant has worked in infrastructure operations for the last seven years. He holds a degree in Applied Networking and Systems Administration from the Rochester Institute of Technology. Other interests include cocktails, college hockey, and summer camp.

RESTing Easy With Spring and Cassandra

Tuesday, May 12, 2020 | | 3:45 PM - 4:25 PM

Developer
Apache Cassandra™ Microservices

Brian Hess | Strategic Solution Engineer, DataStax

REST APIs have been around for a long time, but they are still a mainstay for modern applications and a way to interact with persistent data stores (such as databases).  Spring Boot is a common framework for building such APIs.

This talk will discuss Spring Boot, how to use it with Cassandra and DataStax Enterprise, how to leverage object mappers (such as Spring Data and the DataStax Java Driver Object Mapper), and some tools to make this all really simple... so you can get back to resting :P

Brian Hess headshot

Brian Hess | Strategic Solution Engineer, DataStax

Brian has been in the data and analytics space for over 20 years, ranging from government to data mining applied research to analytics in enterprise data warehousing and NoSQL engines. Currently, Brian is a Strategic Solutions Engineer, working with some of DataStax's largest customers, and has held several other technical roles including Product Director for Analytics and Partner Architect. Brian started his career as a Cryptologic Mathematician in the US Department of Defense, where he worked on Data Science, Data Mining, and large scale data research. He then joined Netezza as Principal Mathematician and Director of Advanced Analytics, pushing Netezza to address new and advanced use cases. Brian has Masters degrees in both Mathematics and Computer Science from the Johns Hopkins University.

Code-less Data Pipelines with Apache Cassandra, Flink, and NiFi

Tuesday, May 12, 2020 | | 3:45 PM - 4:25 PM

ArchitectOps and DevOps
Apache Cassandra™ Apache Kafka®

Roger Huang | Lead System Architect, Visa

Most of us have Apache Cassandra as a key data store of our data platform. However, no data store is an island, even if it’s Cassandra. Rather our data platforms are graphs of data pipelines of data sources and data sinks, that may include real-time stream processing.

This talk shows how to create code-less data pipelines using:

  • Apache NiFi: web UI for creating and managing data pipelines
  • Apache Flink: framework for high throughput, low latency stateful computations over both unbounded and bounded data streams. APIs for stream processing, streaming SQL, Complex Event Processing
  • Apache Cassandra: distributed, elastically scalable partitioned row store with a flexible schema that’s highly available, partition tolerant with tunable consistency

You will see a demonstration of a data pipeline consisting of:
 Scala Kafka producer -> Kafka topic -> Flink Kafka consumer -> NiFi -> Cassandra

Roger Huang

Roger Huang | Lead System Architect, Visa

Roger enjoys building reactive systems using Scala and Akka. He has spoken on various Big Data topics at Visa, PayPal, Amazon, and ACM SIG, including Knowledge Discovery and Data Mining.

DAY 1: Closing Keynote

Tuesday, May 12, 2020 | | 4:45 PM - 5:35 PM

ArchitectDeveloperOps and DevOps
Keynote Apache Cassandra™ Cloud Kubernetes Machine Learning/AI

Sam Ramji | Chief Strategy Officer, DataStax

Quantum Entanglement, Cassandra, and Community: Approaching the Speed of Information

Envision our world in 2025. Data grows exponentially and we all must create and operate at Netflix-scale. As a community, we are guardians of the scale-out architecture that sees information move at the highest possible speed for anything that wraps the planet.

Looking back to 2020, what was our collective belief and values system? How did we work together to create a positive-sum engine of abundance? Join us in this invitation to create a ‘hallway track’ of collaboration during our time at Accelerate.

Sam Ramji

Sam Ramji | Chief Strategy Officer, DataStax

A 25-year veteran of the Silicon Valley and Seattle technology scenes, Sam Ramji has helped build two multi-billion dollar markets (API Management at Apigee and Enterprise Service Bus at BEA Systems) and redefined Microsoft’s open source and Linux strategy from “extinguish” to “embrace”.

He is nerdy about open source, platform economics, middleware, and cloud computing with emphasis on developer experience and enterprise software. He is an advisor to multiple companies including Accenture, Insight Engines, and the Linux Foundation. Sam received his B.S. in Cognitive Science from UCSD in 1994 and is still excited about artificial intelligence, neuroscience, and cognitive psychology.

Welcome Party!

Tuesday, May 12, 2020 | 5:45 PM - 8:00 PM

Wednesday, May 13, 2020

DAY 2: Opening Keynote

Wednesday, May 13, 2020 | | 9:00 AM - 10:30 AM

ArchitectDeveloperOps and DevOps
Keynote Apache Cassandra™ Cloud

Jonathan Ellis | Co-Founder & Chief Technology Officer, DataStax

Ed Anuff | Chief Product Officer, DataStax

Accelerating your Transformational Data Architecture

ED ANUFF. Users and enterprises are using NoSQL and Cassandra to create transformative experiences, applications and services. Kubernetes, Kafka, and other technologies are fanning the flames of change and provide massive new opportunities. Learn what we’re doing to keep you at the forefront of innovation and how we’re making it easier and exciting for you to catch the NoSQL wave.

 

Extending NoSQL: The Foundation for Transformational Data

JONATHAN ELLIS. The market has adopted non-relational technology like DataStax Enterprise and Apache Cassandra™ to solve specific problems around performance, scalability, and availability. But we’ve lost some things in that transition, too. Jonathan will explain how DataStax is extending the scope of NoSQL to include some of these features like ACID transactions, native indexing, and graph queries in all flavors of Cassandra.

Jonathan Ellis

Jonathan Ellis | Co-Founder & Chief Technology Officer, DataStax

Jonathan is a co-founder of DataStax. Before DataStax, Jonathan was Project Chair of Apache Cassandra for six years, where he built the Cassandra project and community into an open-source success. Previously, Jonathan built an object storage system based on Reed-Solomon encoding for data backup provider Mozy that scaled to petabytes of data and gigabits per second throughput.

Ed Anuff

Ed Anuff | Chief Product Officer, DataStax

Ed Anuff has over 25 years experience as a product and technology leader at companies such as Google, Apigee, Six Apart, Vignette, Epicentric, and Wired.  He led products and strategy for the successful Apigee API Platform for the last 8 years, helping to make it the recognized category leader and led to the acquisition of Apigee by Google.  He was the founder of enterprise portal leader Epicentric which was acquired by Vignette. 

In the 90's, at Wired, he launched one of the first Internet search engines, HotBot, as well as authored one of the first textbooks on the Java programming language.  Ed is a graduate of Rensselaer Polytechnic Institute (RPI).

Providing C* as a Platform for Netflix: Lessons that Led to Making Apache C* Autonomous

Wednesday, May 13, 2020 | | 10:45 AM - 11:25 AM

Architect
Apache Cassandra™

Vinay Chella | Builder, Netflix

Apache Cassandra is a vital part of the Netflix ecosystem, providing it as a first-class database offering to Netflix developers was an old episode but making it autonomous in an ever-changing cloud ecosystem is a fun part of a new episode.

In this session, you will get to experience the lessons we learned in a hard way, the autonomous infrastructure we built around it, and the mental models in providing Apache Cassandra as a platform.

Vinay Chella

Vinay Chella | Builder, Netflix

​Vinay Chella is an Apache Cassandra Committer and engineer at Netflix. He possesses a great understanding of Cassandra, distributed systems, and relational databases. As an Engineer and Architect, he has extensively worked on building scalable distributed services over a decade, helped several teams @ Netflix in designing highly efficient persistence solutions. He is a core contributor to Netflix's data store platform and currently spends the majority of his time with Cassandra.

Apache Cassandra 4.0 Diagnostics

Wednesday, May 13, 2020 | | 10:45 AM - 11:25 AM

Ops and DevOps
Apache Cassandra™ 4.0 Apache Cassandra™

Mick Semb Wever | Apache Cassandra Consultant, The Last Pickle

A preview of an exciting new feature coming in Apache Cassandra 4.0: Diagnostic Events; providing real time insights into your Cassandra internals. Observability is key to successfully operating Cassandra, as it allows users and developers to find bugs and identify runtime issues. Log files and metrics are a very popular way to get insights into a cluster’s behaviour, but they are limited to small text representations or time series data. Often important information is missing from log files and can’t be added without changing the source code and rebuilding Cassandra. Diagnostic Events have been designed to fill this gap by providing a way to observe all different types of changes that occur inside Cassandra as they happen.

This talk will introduce what Diagnostics Events are, and the types of events currently available, examples of what can be done with these event types, and how they have been internally implemented.

Mick Semb Wever headshot

Mick Semb Wever | Apache Cassandra Consultant, The Last Pickle

Beyond technology, talk to me about snowboarding, rock climbing, trail running, skiing, surfing, and just anything awesome in nature.

Next Generation Drivers: Why and How to Migrate

Wednesday, May 13, 2020 | | 10:45 AM - 11:25 AM

Developer
Apache Cassandra™

Cédrick Lunven | Developer Advocate, DataStax

"The last two years have been really busy for the driver team at DataStax:

  • 2018: released a new major version of the drivers, 4x better suited for the future
  • 2019: support for DataStax Apollo cloud service
  • 2020: merging enterprise and OSS drivers, offering all advanced capabilities to the community

Haven't adopted the new drivers yet? Don't worry! We have you covered.

During this session, we will detail what changed (and WHY). We'll then share a proposed migration guide as well as a sample CRUD application available in all versions for you to track the evolutions. Come, Developers! Don't fear the source code!"

Cédrick Lunven

Cédrick Lunven | Developer Advocate, DataStax

Cedrick is a Developer Advocate at DataStax where he finds opportunities to share his passions by speaking about developing distributed architectures and implementing reference applications for developers.

In 2013, he created FF4j, an open source framework for Feature Toggle which he still actively maintains. He is also a contributor to JHipster.

Hidden Features of Apache Cassandra 4.0

Wednesday, May 13, 2020 | | 11:40 AM - 12:20 PM

ArchitectDeveloperOps and DevOps
Apache Cassandra™ 4.0

Dinesh Joshi | Apache Committer / Senior Software Engineer, Apache Software Foundation

Apache Cassandra 4.0 is a huge community effort! It has over 400 patches including features and bug fixes. We have a lot of features that are well known and there are great features that are not so well known. In this talk, you will learn about some of those hidden features that might make your life easier, give you a great performance boost, or just surprise you!

Dinesh Joshi

Dinesh Joshi | Apache Committer / Senior Software Engineer, Apache Software Foundation

Dinesh A. Joshi has been a professional Software Engineer for over a decade building highly scalable realtime Web Services and Distributed Streaming Data Processing Architectures serving over 1 billion devices. Dinesh is an active contributor to the Apache Cassandra codebase. He has a Masters degree in Computer Science (Distributed Systems & Databases) from Georgia Tech, Atlanta, USA.

Migrating from SQL to NoSQL Database

Wednesday, May 13, 2020 | | 11:40 AM - 12:20 PM

ArchitectDeveloper
Apache Cassandra™

Neha Giri | Software Architect, Intuit

Topic: Strategy for using NoSQL database for legacy applications dependent on relational database

Audience: Architects, Developers.

What to expect: Real-life use cases of migrating from SQL to NoSQL data, lessons learned, architecture and benefits. 

Back story: I was involved in efforts for migrating to NoSQL from Oracle database for 2 organizations at Ericsson and Intuit. With NoSQL, I saw awesome benefits in scalability and performance. However, converting from relational to NoSQL is not easy as the relations between the tables need to be modeled in a different way in NoSQL due to differences from relational databases. Also, it requires a different way of thinking - thinking in terms of hierarchical entities instead of tables sharing foreign keys.

Topics:

  • Fundamental differences between relational and no sql databases and benefits of using no sql database
  • How to evaluate if your application's use case is fit for migration to no sql database
  • How to re-design your schema to fit into no-sql non-relational database
  • How can database queries be re-designed for no sql database to increase from performance and scalability
  • Migration and testing strategies so that there is no negative impact on application functionality and integrity
Neha Giri Headshot

Neha Giri | Software Architect, Intuit

Neha Giri is software architect for consumer group at Intuit Canada Inc. She has over 13 years of extensive experience in software development in telecom and financial domains. She has a Bachelor of Engineering in Computer Engineering from Delhi University. She is passionate about software development, scalability and database architecture.

Cassandra Workloads on Microsoft Azure: Best Practices and Learnings

Wednesday, May 13, 2020 | | 11:40 AM - 12:20 PM

ArchitectOps and DevOps
Apache Cassandra™ Cloud

Boris Baryshnikov | Principal PM Manager, Microsoft Azure Compute Product Group

In this session, we will provide an overview of how Microsoft Azure supports the vast open source software ecosystem focusing on best practices, performance tuning, and learnings for running your Cassandra workloads on Azure.

Boris Baryshnikov

Boris Baryshnikov | Principal PM Manager, Microsoft Azure Compute Product Group

Boris Baryshnikov is a principal lead program manager in the Azure Compute product group. Boris's team is focusing on use, growth and end-to-end experiences of Linux workloads on Azure. This includes working with numerous partners such as SAP, Red Hat, Oracle, and others.

Networking Break and Lunch

Wednesday, May 13, 2020 | 12:20 PM - 1:10 PM

What is Autonomous Testing?

Wednesday, May 13, 2020 | | 11:40 AM - 12:20 PM

Ops and DevOps
Machine Learning/AI

Artem Golubev | CEO & Co-Founder, testRigor.ai

Yet another type of testing?
Why do we need it?
Why did we choose to use Cassandra and how do we use it? (In terms of 1) CAP theorem, 2) inserts vs read speed)
 
 Classification of types of autonomous testing:

  •   Types of functionality recognition/code generation
    •   Discovery/rules-based
    •   Pattern-recognition-based
    •   Combinatorial-based
  •   Types of testing:
    •   Regression
    •   Discovery

How do different types of systems under test affect tests support?
What types of autonomous tests are one-off and what types are supportable.
 
Will walk through use cases for each type and when they are applicable.
Talk about cases where all of them are not applicable.
How autonomous testing can be integrated into CI/CD. Where integration to CD is possible and where it is not possible.
 
You’ll see different examples in practice.
 
What you would learn:

  • What is Autonomous testing and what are classes of it.
  • In what situation each type of autonomous testing is applicable.
  • When autonomous testing is not applicable.

Artem Golubev | CEO & Co-Founder, testRigor.ai

Artem Golubev is a software veteran, with more than 20 years in Software Development building various B2B Enterprise and E-Commerce software at companies like Microsoft, Salesforce. He is a Certified Enterprise Architect, holds CC, CL, ACB certifications from Toastmasters. During his career, he’s learned a lot about infrastructure, code management, and deployment and was Senior Engineering Manager of Infrastructure at Salesforce. More recently he’s dedicated time to automated software testing and is working as CEO at TestRigor. He is excited about technology helping people to solve their tedious tasks. Especially zero-downtime no-ops autonomous software that manages itself. He also enjoys snowboarding, full-bodied red wines and just plain fun.

Event Driven Automation to Enhance Customer Experience

Wednesday, May 13, 2020 | | 1:10 PM - 1:50 PM

Ops and DevOps
Event Streaming Monitoring

Arun Singh | Site Reliability Engineer, Adobe Systems

Event driven automation (EDA) is a framework containing computer programs and workflows written to “listen” and respond to events generated by the user or the system. Applications rely on programming that separates event-processing logic from the rest of its code. With EDA, an event can be any identifiable occurrence that has significance for the workflow for which it is designed. Examples might include events caused by a large user-generated volume of requests and system-generated events such as program failing to load, sensor outputs, or messages from individual threads. EDA is accomplished through sensors that listen for the events, which then trigger a potentially complex sequence of actions either sequentially or in parallel. These actions form a workflow where values derived from a set of actions are passed through to a subsequent set of actions based on specified conditions or predetermined criteria. These actions can be written in any programming language to improve responsiveness, throughput, and flexibility in a given workflow.

In this talk, I will be sharing my journey towards self healing infrastructure, which started with implementing self healing via SaltStack then moving towards Rundeck and now finalised on StackStrom. I will also talk about how self healing is helping us to resolve thousands of alerts without SRE intervention. This includes the pros/cons and highs/lows we encountered in our journey while looking for a suitable workflow engine, how we added new things in existing technologies to make it fit for our environment, and how users can learn from our mistakes to setup in your own environment in hours.

I will be talking about different workflow engines and provide a brief comparison between them. This includes which to use when.

Read my blog for more information: https://medium.com/adobetech/how-adobe-experience-platform-is-using-event-driven-automation-to-enhance-customer-experience-d1ee3e4d3118

Arun Singh

Arun Singh | Site Reliability Engineer, Adobe Systems

Graph in Cassandra: A Deep Dive into Distributed Adjacency Lists and Graph Database Modeling

Wednesday, May 13, 2020 | | 1:10 PM - 1:50 PM

Apache Cassandra™ Graph

Denise Gosnell | Chief Data Officer, DataStax

Matthias Broecheler | Chief Technologist, DataStax

When you are using DataStax Graph, you are working with graph data in Cassandra. This session will give you a peek under the hood to illustrate the fundamental concepts of working with graph data structures in Cassandra. We will walk through how we use primary keys, partition keys, clustering columns, and materialized views to create vertices and edges across Cassandra partitions.

You will take away two main new ideas from this talk. First, you will understand how we model graph data as distributed adjacency lists within Cassandra. Second, you will learn a list of 10 best practices for modeling and using your data as a graph.

The extended version of this content is available in 'The Practitioners Guide to Graph Data' by D. Gosnell and M. Broecheler.

Denise Gosnell headshot

Denise Gosnell | Chief Data Officer, DataStax

Dr. Gosnell's career centers on her passion for examining, applying, and evangelizing the applications of graph data. Her most recent book, The Practitioners Guide to Graph Data with co-author Dr. Matthias Broecheler, details the best practices of working with graph data in Casssandra. In addition to her newly published book, she has patented, built, published, and spoken on dozens of topics related to graph theory, graph algorithms, graph databases, and applications of graph data across all industry verticals.

Prior to her role with DataStax, Gosnell worked in the healthcare industry, where she contributed to software solutions for permissioned blockchains, machine learning applications of graph analytics, and data science. She earned her Ph.D. in Computer Science from the University of Tennessee as an NSF Fellow. Her research coined the concept "social fingerprinting" by applying graph algorithms to predict user identity from social media interactions.

Matthias Broecheler headshot

Matthias Broecheler | Chief Technologist, DataStax

Dr. Matthias Broecheler is a technologist and entrepreneur with substantial research and development experience who is focused on disruptive software technologies and understanding complex systems. Dr. Broecheler’s is known as an industry expert in graph databases, relational machine learning, and big data analysis in general. He is a practitioner of lean methodologies and experimentation to drive continuous improvement. Dr. Broecheler is the inventor of the Titan graph database and a founder of Aurelius.

The Missing Manual for Time Window Compaction Strategy

Wednesday, May 13, 2020 | | 1:10 PM - 1:50 PM

DeveloperOps and DevOps
Apache Cassandra™

Alice Lottini | Vanguard Architect, DataStax

In this session, we will be taking an in-depth look at time window compaction strategy: what it is for, how it works internally, how to tune it, and how to work with it at scale. We will look at how to use DSBench to performance test TWCS for your use case, with some practical examples.

Alice Lottini headshot

Alice Lottini | Vanguard Architect, DataStax

Alice joined DataStax five years ago. She spent most of this time working with enterprises to design, deploy and scale a variety of applications based on DataStax. She recently moved to the Vanguard team where she continues to focus on strategic projects, both internal and external. She has a strong interest in the Cassandra internal mechanisms and how they perform at scale.

Prior to joining DataStax, Alice graduated in Computing Engineering from Politecnico di Torino. She worked for 12 years as a developer and technical architect in several industries, always with a focus on databases and data design, mostly on relational technologies - until she started using DataStax and never looked back!

Distributed Risk Calculate Engine

Wednesday, May 13, 2020 | | 1:10 PM - 1:50 PM

ArchitectDeveloper
Analytics Use Cases

Robert Czupiol | Principal Expert - Architect, ING Bank Slaski Poland

During the presentation, I want to show how we developed a distributed risk calculate engine base on DataStax Enterprise (DSE) platform. In this project, we used, of course, the Cassandra database, Spark build-in DSE, DSEFS, AlwaysOn SQL, Advanced Replication and some of own stuff written in Java. The major problem, in this case, was two types of calculations - everyday batch and triggered by event on-demand. 

I'll describe our road to make this solution bulletproof (ha, reliable) and also fast enough for business. So I have to show some architecture diagrams and some meat (Java code). This talk will be full of pitfalls, tips and tricks.  

So I'm going to talk about some technology stuff from DSE (included advantages and disadvantages of this solution) and I'm going to show how use them in practice to make an awesome solution.

Robert Czupiol | Principal Expert - Architect, ING Bank Slaski Poland

Principal Expert Architect and Java developer in ING Bank Slaski - Poland. 6 years exp in Cassandra on production - architect, developer and devops in one ninja person. Certificated in Cassandra during Cassandra Summit (San Jose). Speaker at DataStax Accelerate, Premier Apache Cassandra Conference (Washington DC) and some domestic conference like JDD Cracow, SpreadIT Katowice and some meetups. Trainer at Cassandra workshops.

New Secondary Indexing in DataStax Enterprise - Storage Attached Index

Wednesday, May 13, 2020 | | 2:05 PM - 2:45 PM

DeveloperOps and DevOps

Marc Selwan | Sr. Product Manager - Server Team, DataStax

Application requirements are always evolving and changing. While modeling your data after your queries gets you the best performance, sometimes you just want to add a bit of flexibility to your data models without having to remodel everything. In this talk, learn all about the new secondary indexing engine we're building into DataStax Enterprise to give you some flexibility in your applications without a ton of extra overhead.

Marc Selwan headshot

Marc Selwan | Sr. Product Manager - Server Team, DataStax

Marc has been working at DataStax for over 5.5 years doing everything from helping customers build awesome applications with Apache Cassandra and DataStax Enterprise to working with our amazing engineering teams to build some great new features for our database.

Live Traffic Replay in Cassandra 4.0: Full Query Logging

Wednesday, May 13, 2020 | | 2:05 PM - 2:45 PM

Ops and DevOps
Apache Cassandra™ 4.0

Aleks Volochnev | Developer Relations | DataStax

The upcoming Cassandra release brings a very new feature: Full Query Logging. Developed by the Netflix team and the Apache Community, it brings an incredible tool to capture, investigate and replay production traffic on any test or staging environment with ease.

In this session, we take a deep dive into Cassandra Logging and discuss the main benefits, limitations, ways to use, and internals of the FQL. For those wishing to experiment with the infrastructure using real workload without affecting production servers, it’s the way to go!

Aleks Volochnev headshot

Aleks Volochnev | Developer Relations | DataStax

Eliminating Operational Blindspots on Cassandra and DataStax Clusters

Wednesday, May 13, 2020 | | 2:05 PM - 2:45 PM

Ops and DevOps
Apache Cassandra™ Cloud DataStax

Manikandan Srinivasan | Senior Director, Product Marketing, DataStax

With today's complex applications, sending Apache Cassandra and DataStax diagnostic data to a centralized, scalable, and flexible monitoring solution is critical for providing fast and accurate problem resolution.

In this talk, we'll preview DataStax Insights, a new performance monitoring solution that quickly identifies anti-patterns, cluster performance bottlenecks, and database configuration issues with a unified view across all of your deployments. We'll demonstrate how you can troubleshoot potential issues with click-through discovery as well as automate performance tuning with AI-powered analysis and recommendations.

Manikandan Srinivasan

Manikandan Srinivasan | Senior Director, Product Marketing, DataStax

Mani Srinivasan is a Senior Director within DataStax’s product management team where he leads the strategy and vision for the DataStax Enterprise tooling solutions and various cloud initiatives.

Migrating a Microservice to DataStax Astra in 10 Lines of Code

Wednesday, May 13, 2020 | | 2:05 PM - 2:45 PM

ArchitectDeveloper
Cloud DataStax Microservices

Jeffrey Carpenter | Director of Developer Advocacy, DataStax

In this talk, we'll do a live demonstration of porting a Java microservice implementation from an Apache Cassandra cluster to a cluster running in DataStax Astra. We'll explore how easy it is to create an Astra cluster, download connection information into a development environment, and update service connection code to seamlessly connect to any Cassandra-compatible cluster using the file-based configuration features of the DataStax Java Driver.

Jeffrey Carpenter headshot

Jeffrey Carpenter | Director of Developer Advocacy, DataStax

Jeff Carpenter leads the Developer Advocate team at DataStax, using his background in system architecture, microservices and Apache Cassandra™ to help empower developers and operations engineers to build distributed systems that are scalable, reliable, and secure. Jeff has worked on large-scale systems in the defense and hospitality industries and is the author of Cassandra: The Definitive Guide, 2nd Edition (3rd Edition on the way!).

Rethinking Performance and Scale Testing with NoSQLBench

Wednesday, May 13, 2020 | | 2:05 PM - 2:45 PM

DeveloperOps and DevOps
Apache Cassandra™ DataStax

Jonathan Shook | Solution Architect, DataStax

In this session, we’ll explain what DSBench is, how it is used for performance testing and scaling math, and how to get started as a first time user. We’ll demo how you can use DSBench in a few key scenarios such as performance analysis, data modeling, and cluster sizing.

Jonathan Shook

Jonathan Shook | Solution Architect, DataStax

Jonathan Shook works as a Solution Architect at DataStax. He has been building distributed systems for over 15 years, with an equal focus on operational and functional design aspects. He helps customers build highly available and scalable systems, emphasizing whole-system thinking, collaborative design, and empirical testing methods.

Going Multi-Cloud with the DataStax Enterprise Kubernetes Operator

Wednesday, May 13, 2020 | | 2:55 PM - 3:35 PM

ArchitectOps and DevOps
Cloud Kubernetes

Christopher Bradford | Product Manager, DataStax

Deciding when and how to go multi-region or even multi-cloud is no small task. Navigating the gauntlet of various provider offerings and Virtual Private Cloud implementations is daunting. Additionally, secure connectivity and service discovery across Kubernetes clusters requires cloud networking acumen. Plus, are there any hidden costs or operations concerns associated with this kind of deployment? We will explore all of these points and more culminating in a demonstration.

Christopher Bradford headshot

Christopher Bradford | Product Manager, DataStax

Christopher has a passion for enabling efficiency through automation. From promoting effortless scaling via Cassandra to DevOps pipelines with infrastructure automation and containers he is here to get work done and enable operators to reset easy. He has developed applications consumed by millions and manned a Network Operations Center during the graveyard shift. In his heart he is a proponent for freedom from the tedious and growth through working smarter not harder.

How to Contribute Code to Cassandra

Wednesday, May 13, 2020 | | 2:55 PM - 3:35 PM

ArchitectDeveloper
Apache Cassandra™

Mick Semb Wever | Apache Cassandra Consultant, The Last Pickle

Using open source software gives you more control over the products and services you make (oh and it’s free, which is nice). You get to help direct how the software is created: need a new feature? Just ask, and maybe someone will add it. The best way to see Cassandra improve is to contribute code: a little or a lot, for your needs or for a bug another person found. After ten years of development, the code base can seem big and scary, but it’s just code, it’s all knowable, and there are people in the community who will help you and value your contribution.

In this talk, Mick Semb Wever, from The Last Pickle, will provide a beginner introduction building the various project components, writing code and tests, submitting a patch, and anything else needed to get your first contribution shipped.

Mick Semb Wever headshot

Mick Semb Wever | Apache Cassandra Consultant, The Last Pickle

Beyond technology, talk to me about snowboarding, rock climbing, trail running, skiing, surfing, and just anything awesome in nature.

C*NSYNC: Design Patterns for Consistency Sensitive Applications

Wednesday, May 13, 2020 | | 2:55 PM - 3:35 PM

Developer
Apache Cassandra™ Use Cases

Jeremy Hanna | Data Architect, DataStax

Distributed systems such as Apache Cassandra are uncommonly resilient and scalable. Cassandra achieves this by having a masterless design where each replica is independent and the client application determines what consistency is required. Some applications and domains are sensitive to consistency such as payment systems and inventory for retail and travel. Yet many companies are successful at building these applications on Cassandra and a number of design patterns and best practices have emerged.

This talk will cover these design patterns and review the built-in features of Cassandra to keep data across replicas and tables in sync.

Jeremy Hanna

Jeremy Hanna | Data Architect, DataStax

Jeremy Hanna has been working with Apache Cassandra and related technologies since 2010 when he worked on a team using and contributing to Cassandra at Rackspace. Over the last 10 years, he's helped companies around the world successfully deploy clusters and applications that require scale and resilience. He's been at DataStax since 2012.

I Have a Cassandra Data Model, But What's The Architecture?

Wednesday, May 13, 2020 | | 3:00 PM - 3:40 PM

Architect
Apache Cassandra™ Event Streaming Microservices

Jeffrey Carpenter | Director of Developer Advocacy, DataStax

You've seen the talks on Cassandra data modeling and maybe you've even taken DS220 on DataStax Academy, but now you need an architecture that takes advantage of that awesome denormalized data model you created. In this talk, we’ll walk through a concrete example of the relationship between data models and architecture, including how to identify services based on your data model, and how to manage relationships between different data types across service boundaries.

Jeffrey Carpenter headshot

Jeffrey Carpenter | Director of Developer Advocacy, DataStax

Jeff Carpenter leads the Developer Advocate team at DataStax, using his background in system architecture, microservices and Apache Cassandra™ to help empower developers and operations engineers to build distributed systems that are scalable, reliable, and secure. Jeff has worked on large-scale systems in the defense and hospitality industries and is the author of Cassandra: The Definitive Guide, 2nd Edition (3rd Edition on the way!).

Marketing Contact Data Foundation

Wednesday, May 13, 2020 | | 3:05 PM - 3:45 PM

ArchitectOps and DevOps
Cloud DataStax Use Cases

Vipul Ashta | Manager, Data and Analytics, Cisco Systems

Maniyarasan Selvaraj | Senior Data Engineer, Cisco

Digital Marketing leverages channels such as email, web, social, paid media, third party sites to connect with current and prospective customers. The challenge we have is to connect all the interactions and provide insightful data to the marketers to understand better with the current context of the customer/prospect.

At Cisco, we built a system called ‘Contact Data Foundation’ which enables seamless real-time personalized customer experience by creating a holistic threaded contact and account foundation and activating the predictive and prescriptive insights through the right channel at the right time. The following are the few core areas we have used DataStax Enterprise to achieve the same:

  • Identities threading 
  • Real-time data visibility
  • Interactions threading and Single Customer View - Thread the interactions from various channels as time-bound events, the time series enabled data store with Cassandra helps to enable this capability
  • Real-time Integration with channels over APIs – Since the DSE ecosystem has the Solr built on top of core Cassandra, it helps to provide a real-time response through APIs for web, email and call centers

Vipul Ashta | Manager, Data and Analytics, Cisco Systems

Maniyarasan Selvaraj | Senior Data Engineer, Cisco

DAY 2: Closing Keynote

Wednesday, May 13, 2020 | | 4:00 PM - 5:00 PM

ArchitectDeveloperOps and DevOps
Keynote Cloud

Corey Quinn | Cloud Economist, The Duckbill Group

Tilting at Windmills: The Myth of Multi-Cloud

As far as best practices go, multi-cloud is a terrible idea. It's also a growing reality in many environments. How can you align with the current state of the art while deploying environments that exist beyond whiteboards? If you go "all-in" on one cloud provider, what does that mean for third party vendors in your stack?

In this lighthearted and entertaining talk, Cloud Economist Corey Quinn discusses the ebb and flow of the multi-cloud narrative, why database selection is critical to this discussion, and how you can embrace the best solutions possible while respecting your resource constraints. After all, it's easy to get on stage and talk about what you should be doing; this talk instead talks about what you *will be* doing.

Corey Quinn

Corey Quinn | Cloud Economist, The Duckbill Group

Corey is the Cloud Economist at The Duckbill Group. Corey specializes in helping companies improve their AWS bills by making them smaller and less horrifying; hosts the Screaming in the Cloud and AWS Morning Brief podcasts; and curates Last Week in AWS, a weekly newsletter summarizing the latest in AWS news, blogs, and tools, sprinkled with snark.

Accelerate 2019

Not sure where to start? Check out all the great sessions from Accelerate 2019 to get a feel for what you can expect.

See Keynotes and Sessions