Jeffrey Carpenter

This issue is guest edited by Rebecca Mills (<a href="https://twitter.com/rebccamills">@rebccamills</a>), DataStax Developer Learning:

If you have been working with databases for a while, indexing is probably a familiar concept.

Database indexes enhance your data model and make your queries more efficient. Although Cassandra has had secondary indexes for a long time, indexing in itself is generally associated with several tradeoffs and problems. Many Cassandra experts have recommended avoiding use of indexing because of these tradeoffs, and as a result, we as a community have emphasized using denormalization to maximize performance of our queries.

The two previous secondary indexing implementations in Cassandra are Storage Attached Secondary Indexing (SASI) and Secondary Indexes (or 2i for short). The two main challenges with these implementations have been (1) write amplification and (2) index size on disk. SAI represents a huge improvement to both of these pain-points.

As Jonathan Lacefield wrote in his <a href="https://www.datastax.com/blog/2020/09/eliminate-trade-offs-between-database-ease-use-and-massive-scale-sai-storage-attached">recent blog</a>, the new Storage Attached Index (SAI) addresses these issues, while also creating opportunities for more flexible queries in Cassandra. SAI has been designed with a format sympathetic to Cassandra’s SSTables to use significantly less disk space. Through extensive testing and optimization, SAI supports faster writes than Cassandra or DSE Search indexes.&nbsp;

Give SAI a try in your free <a href="https://astra.datastax.com/register?utm_source=devplay&amp;utm_medium=newsletter&amp;utm_campaign=20200919">Astra</a> cluster. SAI is also available in <a href="https://www.datastax.com/products/datastax-enterprise">DataStax Enterprise 6.8.3</a>. For a hands on learning experience, check out the new <a href="https://www.datastax.com/dev/cassandra-indexing">Cassandra Indexing Skills Page</a> on our <a href="https://datastax.com/dev">Developer site</a>, and read up on more details in the <a href="https://docs.astra.datastax.com/docs/using-storage-attached-indexing-sai">Astra SAI Documentation</a>.

What’s next for SAI? DataStax has submitted the Apache <a href="https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-7%3A+Storage+Attached+Index">CEP</a> to bring this functionality to the Apache version of Cassandra. We’d love your feedback to help refine this feature for the benefit of the worldwide Cassandra community.

<h4>Example of the Week&nbsp;</h4>

Our featured example for this week is a quick Storage-Attached Indexing demo. Download the schema and data set, open your cqlsh and follow along with Patricia Gorla (<a href="https://github.com/pgorla">pgorla</a>), Solutions Architect at Datastax, as she walks you through the basics of SAI:

<ul>
	<li>Watch the Expect Advice video <a href="https://www.youtube.com/watch?v=KEdnexQJ-eM&amp;t=2s">Storage-Attached Indexing: A Brief Overview</a></li>
	<li>See the code and walkthrough here: <a href="https://github.com/DataStax-Examples/sai-demo">Storage-Attached Index Demo</a></li>
	<li>See the video walkthrough here: <a href="https://www.youtube.com/watch?v=xkQT3DJdnCg">YouTube video</a></li>
</ul>

Have fun trying out this new breed of indexing, and let us know if you have any questions on this example or suggestions for future examples at <a href="mailto:developer@datastax.com">developer@datastax.com</a> or <a href="https://twitter.com/datastaxdevs?lang=en">@DataStaxDevs</a>.

<h4>Upcoming Events&nbsp;</h4>

<ul>
	<li>September 23: Cloud-native Cassandra Developer Workshop: <a href="https://www.eventbrite.co.uk/e/cloud-native-cassandra-workshop-build-cassandra-microservices-with-spring-tickets-119419917187">Building Cassandra Microservices with Spring</a></li>
	<li>September 24: Cloud-native Cassandra Developer Workshop: <a href="https://www.eventbrite.co.uk/e/cloud-native-cassandra-workshop-build-cassandra-microservices-with-spring-tickets-119419917187">Building Cassandra Microservices with Spring</a></li>
	<li>September 30: <a href="https://www.eventbrite.co.uk/e/cloud-native-cassandra-workshop-introduction-to-cassandra-for-developers-tickets-118182927317">Cloud-native Cassandra Developer Workshop: Introduction to Cassandra</a></li>
	<li>October 1: <a href="https://www.eventbrite.co.uk/e/cloud-native-cassandra-workshop-introduction-to-cassandra-for-developers-tickets-118182927317">Cloud-native Cassandra Developer Workshop: Introduction to Cassandra</a></li>
</ul>

<h4>Community Highlights from <a href="https://cassandra.link/">Cassandra.Link</a></h4>

<ul>
	<li><a href="https://cassandra.link/post/generate-your-spring-boot-angular-react-applications">JHipster </a>- Automatically generate a data access layer, API, and a front end via Angular/React for data in Cassandra! (Blueprints available in Kotlin, .NET, as well Java)</li>
	<li><a href="https://www.youtube.com/playlist?list=PLmZzyjM-vqX6f0WQYhHgIv5K-esMRcbyr">Cassandra Lunch Recordings on Youtube</a> - Weekly recordings from an informal Cassandra meetup (<a href="https://www.meetup.com/Cassandra-DataStax-DC/">Cassandra &amp; Datastax DC</a> &amp; <a href="https://www.meetup.com/Cassandra-Chicago">Cassandra Chicago</a>) on Zoom, recordings available on Youtube. Join any Wednesday 11PM CST/12PM EST</li>
	<li><a href="https://cassandra.link/post/datastax-toolkit-diagnostic-collection">Diagnostic Collection Tool</a> - Analyzing the issues on a Cassandra / DataStax cluster is not always possible online. Here’s a very useful script to gather logs/conf from a cluster.&nbsp;</li>
</ul>

<h4>New Podcast</h4>

DataStax’s Chief Strategy Officer, Sam Ramji (<a href="https://twitter.com/sramji?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor">@sramj)i</a> is hosting a new podcast series called <a href="https://www.datastax.com/resources/podcast/open-source-data">Open||Source||Data</a> that just launched this week. He’ll explore open-source data, open-source software, data on Kubernetes, data in DevOps, and data in AI with old friends and new friends. Don’t miss out on the first podcast from Patricia Boswell and upcoming podcasts from Matt Asay, Rachel Chalmers, and Kelsey Hightower by subscribing on <a href="https://open.spotify.com/show/0xzZin9suOwe9p8zFj3tuF?si=Pa3lvJrWTjKKZruz4c72ZQ">Spotify</a>, <a href="https://podcasts.apple.com/us/podcast/open-source-data/id1530428904">Apple Podcasts</a>, or <a href="https://play.google.com/music/listen?u=0#/ps/Izsix257pos65fvx6eatayzwwnq">Google podcasts</a>.&nbsp;

 
Have a suggestion or story to share? We’d love your feedback: <a href="mailto:developer@datastax.com">developer@datastax.com</a> | <a href="https://twitter.com/datastaxdevs?lang=en">@DataStaxDevs</a>.&nbsp;

Developer Newsletter: Simplify your Cassandra Data Model with Better Indexing

Jeffrey CarpenterSoftware Engineer - Stargate

Share

Share

Example of the Week

Upcoming Events

Community Highlights from Cassandra.Link

New Podcast

More Company

DataStax Acquires Langflow to Accelerate Generative AI Development

The Top 5 DataStax Stories from 2023

2023 Recap: Data = AI

DataStax Astra DB Nabs Three Prestigious 2023 TrustRadius “Best of” Awards, Dominates the Vector Databases Category

One-stop Data API for Production GenAI