Toggle Menu

What is Apache Cassandra?

Apache Cassandra® is the only distributed NoSQL database that delivers the always-on availability, blisteringly fast read-write performance, and unlimited linear scalability needed to meet the demands of successful modern applications.

Create an Astra Account
Apache Cassandra logo

Cassandra 4.0 is here! Learn more about the fastest, most reliable, and easiest-to-run open-source Cassandra to date.

Learn More

Want to run open-source Cassandra with peace of mind? Learn about DataStax Luna, enterprise support for Apache Cassandra.

Learn More

Apache Cassandra explained

Apache Cassandra is a distributed NoSQL database created at Facebook and later released as an open-source project in July 2008.

Cassandra delivers the continuous availability (zero downtime), high performance, and linear scalability that modern applications require, while also offering operational simplicity and effortless replication across data centers and geographies. It can handle petabytes of information and thousands of concurrent operations per second, enabling organizations to manage large amounts of data across hybrid and multi-cloud environments.

Apache Cassandra vs. traditional relational databases

Cassandra differs from a typical relational database in the following ways:

Apache CassandraRelational Database
Handles high incoming data velocityHandles moderate incoming data velocity
Supports simple transactionsSupports complex/nested transactions
No single points of failure; constant uptimeSingle points of failure with failover
Supports very high data volumesSupports moderate data volumes
Decentralized deploymentsCentralized deployments
Data written in many locationsData written in mostly one location
Supports read and write scalabilitySupports read scalability (with consistency sacrifices)
Deployed in horizontal scale-out fashionDeployed in vertical scale-up fashion

Please check out our NoSQL primer, if you'd like to learn more about how Cassandra and other NoSQL databases compare to relational databases.

History of Apache Cassandra

Apache Cassandra’s key features and benefits

Whether you need to process server logs, emails, social media posts, or PDFs, Cassandra has you covered. You’ll be able to make better-informed decisions without leaving any of your data on the table.

Here are some of Cassandra's key benefits and features:


Open Source

Open source: Modern software development organizations have overwhelmingly moved to adopt open-source technologies, starting with the Linux operating system and progressing to infrastructure for managing data. Open-source technologies are attractive because of their affordability and extensibility, as well as the flexibility to avoid vendor lock-in. Organizations adopting open source report higher speed of innovation and faster adoption.

Flexible, familiar interface: The Cassandra Query Language (CQL) is similar to SQL. That means most developers should have a fairly easy time becoming familiar with it. Learn more about CQL below.

High Performance

High performance: The majority of traditional databases feature a primary / secondary architecture. In these configurations, a single primary replica performs read and write operations, while secondary replicas are only able to perform read operations. Downsides to this architecture include increased latency, higher costs, and lower availability at scale. With Cassandra, no single node is in charge of replicating data across a cluster. Instead, every node is capable of performing all read and write operations. This improves performance and adds resiliency to the database.


Zero Downtime

Active everywhere with zero downtime: Since every Cassandra node is capable of performing read and write operations, data is quickly replicated across hybrid cloud environments and geographies. If a node fails, users will be automatically routed to the nearest healthy node. They won’t even notice that a node has been knocked offline because applications behave as designed even in the event of failure. As a result, applications are always available and data is always accessible and never lost. What’s more, Cassandra’s built-in repair services fix problems immediately when they occur—without any manual intervention. Productivity doesn’t need to take a hit if nodes fail.


scalability

Scalability: In traditional environments, scaling applications is a time-consuming and costly process typically accomplished by scaling vertically with more expensive machines. Cassandra enables you to scale horizontally by simply adding more nodes to the cluster. If, for example, four nodes can handle 200,000 transactions/second, eight nodes will be able to handle 400,000 transactions/second.

Seamless Replication

Seamless replication: Today’s leading enterprises are increasingly moving to multi-data center, hybrid cloud, and even multi-cloud deployments to take advantage of the strengths of each, without getting locked into any single provider’s ecosystem. Getting the most out of multi-cloud environments, however, starts with having an underlying cloud database that offers scalability, security, performance, and availability. For these reasons, it should come as no surprise that the cloud database market is expected to grow nearly 65% each year and reach $68.9 billion by 2022.


Understanding Cassandra’s Query Language (CQL)

Apache Cassandra

Where is Apache Cassandra headed next?

At DataStax, we’re working hard with the open-source community to build on Cassandra’s decade-plus maturity to solidify its position as the leading database for cloud-native applications.

Cassandra has traditionally been known as an extremely powerful database that stands up to the most demanding use cases, but also as difficult to learn and operate. DataStax is committed to working with the Cassandra community to make it easier to use, adopt, and extend for your needs.

Here are some of the ideas we’re exploring:

  • Providing simplified developer APIs including REST and GraphQL
  • Adding more SQL-like capabilities into CQL, including indexing, joins, and ACID, as well as full JSON support
  • Standard management APIs and an official, project-backed Kubernetes operator
  • Making the storage engine pluggable, along with other APIs, to allow customization of the database for different deployments and usage profiles

How can I get started?

If you’d like to learn more about Apache Cassandra, we have several resources here to get you started.

DataStax for Developers

DataStax for Developers

Learn how to succeed with Apache Cassandra™.

Visit now
DataStax Astra DB

Try DataStax Astra DB

Rapidly build cloud-native applications with DataStax Astra DB, a database-as-a-service built on Apache Cassandra.

Try for free
Apache Cassandra 4.0 White Paper

Apache Cassandra 4.0 White Paper

Get your free digital copy to harness Cassandra 4.0's performance and reliability

Get the White Paper