All Blogs

Information and insight on all things NoSQL, Apache Cassandra™, and DataStax.

Company

Read news, customer stories, NoSQL trends, and more.

Developer

Read technical guides, community news, code snippets, expert tips, and more.

developers
10 October, 2011

Leveled Compaction in Apache Cassandra

Authors: Jonathan Ellis
Apache Cassandra™

Introduction to Compaction: Cassandra's log-structured storage engine enables Cassandra's amazing performance and features like application-transparent compression by turning all…

developers
3 October, 2011

Hadoop MapReduce in the Cassandra Cluster

Authors: Eric Gilmore
Hadoop

NOTE: This post pre-dates our Brisk announcement. Brisk is a new distribution that enhances the Hadoop and Hive platform with scalable low-latency data capabilities. In our…

developers
29 September, 2011

What’s New in Cassandra 1.0: Improved Memory and Disk Space Management

Authors: Jonathan Ellis

Cassandra 1.0 incorporates several improvements to how Cassandra's storage engine manages memory and disk space, bringing better performance and addressing some of the most common…

developers
19 September, 2011

What’s new in Cassandra 1.0: Compression

Authors: Pavel Yaskevich

Cassandra 1.0 introduces support for data compression on a per-ColumnFamily basis, one of the most-requested features since the project started. Compression maximizes the storage…

developers
1 September, 2011

Using the Cassandra Bulk Loader

Authors: Sylvain Lebresne
DataStax Bulk Loader

Bulk loading data in Cassandra has historically been difficult. Although Cassandra has had the BinaryMemtable interface from the very beginning, BinaryMemtable is hard to use and…

developers
20 June, 2011

Brisk 1.0 Beta 2 Released

Authors: Kris Hahn

DataStax has released Brisk 1.0 Beta 2! You can download Brisk from the DataStax web site. New Features in Brisk 1.0 Beta 2: The following new features have been added in this…

developers
17 June, 2011

What’s New in Cassandra 0.8, Part 2: Counters

Authors: Sylvain Lebresne

Deprecation warning: This post covers the obsolete Cassandra 0.8. Modern Cassandra uses counters via CQL. Original post: One of the features making its debut in Cassandra 0.8.0 is…

developers
31 May, 2011

Understanding Hinted Handoff (in Cassandra 0.8)

Authors: Jonathan Ellis

This post describes hinted handoff for obsolete versions of Cassandra. For a description of how it works today, see this update. Hinted Handoff is an optional part of writes whose…

developers
4 April, 2011

Maximizing Cache Benefit with Cassandra

Authors: Eric Gilmore

Cassandra’s built-in key and row caches can provide very efficient data caching. Already, several Cassandra users who care deeply about read performance have leveraged these caching…

developers
22 March, 2011

Deploying Cassandra across Multiple Data Centers

Authors: Eric Gilmore

Cassandra is designed as a distributed system, for deployment of large numbers of nodes across multiple data centers. Key features of Cassandra’s distributed architecture are…

developers
6 January, 2011

What’s new in Cassandra 0.7: expiring columns

Authors: Sylvain Lebresne

Deprecation warning: This post covers the obsolete Cassandra 0.7. Modern Cassandra manipulates expiring columns using CQL. Original post: Sometimes, data comes with an expiration…

developers
3 December, 2010

What’s new in Cassandra 0.7: Secondary indexes

Authors: Jonathan Ellis

Deprecation warning: This post covers the obsolete Cassandra 0.7. Modern Cassandra manipulates indexes using CQL. Overview: In Cassandra, indexes on column values are called "…

developers
11 November, 2010

SPOF 0: Why Every Node in a Cassandra Cluster is the Same

Authors: DataStax

In a distributed system, component failure should be expected, particularly as the size of the infrastructure grows. Single points of failure in systems marketing themselves as…

29 October, 2010

Why does Scalability matter, and how does Cassandra scale?

Authors: Matt Pfeil

The term scalability is inherently a bit amorphous and typically dependent on a specific use case. For the sake of this discussion, we'll define scalability as the ability to add…

developers
22 October, 2010

What is persistence and why does it matter?

Authors: Matt Pfeil
Architecture

Understanding the meaning of persistence is important for evaluating different data store systems. Given the importance of the data store in most modern applications, making a…

developers
12 October, 2010

What is Cassandra?

Authors: Matt Pfeil

Apache Cassandra is a high-performance, scalable, open source database designed for real-time transactions and analytics. Cassandra offers sub-millisecond read and write times - it's…

developers
14 September, 2010

Cassandra Flume Sink and Logsandra Integration

Authors: Tyler Hobbs

Flume is an open source project described as a “distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data”. Because…

developers
9 August, 2010

(Re)Introducing Hector: A Java-based Client for Apache Cassandra

Authors: DataStax

Deprecation warning: Hector is a client for the obsolete Thrift API. New applications should use the DataStax Java Driver. Hector: Hector, the first actively supported Java client…

developers
27 July, 2010

Announcing RPMs for Cassandra

Authors: Jonathan Ellis

Apache Cassandra has long had support for Debian packaging in its tree, but users of RPM-based distributions such as Red Hat Enterprise Linux or Fedora have had to roll their own…
