Testing Apache Cassandra workloads in DataStax AstraDB

Here is a quick list of resources to help get you started with migrating or testing a workload on DataStax Astra DB.

NoSqlBench

Documentation: https://docs.datastax.com/en/astra-serverless/docs/manage/upload/test-loading-data-with-nosqlbench.html

Documentation: https://docs.nosqlbench.io/

Use NoSQLBench to test loading data to Astra with a demo cluster, keyspace, and user.

cassandra-stress

Documentation: https://cassandra.apache.org/doc/4.0/cassandra/tools/cassandra_stress.html

If you're already using Apache Cassandra, then you're likely familiar with cassandra-stress. This tool allows you to run read, write, or mixed workloads against specific Cassandra data models.

Cassandra Harry

Git repo: https://github.com/apache/cassandra-harry

A testing tool for Apache Cassandra which generates reproducible, production-like workloads.

cql-proxy

Git repo: https://github.com/datastax/cql-proxy/

cql-proxy is an open source tool (written in Go) designed to function as an intermediary between Apache Cassandra and DataStax Astra DB. It can ensure that applications can quickly connect to Astra DB, without rewriting connection code to use the secure cloud bundle. If you're having trouble connecting to Astra DB from an application or other product, give the cql-proxy a look.

DSBulk

Documentation: https://docs.datastax.com/en/dsbulk/doc/dsbulk/reference/dsbulkCmd.html

Git repo: https://github.com/datastax/dsbulk

DSBulk is an open source tool which allows for data to be quickly imported and exported from Apache Cassandra and DataStax Astra DB. It also has a

count
function, which is one of the most-accurate ways to get a rowcount on a large table.