Testing Apache Cassandra workloads in DataStax Astra DB
Here is a quick list of resources to help you get started with migrating or testing a workload on DataStax Astra DB.
NoSQLBench
Documentation: https://docs.datastax.com/en/astra-serverless/docs/manage/upload/test-loading-data-with-nosqlbench.html
Documentation: https://docs.nosqlbench.io/
Use NoSQLBench to test loading data to Astra with a demo cluster, keyspace, and user.
cassandra-stress
Documentation: https://cassandra.apache.org/doc/4.0/cassandra/tools/cassandra_stress.html
If you're already using Apache Cassandra, then you're likely familiar with cassandra-stress. This tool allows you to run read, write, or mixed workloads against specific Cassandra data models.
Cassandra Harry
Git repo: https://github.com/apache/cassandra-harry
Cassandra Harry is a testing tool for Apache Cassandra that generates reproducible, production-like workloads.
cql-proxy
Git repo: https://github.com/datastax/cql-proxy/
cql-proxy is an open source tool (written in Go) that acts as an intermediary between Apache Cassandra client applications and DataStax Astra DB. It lets applications connect to Astra DB quickly, without rewriting connection code to use the secure connect bundle. If you're having trouble connecting to Astra DB from an application or other product, give cql-proxy a look.
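As a rough illustration of how the proxy is started, the sketch below assembles a cql-proxy command line in Python. The flag names (--astra-bundle, --username, --password) are taken from the cql-proxy README and should be checked against the version you install; the function name, the bundle file name, and the credential values are hypothetical placeholders.

```python
import shlex


def cql_proxy_command(bundle_path, client_id, client_secret, binary="./cql-proxy"):
    """Build the argument list for starting cql-proxy against an Astra DB database.

    Flag names follow the cql-proxy README; verify them against your version.
    """
    return [
        binary,
        "--astra-bundle", bundle_path,   # path to the secure connect bundle zip
        "--username", client_id,         # Astra client ID
        "--password", client_secret,     # Astra client secret
    ]


if __name__ == "__main__":
    # Placeholder values for illustration only.
    cmd = cql_proxy_command("secure-connect-demo.zip", "my-client-id", "my-client-secret")
    # Print the command; pass `cmd` to subprocess.run to actually launch the proxy.
    print(shlex.join(cmd))
```

Once the proxy is running, the application points its existing Cassandra driver configuration at the proxy's local address instead of a Cassandra node. Check the README for the default listen address and port.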
DSBulk
Documentation: https://docs.datastax.com/en/dsbulk/doc/dsbulk/reference/dsbulkCmd.html
Git repo: https://github.com/datastax/dsbulk
DSBulk is an open source tool that quickly imports data into, and exports data from, Apache Cassandra and DataStax Astra DB. It also has a count function, which is one of the most accurate ways to get a row count on a large table.
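To make the count function concrete, here is a minimal Python sketch that assembles a dsbulk count command for an Astra DB table. The short options (-k keyspace, -t table, -b secure connect bundle, -u and -p credentials) are the ones described in the DSBulk reference linked above; the function name and all values passed to it are hypothetical, and the sketch assumes the dsbulk binary is on your PATH.

```python
import shlex


def dsbulk_count_command(keyspace, table, bundle_path, client_id, client_secret,
                         binary="dsbulk"):
    """Build the argument list for counting rows in one table with DSBulk."""
    return [
        binary, "count",
        "-k", keyspace,       # keyspace that holds the table
        "-t", table,          # table whose rows are counted
        "-b", bundle_path,    # Astra DB secure connect bundle zip
        "-u", client_id,      # Astra client ID
        "-p", client_secret,  # Astra client secret
    ]


if __name__ == "__main__":
    # Placeholder names for illustration only.
    cmd = dsbulk_count_command("demo_ks", "demo_table", "secure-connect-demo.zip",
                               "my-client-id", "my-client-secret")
    # Print the command; pass `cmd` to subprocess.run to execute it.
    print(shlex.join(cmd))
```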