DataStax OpsCenter Documentation

Managing backups and restoring from backups

Using OpsCenter Enterprise Edition, you can take, schedule, and manage backups across all registered clusters.

A backup is a snapshot of all on-disk data files (SSTable files) stored in the data directory. Backups are taken per keyspace and while the system is online. A backup first flushes all in-memory writes to disk, then makes a hard link of the SSTable files for each keyspace. Backups are stored in the snapshots directory of the column family that's being snapshotted. For example, /var/lib/cassandra/data/cfs/snapshots.

You must have enough free disk space on the node to accommodate making snapshots of your data files. A single snapshot requires little disk space. However, snapshots will cause your disk usage to grow more quickly over time because a snapshot prevents old obsolete data files from being deleted. OpsCenter Data Backups allows you to specify a schedule to remove old backups and prevent backups from being taken when disk space falls below a specified level.

Note

OpsCenter Data Backups does not show or manage manual snapshots taken using the nodetool snapshot command.

Scheduling a backup

To schedule a backup:

  1. In the OpsCenter Dashboard, click Data Backups.

  2. Click Schedule Backup.

  3. In Add Backup, select the backup parameters:

    • Select a Keyspace to backup - Select the keyspace that you want to back up.
    • Schedule - Select a frequency and timezone for your backup. GMT is the default timezone.
    • Cleanup - Choose a frequency to remove old backups. (If not specified, you should manually cleanup snapshots.)
  4. Click Save.

  5. To set the percentage of free disk space at which backups are prevented, click Configure and then enter the appropriate information.

    The percentage of free disk space that you set applies to all nodes in the cluster.

Detailed information about the backup is recorded in the Event Log.

Restoring from a backup

You can restore from any local backups that have been run by OpsCenter, but not from snapshots run from nodetool. You can pick any subset of column families that exist in the snapshot to restore.

To restore a backup:

  1. Click Data Backups.
  2. Find the backup you wish to restore in the list of backups.
  3. Click Restore.
  4. Choose which keyspaces to retore.
  5. (Optional) Choose individual column families within a chosen keyspace to restore.
  6. (Optional) Click the Truncate/delete existing data before restore checkbox.
  7. (Optional) Click the Throttle stream throughput at ____ MB chekbox and enter the desired throttling value.
  8. Click Restore.