Cassandra Toolkit

A hand-curated and tested list of tools that we recommend for building, managing, and monitoring Cassandra clusters.

Find more tools from the open source community at www.cassandra.tools.

Maintained by Rahul Singh of Anant. Feel free to contact me if you'd like to collaborate on this and other tools. Please take a look!

Tools


  • Tableanalyzer / cassandra.vision

    • A Python-based cfstats data analyzer, with plans to visualize stats from other Cassandra and distributed platforms.
  • Tablesnap

    • Tablesnap is a script that uses inotify to monitor a directory for IN_MOVED_TO events and reacts to them by spawning a new thread to upload that file to Amazon S3, along with a JSON-formatted list of what other files were in the directory at the time of the copy.
    • To install and configure tablesnap, follow the "Install tablesnap for AWS S3 backups" section in dseauto/ansible/cassandra/README.md.
  • node_exporter

    • Prometheus exporter for hardware and OS metrics exposed by *NIX kernels, written in Go with pluggable metric collectors.
    • To install and configure node_exporter, follow the "Install node_exporter for exporting node metrics to prometheus server" section in dseauto/ansible/cassandra/README.md.
  • Cassandra Reaper

    • Reaper is an open source tool that aims to schedule and orchestrate repairs of Apache Cassandra clusters.
    • It improves the existing nodetool repair process by:
      • Splitting repair jobs into smaller tunable segments.
      • Handling back-pressure through monitoring running repairs and pending compactions.
      • Adding ability to pause or cancel repairs and track progress precisely.
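
To make the idea of "smaller tunable segments" concrete, here is a minimal sketch of splitting one token range into equal, contiguous sub-ranges. It is an illustration only and is not Reaper's actual algorithm, which also takes the ring topology into account. The function name `split_range` and the `segment_count` parameter are hypothetical. The token bounds assume the Murmur3Partitioner.

```python
# Illustrative sketch only; this is not Reaper's implementation.
# Token bounds below assume the Murmur3Partitioner.
MIN_TOKEN = -2**63
MAX_TOKEN = 2**63 - 1


def split_range(start, end, segment_count):
    """Split the token range (start, end] into contiguous, near-equal segments.

    Returns a list of (segment_start, segment_end) tuples. Wrap-around
    ranges (end <= start) are not handled in this sketch.
    """
    if segment_count < 1:
        raise ValueError("segment_count must be at least 1")
    if end <= start:
        raise ValueError("end must be greater than start")
    total = end - start
    # Never produce more segments than there are tokens in the range.
    segment_count = min(segment_count, total)
    # Integer arithmetic keeps the boundaries exact for 64-bit tokens.
    bounds = [start + total * i // segment_count
              for i in range(segment_count + 1)]
    return list(zip(bounds[:-1], bounds[1:]))


if __name__ == "__main__":
    for seg_start, seg_end in split_range(MIN_TOKEN, MAX_TOKEN, 4):
        print(seg_start, seg_end)
```

Each segment could then be repaired on its own, so a failed or paused repair only needs to redo one small segment rather than the whole range.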

Credits


  • Rahul Singh - Concept, Curator, Creator of tableanalyzer
  • Sean Bogaard - Concept, Advisor, Curator
  • Ion Olaru - Testing, Documentation of 3rd Party Tools

Eventually we want compatibility for the following items:

Platform              Receive              Transform
DSE 4.8.x             Diagnostic Tarball   Y
DSE 4.8.x/C* 2.1.x    Nodetool             Y
DSE 4.8.x/C* 2.1.x    SSH                  Y
DSE 5.1.x             Diagnostic Tarball   Y
DSE 5.1.x/C* 3.1.x    Nodetool             Y
DSE 5.1.x/C* 3.1.x    SSH                  Y
DSE 6.7.x             Diagnostic Tarball   Y
DSE 6.7.x/C* 4.0.x    Nodetool             Y
DSE 6.7.x/C* 4.0.x    SSH                  Y
Scylla?               Tarball              Y
Elassandra?           Tarball              Y
YugaByte?             Tarball              Y
CosmosDB?             Tarball              Y
AWS MCS?              Tarball              Y

Resources


Backup

Cluster Management

Snapshot strategy (future changes)

  1. Take and keep a snapshot every 30 minutes for the last 3 hours;
  2. Keep a snapshot every 6 hours for the last day, delete other snapshots;
  3. Keep a snapshot every day for the last month, delete other snapshots.
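
The three retention rules above can be sketched as a small selection function. This is only one possible reading of the strategy, and it rests on assumptions that are not in the original text: "month" is taken as 30 days, time buckets are aligned to the UTC epoch, the oldest snapshot in each bucket is the one kept, and all timestamps are timezone-aware UTC datetimes. The names `TIERS` and `snapshots_to_keep` are hypothetical.

```python
from datetime import datetime, timedelta, timezone

# (maximum snapshot age, bucket width) for each retention tier.
TIERS = [
    (timedelta(hours=3), timedelta(minutes=30)),  # rule 1
    (timedelta(days=1), timedelta(hours=6)),      # rule 2
    (timedelta(days=30), timedelta(days=1)),      # rule 3, "month" assumed = 30 days
]

# Buckets are aligned to the UTC epoch (an assumption of this sketch).
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def snapshots_to_keep(snapshot_times, now):
    """Return the sorted snapshot timestamps that the retention rules keep.

    Each snapshot is assigned to the first tier whose maximum age covers it,
    and only the oldest snapshot per time bucket in that tier is kept.
    Snapshots older than the last tier are dropped.
    """
    keep = {}
    for taken in sorted(snapshot_times):
        age = now - taken
        for max_age, width in TIERS:
            if age <= max_age:
                bucket = (taken - EPOCH) // width
                # setdefault keeps the first (oldest) snapshot seen per bucket.
                keep.setdefault((width, bucket), taken)
                break
    return sorted(keep.values())


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    # Hypothetical input: one snapshot every 10 minutes for the last 2 days.
    snapshots = [now - timedelta(minutes=10 * i) for i in range(288)]
    kept = set(snapshots_to_keep(snapshots, now))
    to_delete = [s for s in snapshots if s not in kept]
    print(len(kept), "kept,", len(to_delete), "to delete")
```

Anything not returned by `snapshots_to_keep` is a candidate for deletion; the actual removal (for example with nodetool clearsnapshot) is left out of this sketch.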