Cassandra.Link

The best knowledge base on Apache Cassandra®

Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.

1/20/2023

How to tweak num_tokens (vnodes) in a live Cassandra cluster

by Payal Kumari



Some clients have asked us to change num_tokens as their requirements change.

For example, a lower num_tokens value is recommended when using DSE Search.

The most important thing during this process is that the cluster stays up, and is healthy and fast. Anything we do needs to be deliberate and safe, as we have production traffic flowing through.

The process includes adding a new DC with the changed num_tokens value, decommissioning the nodes of the old DC one by one, and letting Cassandra’s automatic mechanisms distribute the existing data to the new nodes.

The procedure below assumes that you have two datacenters, DC1 and DC2.

1. Run repair to keep data consistent across the cluster

Make sure to run a full repair with nodetool repair before you start. This ensures that all data has been propagated from the datacenter that is being decommissioned.
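
As a rough sketch of this step, the helper below only prints one full-repair command per node, so the commands can be reviewed and then run one at a time. The node addresses are hypothetical placeholders. The --full flag assumes Cassandra 2.2 or later, where incremental repair is the default, and -h assumes JMX is reachable from the machine you run nodetool on; otherwise run the command locally on each node.

```shell
#!/usr/bin/env bash
# Sketch: print one full-repair command per node so they can be reviewed
# and then run one at a time.
# Usage: repair_commands <host> [<host> ...]
repair_commands() {
  local host
  for host in "$@"; do
    echo "nodetool -h ${host} repair --full"
  done
}

# Hypothetical node addresses; replace them with your own.
repair_commands 10.0.1.11 10.0.1.12 10.0.2.11
```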

2. Add new DC DC3 and decommission old Datacenter DC1

Step 1: Download and install the same Cassandra version as the other nodes in the cluster on each new node, but do not start it.

How to stop Cassandra

Note: Don’t stop any node in DC1 until DC3 has been added.

If you used the Debian package, Cassandra starts automatically after installation. You must stop the new node and clear its data.

Stop the node:

Packaged installations:
$ sudo service cassandra stop

Tarball installations:
$ nodetool stopdaemon

If for some reason the previous command doesn’t work, find the Cassandra Java process ID (PID), and then kill the process using its PID number:
$ ps auwx | grep cassandra
$ sudo kill <pid>

Step 2: Once the new node is down, clear the data from its default directories.

sudo rm -rf /var/lib/cassandra/*

Step 3: Configure each new node with settings that match the other nodes in the cluster.

Set the following properties by comparing them with the existing nodes.

cassandra.yaml:

seeds: This should include nodes from a live DC, because the new nodes have to stream data from them.
endpoint_snitch: Keep it the same as on the nodes in the live DC.
cluster_name: Must be the same as on the nodes in the live DC.
num_tokens: The new number of vnodes required.
initial_token: Make sure this is commented out.

Set the node-local parameters below:

auto_bootstrap: false
listen_address: Local to the node
rpc_address: Local to the node
data_file_directories: Local to the node
saved_caches_directory: Local to the node
commitlog_directory: Local to the node

cassandra-rackdc.properties: Set the parameters for the new datacenter and rack:

dc=<dc name>
rack=<rack name>

Set the configuration files below, as needed:

cassandra-env.sh

logback.xml

jvm.options
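
Before starting a new node, it can help to double-check the three cassandra.yaml settings that matter most for this procedure. The function below is a minimal sketch of such a check, not an official tool: its name is made up for illustration, and it assumes each property sits on its own line at the start of the line, as in the stock cassandra.yaml.

```shell
#!/usr/bin/env bash
# Sketch: sanity-check a new node's cassandra.yaml before its first start.
# Prints one line per problem; prints nothing and returns 0 if all checks pass.
# Usage: check_new_node_yaml <path-to-cassandra.yaml> <expected num_tokens>
check_new_node_yaml() {
  local yaml="$1" expected_tokens="$2" problems=0

  # num_tokens must be set to the new vnode count.
  if ! grep -Eq "^num_tokens:[[:space:]]*${expected_tokens}[[:space:]]*$" "$yaml"; then
    echo "num_tokens is not set to ${expected_tokens}"; problems=1
  fi
  # initial_token must not be active (commented out or absent).
  if grep -Eq "^initial_token:" "$yaml"; then
    echo "initial_token is still set"; problems=1
  fi
  # auto_bootstrap must be false so the node joins empty and waits for rebuild.
  if ! grep -Eq "^auto_bootstrap:[[:space:]]*false" "$yaml"; then
    echo "auto_bootstrap is not false"; problems=1
  fi
  return "$problems"
}
```

A call might look like check_new_node_yaml /etc/cassandra/cassandra.yaml 16, where both the path (the usual location for packaged installations) and the value 16 are only examples.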

Step 4: Start Cassandra on each node, one by one.

Step 5: Now that all nodes are up and running, alter each keyspace to add the new datacenter to its replication settings, with the number of replicas it should hold.

ALTER KEYSPACE Keyspace_name WITH REPLICATION = {'class' : 'NetworkTopologyStrategy', 'dc1' : 3, 'dc2' : 3, 'dc3' : 3};
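
Because the same statement usually has to be repeated for several keyspaces, one option is to generate it. The sketch below builds the statement from "dc:rf" pairs and prints it for review. The function name and my_keyspace are hypothetical, and system_auth is included on the assumption that your system keyspaces also need replicas in the new datacenter.

```shell
#!/usr/bin/env bash
# Sketch: build an ALTER KEYSPACE statement from "dc:rf" pairs.
# Usage: alter_keyspace_cql <keyspace> <dc:rf> [<dc:rf> ...]
alter_keyspace_cql() {
  local keyspace="$1" pair replication="'class': 'NetworkTopologyStrategy'"
  shift
  for pair in "$@"; do
    # ${pair%%:*} is the datacenter name, ${pair##*:} is its replication factor.
    replication="${replication}, '${pair%%:*}': ${pair##*:}"
  done
  echo "ALTER KEYSPACE ${keyspace} WITH REPLICATION = {${replication}};"
}

# Print one statement per keyspace; review the output before running it in cqlsh.
for ks in my_keyspace system_auth; do
  alter_keyspace_cql "$ks" dc1:3 dc2:3 dc3:3
done
```

Datacenter names in the replication map are case-sensitive, so they should match what nodetool status reports.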

Step 6: Finally, now that the new nodes are up and empty, run “nodetool rebuild” on each node in the new datacenter to stream data from an existing datacenter.

nodetool rebuild -- <existing DC name>
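
The rebuild can be driven from one machine, node by node. The following is a sketch under stated assumptions: the source datacenter name and node addresses are hypothetical, -h assumes JMX is reachable remotely, and the script defaults to a dry run that only prints the commands.

```shell
#!/usr/bin/env bash
# Sketch: rebuild every node of the new datacenter from an existing one.
SOURCE_DC="dc2"                    # hypothetical: a live DC to stream from
NEW_NODES="10.0.3.11 10.0.3.12"    # hypothetical: nodes of the new DC
DRY_RUN="${DRY_RUN:-1}"            # 1 = only print the commands (default)

rebuild_new_dc() {
  local host
  for host in $NEW_NODES; do
    if [ "$DRY_RUN" = "1" ]; then
      echo "nodetool -h $host rebuild -- $SOURCE_DC"
    else
      # Stop at the first failure so a failed rebuild is not silently skipped.
      nodetool -h "$host" rebuild -- "$SOURCE_DC" || return 1
    fi
  done
}

rebuild_new_dc
```

While a rebuild is running, nodetool netstats on the rebuilding node shows the active streams.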

Step 7: After the rebuild has completed on every node, remove “auto_bootstrap: false” from each cassandra.yaml or set it to true.

auto_bootstrap: true

Decommission DC1:

Now that we have added DC3 to the cluster, it’s time to decommission DC1. However, before decommissioning a datacenter in a production environment, the first step should be to prevent clients from connecting to it and to ensure that reads and writes do not query this datacenter.

Step 1: Prevent clients from communicating with DC1

First of all, ensure that the clients point to a datacenter that will remain in the cluster.
Set the local datacenter of DCAwareRoundRobinPolicy to that datacenter so that no requests are routed to DC1.

Make sure to change the QUORUM consistency level to LOCAL_QUORUM and ONE to LOCAL_ONE.

Step 2: ALTER KEYSPACE so that it no longer has replicas in the DC being decommissioned.

ALTER KEYSPACE Keyspace_name WITH REPLICATION = {'class' : 'NetworkTopologyStrategy', 'dc2' : 3, 'dc3' : 3};

Step 3: Decommission each node in DC1, one at a time, by running nodetool decommission on that node.

nodetool decommission
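
To confirm that the nodes have actually left the ring, you can count how many nodes nodetool status still lists for the old datacenter. The helper below is a small sketch with a made-up function name. It assumes the usual nodetool status layout, in which each section starts with a "Datacenter: <name>" line and each node line starts with a two-letter status/state code.

```shell
#!/usr/bin/env bash
# Sketch: count the nodes that "nodetool status" still lists for a datacenter.
# Reads "nodetool status" output on stdin.
# Usage: nodetool status | nodes_in_dc <datacenter>
nodes_in_dc() {
  awk -v dc="$1" '
    # A "Datacenter: <name>" line starts a new datacenter section.
    /^Datacenter:/ { current = $2; next }
    # Node lines begin with a two-letter status/state code such as UN or UL.
    current == dc && /^[UD][NLJM][[:space:]]/ { count++ }
    END { print count + 0 }
  '
}
```

The old datacenter is empty once nodetool status | nodes_in_dc dc1 prints 0, with dc1 replaced by the datacenter name that nodetool status reports.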

Step 4: Once all nodes are decommissioned, stop Cassandra on each of them as described under “How to stop Cassandra” above.

Step 5: Finally, remove all data from the data, saved caches, and commitlog directories to reclaim disk space.

sudo rm -rf "data_directory" "saved_caches_directory" "commitlog_directory"

3. Add new DC DC4 and decommission old DC2

Repeat the procedure from section 2 for the second datacenter: add DC4 with the new num_tokens value, then decommission each node in DC2 by following the steps above.

Hopefully, this blog post will help you understand the procedure for changing the number of vnodes on a live cluster. Keep in mind that the time taken by bootstrapping, rebuilding, and decommissioning depends on data size.



© 2023 Anant Corporation

Apache, the Apache feather logo, Apache Cassandra, Cassandra, and the Cassandra logo, are either registered trademarks or trademarks of The Apache Software Foundation.