Illustration Image

Cassandra.Link

The best knowledge base on Apache Cassandra®

Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.

9/6/2018

Reading time:2 min

Cassandra Hits One Million Writes Per Second on Google Compute Engine

by John Doe

Google is known for creating scalable high performance systems. In a recent blog post, we demonstrated how Google Cloud Platform can rapidly provision and scale networking load to handle one million requests per second. A fast front end without a fast backend has limited use, so we decided to demonstrate a backend serving infrastructure that could handle the same load. We looked at popular open source building blocks for cloud applications and choose Cassandra, a NoSQL database designed for scale and simplicity.Using 330 Google Compute Engine virtual machines, 300 1TB Persistent Disk volumes, Debian Linux, and Datastax Cassandra 2.2, we were able to construct a setup that can:sustain one million writes per second to Cassandra with a median latency of 10.3 ms and 95% completing under 23 mssustain a loss of ⅓ of the instances and volumes and still maintain the 1 million writes per second (though with higher latency)scale up and down linearly so that the configuration described can be used to create a cost effective solutiongo from nothing in existence to a fully configured and deployed instances hitting 1 million writes per second took just 70 minutes. A configured environment can achieve the same throughput in 20 minutes. Cassandra Quorum commit You can find the instructions on how to reproduce the results by following the setup instructions.ResultsWith 15,000 concurrent clients Cassandra was able to maintain 10.5ms median latency (8.3ms with 12,000 clients), and 95th latency percentile at 23ms. Here is how the solution scales as the number of concurrent clients grows:Below we show a graph of the throughput versus 95th percentile latency which quickly achieves very good response times after Cassandra initializes its internal state, and Java warms up its heap and memory mapped files table. This test was run longer than the minimal time required to hit over 1M writes per second in order to show the sustained throughput:In addition to looking at top end performance we also looked at resiliency. We removed ⅓ of the cluster nodes and it remained functional and serving more than 1M writes per second. Median latency held at 13.5ms, 95th percentile at 61.8ms, and 994.9th percentile at 1,333.5ms. We consider those numbers very good for a cluster in distress, proving Compute Engine and Cassandra can handle both spiky workloads and failures.ConclusionTuning the workload costs $5 per hour (on a 3 node cluster), and the minimal test required to hit one million writes per second takes 1 hour and 10 minutes at a cost of $330 USD when run in March 2014. Putting it all together, this means the Google Cloud Platform was able to sustain one million Cassandra writes per second at a cost of $0.07 USD per million writes.-Posted by Ivan Santa Maria Filho, Performance Engineering Lead

Illustration Image

Read this article if you want to know more about Cassandra Hits One Million Writes Per Second on Google Compute Engine

Related Articles

data
cassandra
database

Data Structures and Types Explained

Zac Amos

1/5/2024

json
api
stargate

Building Data Services with Apache Cassandra | DataStax

John Doe

3/21/2023

data
cassandra
database

The Curious Case Of Tombstones

John Doe

9/26/2018

data
cassandra

Understanding How Cassandra Stores Data

John Doe

9/26/2018

data
cassandra

Apache Drill Contribution Ideas - Apache Drill

John Doe

9/26/2018

data
cassandra

A rock and a hard place: Between ScyllaDB and Cassandra

George Anadiotis

9/6/2018

data.engineering
data
cassandra

Getting started with the Kafka Connect Cassandra Source

John Doe

5/12/2018

acid
open.source
cassandra

GitHub - pmcfadin/awesome-accord: Repository of all kinds of things to help you get up and running with ACID transactions on Apache Cassandra®

',p,m,c,f,a,d,i,n,'

1/16/2025

mongo
nocode
elasticsearch

GitHub - ibagroup-eu/Visual-Flow: Visual-Flow main repository

ibagroup-eu

12/2/2024

mongo
nocode
elasticsearch

GitHub - ibagroup-eu/Visual-Flow: Visual-Flow main repository

',i,b,a,g,r,o,u,p,-,e,u,'

12/2/2024

Checkout Planet Cassandra

Claim Your Free Planet Cassandra Contributor T-shirt!

Make your contribution and score a FREE Planet Cassandra Contributor T-Shirt! 
We value our incredible Cassandra community, and we want to express our gratitude by sending an exclusive Planet Cassandra Contributor T-Shirt you can wear with pride.

Join Our Newsletter!

Sign up below to receive email updates and see what's going on with our company

​

Explore Related Topics

AllKafkaSparkScyllaSStableKubernetesApiGithubGraphQl

Explore Further

data

data
cassandra
database

Data Structures and Types Explained

Zac Amos

1/5/2024

json
api
stargate

Building Data Services with Apache Cassandra | DataStax

John Doe

3/21/2023

data
cassandra
database

The Curious Case Of Tombstones

John Doe

9/26/2018

data
cassandra

Understanding How Cassandra Stores Data

John Doe

9/26/2018

cassandra

acid
open.source
cassandra

GitHub - pmcfadin/awesome-accord: Repository of all kinds of things to help you get up and running with ACID transactions on Apache Cassandra®

',p,m,c,f,a,d,i,n,'

1/16/2025

mongo
nocode
elasticsearch

GitHub - ibagroup-eu/Visual-Flow: Visual-Flow main repository

ibagroup-eu

12/2/2024

mongo
nocode
elasticsearch

GitHub - ibagroup-eu/Visual-Flow: Visual-Flow main repository

',i,b,a,g,r,o,u,p,-,e,u,'

12/2/2024

migration
proxy
cassandra

GitHub - datastax/cql-proxy: A client-side CQL proxy/sidecar.

datastax

11/1/2024

spark

sstable
cassandra
spark

Spark and Cassandra’s SSTable loader

Arunkumar

11/1/2024

analytics
cassandra
spark

GitHub - apache/cassandra-analytics: Apache cassandra

apache

9/4/2024

cassandra
event.driven
spark

Build an Event-Driven Architecture with Apache Kafka, Apache Spark, and Apache Cassandra

DataStax

8/3/2024

python
cassandra
spark

GitHub - andreia-negreira/Data_streaming_project: Data streaming project with robust end-to-end pipeline, combining tools such as Airflow, Kafka, Spark, Cassandra and containerized solution to easy deployment.

andreia-negreira

12/2/2023

Join Our Newsletter!

Sign up below to receive email updates and see what's going on with our company.

​
Contact Info

3 Washington Circle NW Suite 301 - Washington, D.C. 20037

support@anant.us

(855) 262-6526

Resources

Services

Careers

Events

Contact Us

Open Source Tools

Properties

Blog

Cassandra.Link

Cassandra.Tools

Anant Playbook

Awesome Cassandra

Follow Us

Github

Youtube

Twitter

Linkedin

Facebook

Join Our Newsletter!

Sign up below to receive email updates and see what's going on with our company.

​
Illustration Image

© 2023 Anant Corporation

Apache, the Apache feather logo, Apache Cassandra, Cassandra, and the Cassandra logo, are either registered trademarks or trademarks of The Apache Software Foundation.