Illustration Image

Cassandra.Link

The best knowledge base on Apache Cassandra®

Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.

9/7/2018

Reading time:1 min

Apache Cassandra, Part 1: Introduction and Key Features

by Haris Hasan

This series of posts present an introduction to Apache Cassandra. It discusses key Cassandra features, its core concepts, how it works under the hood, how it is different from other data stores, data modelling best practices with examples, and some tips & tricks.Cassandra is a popular open source NoSQL database. It is being successfully used in a variety of scenarios like analytics, time series analysis, monitoring, retail, e-commerce, etc. One common overarching theme where you find Cassandra in use are environments with high write volumes.Key Cassandra FeaturesHere are some of the key features of Cassandra.Distributed: Cassandra is built to run on a cluster of nodes to provide high availability, fault tolerance and scalability.Multi Master or Master Less: Many data stores e.g. MongoDB are based on a master slave architecture.http://mobicon.tistory.com/142All the writes goes on a master node and reads are executed on slaves. On the other hand, Cassandra works in a master-less or multi master mode.http://mobicon.tistory.com/142Writes are distributed among nodes using a hash function (more on this later) and reads are channeled onto specific nodes.High Write Availability: When a master node goes down, MongoDB stops taking new writes until rest of the nodes choose a new master. On the other hand, in Cassandra, if one node goes down, the writes are redirected towards other nodes and the system continues to operate.Linear Scaling: due to its multi master architecture, Cassandra is linearly scalable, doubling the number of nodes in a cluster can handle twice the writes.Design Time Schema: Cassandra requires defining schema and data types at design time. That’s not how Cassandra started, but it evolved and now you must define schema first.Hot Writes in RAM: Cassandra stores incoming writes in RAM to provide speedy performance (more on this later).AP system: Cassandra is considered highly available and partition tolerant system in terms of CAP theorem.Column family Store: Cassandra is neither a row based store nor column oriented store, its a column family store which is a different concept. (More on this later)Next: Apache Cassandra, Part 2: Cassandra vs MongoDB

Illustration Image

This series of posts present an introduction to Apache Cassandra. It discusses key Cassandra features, its core concepts, how it works under the hood, how it is different from other data stores, data modelling best practices with examples, and some tips & tricks.

Cassandra is a popular open source NoSQL database. It is being successfully used in a variety of scenarios like analytics, time series analysis, monitoring, retail, e-commerce, etc. One common overarching theme where you find Cassandra in use are environments with high write volumes.

Key Cassandra Features

Here are some of the key features of Cassandra.

  • Distributed: Cassandra is built to run on a cluster of nodes to provide high availability, fault tolerance and scalability.
  • Multi Master or Master Less: Many data stores e.g. MongoDB are based on a master slave architecture.
image
http://mobicon.tistory.com/142
  • All the writes goes on a master node and reads are executed on slaves. On the other hand, Cassandra works in a master-less or multi master mode.
image
http://mobicon.tistory.com/142
  • Writes are distributed among nodes using a hash function (more on this later) and reads are channeled onto specific nodes.
  • High Write Availability: When a master node goes down, MongoDB stops taking new writes until rest of the nodes choose a new master. On the other hand, in Cassandra, if one node goes down, the writes are redirected towards other nodes and the system continues to operate.
  • Linear Scaling: due to its multi master architecture, Cassandra is linearly scalable, doubling the number of nodes in a cluster can handle twice the writes.
  • Design Time Schema: Cassandra requires defining schema and data types at design time. That’s not how Cassandra started, but it evolved and now you must define schema first.
  • Hot Writes in RAM: Cassandra stores incoming writes in RAM to provide speedy performance (more on this later).
  • AP system: Cassandra is considered highly available and partition tolerant system in terms of CAP theorem.
  • Column family Store: Cassandra is neither a row based store nor column oriented store, its a column family store which is a different concept. (More on this later)

Next: Apache Cassandra, Part 2: Cassandra vs MongoDB

Related Articles

cassandra
article
time.series

Table Store Time Series Data Storage Architecture

John Doe

4/21/2020

cassandra
article

Checkout Planet Cassandra

Claim Your Free Planet Cassandra Contributor T-shirt!

Make your contribution and score a FREE Planet Cassandra Contributor T-Shirt! 
We value our incredible Cassandra community, and we want to express our gratitude by sending an exclusive Planet Cassandra Contributor T-Shirt you can wear with pride.

Join Our Newsletter!

Sign up below to receive email updates and see what's going on with our company

Explore Related Topics

AllKafkaSparkScyllaSStableKubernetesApiGithubGraphQl

Explore Further

cassandra