Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.
A collection of 1475 posts
Increasing the number of tokens in Cassandra 2.2
4/29/2021
11 minute read I wrote a blog post a few months ago detailing my experience with Cassandra and tokens. This is a follow-up post on the outcome of that issue. Note: In these posts, I’m using toke...
Cassandra Operations and Tuning | Apache Cassandra Performance Tuning – DMN Big Data
Cassandra Operations And Performance Tuning In this topic, i will cover the basics of general Apache Cassandra performance tuning: when to do performance tuning, how to avoid and identify p...
[PromCon Recap] Two Households, Both Alike in Dignity: Cortex and Thanos
This blog post is a writeup of the presentation Bartek Plotka and I gave at PromCon 2019.Cortex is a horizontally scalable, clustered Prometheus implementation aimed at giving users a global view of...
Apache Cassandra Lunch #46: Apache Spark Jobs in Scala for Cassandra Data Operations - Business Platform Team
4/28/2021
In Apache Cassandra Lunch #46: Apache Spark Jobs in Scala for Cassandra Data Operations, we discuss how we can do Apache Spark jobs in Scala Cassandra data operations. The live recording of Cassandra ...
WitFoo Tests Cassandra 4.0 for Performance Issues
4/26/2021
WitFoo Precinct persists and replicates data on big-data NoSQL platform Apache Cassandra. Precinct 6.1.3 is built on Cassandra 3.11. In preparation for upgrade to Cassandra 4.0, the following lab & pr...
Kubernetes Data Simplicity: Getting started with K8ssandra
You might have heard about the K8ssandra project and want to start contributing, or maybe you want to start using all of its features. If you aren’t familiar with K8ssandra (pronounced like “Kate Sand...
Open-sourcing a 10x reduction in Apache Cassandra tail latency
4/25/2021
Instagram EngineeringFollowMar 5, 2018 · 6 min readAt Instagram, we have one of the world’s largest deployments of the Apache Cassandra database. We began using Cassandra in 2012 to replace Redis and ...
The premier open source Data Quality solution
The premier open source Data Quality solution.What is DataCleaner?Data profilingThe heart of DataCleaner is a strong data profiling engine for discovering and analyzing the quality of your data. Find ...