Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.
A collection of 168 posts
Introduction - Setup lambda architecture example
8/30/2018
SETUP LAMBDA ARCHITECTURE EXAMPLE 1. Introduction - Setup lambda architecture example 24 March 2016Let’s see how to build a lambda architecture example (LAE) for processing huge log data.What...
Spark SQL Against Cassandra Example - DZone Database
8/28/2018
Spark SQL is awesome. It allows you to query any Resilient Distributed Dataset (RDD) using SQL (including data stored in Cassandra!).First thing to do is to create a SQLContext from your SparkContext...
The Correct Way to Connect Spark Shell to Cassandra
tl;drusing the cassandra connector in the spark-shell is fairly straightforward setting up the connection in a way that doens’t break the existing sc is not documented anywhere the correct solution is...
SnappyData, MemSQL-Spark & Cassandra-Spark: A Performance Benchmark
8/3/2018
There is a repo associated with this blog post hereThere is a blog post that explains the Ad Analytics code example used below hereIntroductionWe recently released a mixed workload example for Ad Anal...
SnappyDataInc/snappydata
SnappyData fuses Apache Spark with an in-memory database to deliver a data engine capable of processing streams, transactions and interactive analytics in a single cluster. The Challenge with Spark an...
instaclustr/sample-KafkaSparkCassandra
Introductory sample scala app using Apache Spark Streaming to accept data from Kafka and write a summary to Cassandra.This sample has been built with the following versions:Scala 2.11.8 Kafka 1.1 Spar...
Yannael/kafka-sparkstreaming-cassandra
This Dockerfile sets up a complete streaming environment for experimenting with Kafka, Spark streaming (PySpark), and Cassandra. It installs Kafka 0.10.2.1 Spark 2.1.1 for Scala 2.11 Cassandra 3.7 It...
killrweather/killrweather
8/2/2018
KillrWeather is a reference application (which we are constantly improving) showing how to easily leverage and integrate Apache Spark, Apache Cassandra, and Apache Kafka for fast, streaming computati...