Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.
A collection of 1488 posts
Real Time Analytics With Spark Streaming and Cassandra
5/14/2019
Ryan SvihlaBlockedUnblockFollowFollowingSep 3, 2015Spark Streaming is a good tool to roll up transactions data into summaries as they enter the system. When paired with an easily idempotent data store...
mulesoft/mule-cassandradb-connector
5/13/2019
Anypoint Connector for Cassandra Database Apache Cassandra is a massively scalable open source non-relational database that offers continuous availability, linear scale performance, operational simpli...
Introducing Anypoint Connector for Cassandra
Check out Anypoint Connector for Cassandra in Exchange.Apache Cassandra is an open-source NoSQL database management system designed to handle large amounts of data across commodity hardware. Cassandra...
Datomic/CassandraCAS
4/5/2019
CassandraCAS Compare-and-swap in Cassandra Setup From Cassandra cqlsh: CREATE KEYSPACE IF NOT EXISTS datomic WITH replication = {'class': 'SimpleStrategy', 'replication_factor' : 3}; CREATE TABLE IF N...
Pearson uses Apache Spark Streaming for next generation adaptive learning platform
4/2/2019
This is a guest blog post from our friends at Pearson outlining their Apache Spark use case.Introduction of PearsonPearson is a British multinational publishing and education company headquartered in ...
Spark Streaming, Kafka and Cassandra Tutorial - Instaclustr
This tutorial builds on our basic “Getting Started with Instaclustr Spark and Cassandra” tutorial to demonstrate how to set up Apache Kafka and use it to send data to Spark Streaming where it is summa...
Building a Data Pipeline with Kafka, Spark Streaming and Cassandra | Baeldung
4/1/2019
1. OverviewApache Kafka is a scalable, high performance, low latency platform that allows reading and writing streams of data like a messaging system. We can start with Kafka in Java fairly easily. Sp...
uber/peloton
As compute clusters scale, making efficient use of cluster resources becomes very important. Peloton is a Unified Resource Scheduler to co-schedule mixed types of workloads such as batch, stateless an...