Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.
A collection of 168 posts
Apache Cassandra Lunch #65: Spark Cassandra Connector Pushdown - Business Platform Team
6/27/2022
In Apache Cassandra Lunch #65: Spark Cassandra Connector Pushdown, we discussed Spark predicate pushdown in the context of the Spark Cassandra connector. The live recording of Cassandra Lunch, which i...
Apache Cassandra Lunch #54: Machine Learning with Spark + Cassandra Part 2
6/18/2022
In Apache Cassandra Lunch #54: Machine Learning with Spark + Cassandra, we will discuss how you can use Apache Spark and Apache Cassandra to perform additional basic Machine Learning tasks. The live r...
Apache Cassandra Lunch #53: Cassandra ETL with Airflow and Spark - Business Platform Team
6/17/2022
In Apache Cassandra Lunch #53: Cassandra ETL with Airflow and Spark, we discussed how we can do Cassandra ETL processes using Airflow and Spark. The live recording of Cassandra Lunch, which includes a...
Apache Cassandra Lunch #50: Machine Learning with Spark + Cassandra - Business Platform Team
6/15/2022
In Apache Cassandra Lunch #50: Machine Learning with Spark + Cassandra, we will discuss how you can use Apache Spark and Apache Cassandra to perform basic Machine Learning tasks. The live recording of...
Apache Cassandra Lunch #49: Spark SQL for Cassandra Data Operations - Business Platform Team
6/14/2022
In Apache Cassandra Lunch #49: Spark SQL for Cassandra Data Operations, we discuss how we can use Spark SQL for Cassandra data operations. The live recording of Cassandra Lunch, which includes a more ...
Apache Cassandra Lunch #46: Apache Spark Jobs in Scala for Cassandra Data Operations - Business Platform Team
6/12/2022
In Apache Cassandra Lunch #46: Apache Spark Jobs in Scala for Cassandra Data Operations, we discuss how we can do Apache Spark jobs in Scala Cassandra data operations. The live recording of Cassandra ...
Building a Data Pipeline with Kafka, Spark Streaming and Cassandra | Baeldung
5/26/2022
1. OverviewApache Kafka is a scalable, high performance, low latency platform that allows reading and writing streams of data like a messaging system. We can start with Kafka in Java fairly easily. Sp...
How we build a robust analytics platform using Spark, Kafka and Cassandra
3/11/2022
In today’s online world, supply chain is one of the most important pillars of any online shop. Not just quality products, but customers also want swift deliveries. This requires maintaining item avail...