Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real-time data platforms.
A collection of 168 posts
GitHub - andreia-negreira/Data_streaming_project: Data streaming project with robust end-to-end pipeline, combining tools such as Airflow, Kafka, Spark, Cassandra and containerized solution to easy deployment.
12/2/2023
Data streaming project with robust end-to-end pipeline, combining tools such as Airflow, Kafka, Spark, Cassandra and conta...
GitHub - airscholar/e2e-data-engineering: An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka...
Google Dataflow - Awesome-Astra
5/10/2023
Integrating Astra and Beam/Dataflow: Astra allows both bulk and real-time operations through AstraDB and Astra Streaming. For each service there are multiple interfaces available and integration with Ap...
Dealing with Large Spark Partitions
2/17/2023
One of the biggest issues when working with Spark and Cassandra is dealing with large partitions. There are several issues we need to overcome before we can really handle the challenge well. I’m going...
Apache Cassandra Lunch #84: Data & Analytics Platform: Cassandra, Spark, Kafka
11/4/2022
Can Spark Applications Coexist with NoSQL Databases? | Capital One
Apache Spark, Apache Cassandra, MongoDB: these are not unknown names in the tech industry. Each one of them has earned a commendable space in the field of distributed computing: Apache Spark as a unified a...
Migrate to Azure Managed Instance for Apache Cassandra using Apache Spark
8/18/2022
Article, 04/01/2022. Where possible, we recommend using Apache Cassandra native replication to migrate data from your existing cluster into Azure Managed Instance for Apache Cassandra by configuring a h...
Apache Cassandra Lunch #72: Databricks and Cassandra - Business Platform Team
6/28/2022
In Apache Cassandra Lunch #72: Databricks and Cassandra, we discussed how we can connect Databricks and Cassandra. The live recording of Cassandra Lunch, which includes a more in-depth discussion and ...