Cassandra.Link
The best knowledge base on Apache Cassandra®
Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.
A collection of 11 posts
GitHub - andreia-negreira/Data_streaming_project: Data streaming project with robust end-to-end pipeline, combining tools such as Airflow, Kafka, Spark, Cassandra and containerized solution to easy deployment.
12/2/2023
{{ message }} / Data_streaming_project PublicNotifications Fork 0 Star 0 Data streaming project with robust end-to-end pipeline, combining tools such as Airflow, Kafka, Spark, Cassandra and conta...
GitHub - airscholar/e2e-data-engineering: An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
/ e2e-data-engineering PublicNotifications Fork 5 Star 19 An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka...
Apache Cassandra Lunch #53: Cassandra ETL with Airflow and Spark - Business Platform Team
6/17/2022
In Apache Cassandra Lunch #53: Cassandra ETL with Airflow and Spark, we discussed how we can do Cassandra ETL processes using Airflow and Spark. The live recording of Cassandra Lunch, which includes a...
Apache Cassandra Lunch #52: Airflow and Cassandra for Cluster Management - Business Platform Team
6/16/2022
In Apache Cassandra Lunch #52: Airflow and Cassandra for Cluster Management, we discussed using Airflow to schedule tasks on a Cassandra cluster beyond what could be accomplished with the Cassandra pr...
Apache Cassandra Lunch #48: Airflow and Cassandra - Business Platform Team
6/13/2022
In Apache Cassandra Lunch #48: Airflow and Cassandra, we discussed using Airflow to manage interactions with Cassandra. Specifically this week we covered Airflow Operators, and how they could be used ...
Using Airflow with Astra · datastaxdevs/awesome-astra Wiki
2/17/2022
{{ message }} Notifications Fork 2 Star 2 © 2022 GitHub, Inc. You can’t perform that action at this time.
Next-Gen Data Movement Platform at PayPal
7/9/2021
…using Apache Airflow scheduler and Apache Gobblin — a data integration framework open-sourced by LinkedIn.As PayPal grows beyond 300 million users, we generate lots of data, both on our online (site)...
6/11/2021