An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache ZooKeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
12/2/2023
GitHub - airscholar/e2e-data-engineering
by airscholar
Public repository, 19 stars, 5 forks
www.youtube.com/watch?v=gqactrqkcry



