A collection of 17 posts
Apache Cassandra Lunch #94: StreamSets and Cassandra - Business Platform Team
In Cassandra Lunch #94, we discuss how to connectStreamSetsandCassandra! The live recording of Cassandra Lunch, which includes a more in-depth discussion and a demo, is embedded below in case you were
Next-Gen Data Movement Platform at PayPal
…using Apache Airflow scheduler and Apache Gobblin — a data integration framework open-sourced by LinkedIn.AsPayPal grows beyond 300 million users, we generate lots of data, both on our online (site)
Expero Blog | Building a Distributed Data Ingestion Pipeline
IntroductionOn a recent client engagement where we had to load and process data from several data sources, we were tasked with a broader mandate to develop a wholesale data loading strategy for a suit
Data Engineering Programs - Become a Data Engineer
NEW!Nanodegree ProgramData Engineering is the foundation for the new world of Big Data. Enroll now to build production-ready data infrastructure, an essential skill for advancing your data career.Enro
san089/Udacity-Data-Engineering-Projects
Project 1: Data Modeling with PostgresIn this project, we apply Data Modeling with Postgres and build an ETL pipeline using Python. A startup wants to analyze the data they've been collecting on songs
Flor91/Data-engineering-nanodegree
Projects done in theData Engineering Nanodegree by Udacity.comCourse 1: Data ModelingIntroduction to Data Modeling➔ Understand the purpose of data modeling➔ Identify the strengths and weaknesses of di
Blitzz.io
DEVELOPERSSimple to deploy, fault-tolerant, with zero-downtime, Blitzz Replicant is architected to be deployed on Kubernetes, which makes it the perfect choice for your new apps and services running i
scylladb/scylla-migrator
Make suresbtis installed on your machine, and runbuild.sh.Create aconfig.yamlfor your migration using the templateconfig.yamlin the repository root. Read the comments throughout carefully.The Scylla M