Illustration Image

Cassandra.Link

The best knowledge base on Apache Cassandra®

Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.

1/15/2018

Reading time:6 min

Alluxio Mesos Meetup - SMACK to SMAACK

by Alluxio, Inc.

Alluxio Mesos Meetup - SMACK to SMAACK SlideShare Explore You Successfully reported this slideshow.Alluxio Mesos Meetup - SMACK to SMAACKUpcoming SlideShareLoading in …5× 0 Comments 1 Like Statistics Notes Streaming Analytics , Technology Manager at Trivadis at Trivadis No DownloadsNo notes for slide 1. © 2016 Mesosphere, Inc. All Rights Reserved.From SMACK toSMAACKAlluxio meets DC/OSJörg Schad, MesosphereAdit Madan, Alluxio#smack @Alluxio @dcos @joerg_schad @madanadit 2. © 2017 Mesosphere, Inc. All Rights Reserved.20% OFFMCDCOS20September 13th - 15th● Dedicated Tracks● MesosCon University● Town Halls● HackathonAccelerating Spark workloads in a Mesosenvironment with Alluxio, 09/15, 11AM 3. © 2017 Mesosphere, Inc. All Rights Reserved. 3Fast DataBatch Event ProcessingMicro-BatchDays Hours Minutes Seconds MicrosecondsSolves problems using predictive and prescriptive analyticsReports what has happened using descriptive analyticsPredictive User InterfaceReal-time Pricing and Routing Real-time AdvertisingBilling, Chargeback Product recommendations 4. © 2017 Mesosphere, Inc. All Rights Reserved. 4The SMACK StackEVENTSUbiquitous data streamsfrom connected devicesINGESTApache KafkaSTOREApache SparkANALYZEApache CassandraACTAkkaIngest millions of eventsper secondDistributed & highlyscalable databaseReal-time and batchprocess dataVisualize data and builddata driven applicationsMesos/ DC/OSSensorsDevicesClients 5. © 2017 Mesosphere, Inc. All Rights Reserved. 5Datacenter 6. © 2017 Mesosphere, Inc. All Rights Reserved. 6NAIVE APPROACHTypical Datacentersiloed, over-provisioned servers,low utilizationIndustry Average12-15% utilizationmySQLmicroserviceCassandraSpark/HadoopKafka 7. © 2017 Mesosphere, Inc. All Rights Reserved. 7 8. © 2017 Mesosphere, Inc. All Rights Reserved. 8MULTIPLEXING OF DATA, SERVICES, USERS, ENVIRONMENTSTypical Datacentersiloed, over-provisioned servers,low utilizationMesos/ DC/OSautomated schedulers, workload multiplexing onto thesame machinesmySQLmicroserviceCassandraSpark/HadoopKafka 9. Datacenter Operating System (DC/OS)Distributed Systems Kernel (Mesos)DC/OS ENABLES MODERN DISTRIBUTED APPSBig Data + Analytics EnginesMicroservices (in containers)StreamingBatchMachine LearningAnalyticsFunctions &LogicSearchTime SeriesSQL / NoSQLDatabasesModern App ComponentsAny Infrastructure (Physical, Virtual, Cloud)9 10. © 2017 Mesosphere, Inc. All Rights Reserved. 10The SMACK StackEVENTSUbiquitous data streamsfrom connected devicesINGESTApache KafkaSTOREApache SparkANALYZEApache CassandraACTAkkaIngest millions of eventsper secondDistributed & highlyscalable databaseReal-time and batchprocess dataVisualize data and builddata driven applicationsMesos/ DC/OSSensorsDevicesClients 11. © 2017 Mesosphere, Inc. All Rights Reserved. 11The SMACK StackEVENTSUbiquitous data streamsfrom connected devicesINGESTApache KafkaSTOREApache SparkANALYZEApache CassandraACTAkkaIngest millions of eventsper secondDistributed & highlyscalable databaseReal-time and batchprocess dataVisualize data and builddata driven applicationsMesos/ DC/OSSensorsDevicesClients 12. © 2016 Mesosphere, Inc. All Rights Reserved.BIG DATA ECOSYSTEM YESTERDAY© 2017 Alluxio 12 13. © 2016 Mesosphere, Inc. All Rights Reserved.BIG DATA ECOSYSTEM TODAY© 2017 Alluxio13 14. © 2016 Mesosphere, Inc. All Rights Reserved.BIG DATA ECOSYSTEM ISSUES© 2017 Alluxio14 15. © 2017 Mesosphere, Inc. All Rights Reserved. 15The SMAACK StackEVENTSUbiquitous data streamsfrom connected devicesINGESTApache KafkaSTOREApache SparkANALYZEApache CassandraACTAkkaIngest millions of eventsper secondDistributed & highlyscalable databaseReal-time and batchprocess dataVisualize data and builddata driven applicationsMesos/ DC/OSSensorsDevicesClientsAlluxio 16. © 2017 Mesosphere, Inc. All Rights Reserved. 16© 2017 Alluxio 17. © 2016 Mesosphere, Inc. All Rights Reserved.BIG DATA ECOSYSTEM WITH ALLUXIOFUSE Compatible FileSystem InterfaceHadoop Compatible FileSystem InterfaceNative Key-ValueInterfaceNative File SystemInterfaceHDFS Interface Amazon S3 Interface Swift Interface GlusterFS Interface© 2017 Alluxio 17 18. © 2016 Mesosphere, Inc. All Rights Reserved.BIG DATA ECOSYSTEM WITH ALLUXIOFUSE Compatible FileSystem InterfaceHadoop Compatible FileSystem InterfaceNative Key-ValueInterfaceNative File SystemInterfaceHDFS Interface Amazon S3 Interface Swift Interface GlusterFS InterfaceEnabling Application to Access Data from anyStorage System at Memory-speed© 2017 Alluxio 18 19. © 2016 Mesosphere, Inc. All Rights Reserved.WHY ALLUXIO© 2017 AlluxioCo-located compute and data with memory-speed access to dataVirtualized across different storage systems under a unified namespaceScale-out architectureFile system API, software only19 20. © 2016 Mesosphere, Inc. All Rights Reserved.ALLUXIO BENEFITS© 2017 AlluxioUnificationNew workflows acrossany data in any storagesystemOrders of magnitudeimprovement in runtimeChoice in compute andstorage – grow eachindependently, buyonly what is neededPerformance Flexibility20 21. © 2017 Mesosphere, Inc. All Rights Reserved. 21© 2017 Alluxio 22. © 2016 Mesosphere, Inc. All Rights Reserved. 22WHY DATA SERVICES ON DC/OS?On-demand provisioning123Simplified operationsElastic data infrastructure● Single command install of services● Runtime software upgrade● Runtime application settings update● Monitoring & metrics● Managed persistent storage volumes● Data services and containerized apps share resources● Deploy instances with different versions on the sameinfrastructure● Resize instances● Add more instances© 2017 Alluxio 23. © 2016 Mesosphere, Inc. All Rights Reserved. 23ALLUXIO ON MESOSPHERE DC/OSFast, On-demand Unified Data at Memory Speed for AnalyticsAlluxioMesosphere DC/OSAny InfrastructureBuild apps once in DC/OS, andrun anywhereRuns distributed apps anywhere assimply as running apps on your laptopUnify Data at Memory Speed Unify Data at Memory Speed© 2017 Alluxio 24. © 2016 Mesosphere, Inc. All Rights Reserved. 24ALLUXIO ON MESOSPHERE DC/OSFast, On-demand Unified Data at Memory Speed for Analytics© 2017 Alluxio 25. © 2016 Mesosphere, Inc. All Rights Reserved.WHY ALLUXIO ON MESOSPHERE DC/OS?● Without Mesosphere DC/OS, provisioning of infrastructure is tedious○ Mesosphere DC/OS automates app & cluster provisioning, management & elastic scaling● Alluxio brings○ A unified view of data across disparate storage systems○ High performance & predictable SLA for analytics workloads● Benefits include:○ Process data in your existing cluster faster with Spark and other analytics frameworks○ Process data from hybrid cloud storage systems (HDFS, S3, On-prem Object Stores etc)© 2017 Alluxio 25 26. © 2016 Mesosphere, Inc. All Rights Reserved. 26BIG DATA STACK WITH ALLUXIO ON MESOSPHERE DC/OSFast, On-demand Unified Data at Memory Speed for AnalyticsMesosContainer Orchestration Management & Monitoring Tools Apps UniverseSecurity Advanced Operations Multitenancy Adv. Network & StorageUnifying Data at Memory Speed© 2017 Alluxio 27. © 2017 Mesosphere, Inc. All Rights Reserved. 27© 2017 AlluxioDEMO 28. © 2016 Mesosphere, Inc. All Rights Reserved.WHAT HAPPENED?● Alluxio scheduler (developed using the DC/OS SDK) launched as a Marathon application○ Marathon manages and restarts the scheduler in case of failures○ Scheduler consists of YAML + scripting● Alluxio scheduler launched master and worker processes○ Scheduler manages the configured number of instances even w/ failures● Configuration changes take effect on the fly○ Scaled up the worker instances© 2017 Alluxio 28 29. © 2016 Mesosphere, Inc. All Rights Reserved.GET STARTED TODAYRead:● Mesosphere Blog: http://ow.ly/ou0530ax9aM● Alluxio Blog: http://ow.ly/ILOZ30ax8YETry it out:● Install Alluxio from DC/OS UniverseQuestions?© 2017 Alluxio 29 Recommended Teacher Tech Tips WeeklyOnline Course - LinkedIn Learning Insights from a Content MarketerOnline Course - LinkedIn Learning Educational Technology for Student SuccessOnline Course - LinkedIn Learning The Architecture of Decoupling Compute and Storage with AlluxioAlluxio, Inc. Best Practices for Using Alluxio with SparkAlluxio, Inc. Spark Pipelines in the Cloud with AlluxioAlluxio, Inc. Accelerating Spark Workloads in a Mesos Environment with AlluxioAlluxio, Inc. Accelerating Spark Workloads in an Apache Mesos Environment with AlluxioAlluxio, Inc. Best Practices for Using Alluxio with SparkAlluxio, Inc. Best Practices for Using Alluxio with SparkAlluxio, Inc. About Blog Terms Privacy Copyright LinkedIn Corporation © 2018 Public clipboards featuring this slideNo public clipboards found for this slideSelect another clipboard ×Looks like you’ve clipped this slide to already.Create a clipboardYou just clipped your first slide! Clipping is a handy way to collect important slides you want to go back to later. Now customize the name of a clipboard to store your clips. Description Visibility Others can see my Clipboard

Illustration Image
Alluxio Mesos Meetup - SMACK to SMAACK

Successfully reported this slideshow.

Alluxio Mesos Meetup - SMACK to SMAACK
© 2016 Mesosphere, Inc. All Rights Reserved.
From SMACK to
SMAACK
Alluxio meets DC/OS
Jörg Schad, Mesosphere
Adit Madan, A...
© 2017 Mesosphere, Inc. All Rights Reserved.
20% OFF
MCDCOS20
September 13th - 15th
● Dedicated Tracks
● MesosCon Universi...
© 2017 Mesosphere, Inc. All Rights Reserved. 3
Fast Data
Batch Event ProcessingMicro-Batch
Days Hours Minutes Seconds Micr...
© 2017 Mesosphere, Inc. All Rights Reserved. 4
The SMACK Stack
EVENTS
Ubiquitous data streams
from connected devices
INGES...
© 2017 Mesosphere, Inc. All Rights Reserved. 5
Datacenter
© 2017 Mesosphere, Inc. All Rights Reserved. 6
NAIVE APPROACH
Typical Datacenter
siloed, over-provisioned servers,
low uti...
© 2017 Mesosphere, Inc. All Rights Reserved. 7
© 2017 Mesosphere, Inc. All Rights Reserved. 8
MULTIPLEXING OF DATA, SERVICES, USERS, ENVIRONMENTS
Typical Datacenter
silo...
Datacenter Operating System (DC/OS)
Distributed Systems Kernel (Mesos)
DC/OS ENABLES MODERN DISTRIBUTED APPS
Big Data + An...
© 2017 Mesosphere, Inc. All Rights Reserved. 10
The SMACK Stack
EVENTS
Ubiquitous data streams
from connected devices
INGE...
© 2017 Mesosphere, Inc. All Rights Reserved. 11
The SMACK Stack
EVENTS
Ubiquitous data streams
from connected devices
INGE...
© 2016 Mesosphere, Inc. All Rights Reserved.
BIG DATA ECOSYSTEM YESTERDAY
© 2017 Alluxio 12
© 2016 Mesosphere, Inc. All Rights Reserved.
BIG DATA ECOSYSTEM TODAY
© 2017 Alluxio
…
…
13
© 2016 Mesosphere, Inc. All Rights Reserved.
BIG DATA ECOSYSTEM ISSUES
© 2017 Alluxio
…
…
14
© 2017 Mesosphere, Inc. All Rights Reserved. 15
The SMAACK Stack
EVENTS
Ubiquitous data streams
from connected devices
ING...
© 2017 Mesosphere, Inc. All Rights Reserved. 16© 2017 Alluxio
© 2016 Mesosphere, Inc. All Rights Reserved.
BIG DATA ECOSYSTEM WITH ALLUXIO
…
…
FUSE Compatible File
System Interface
Had...
© 2016 Mesosphere, Inc. All Rights Reserved.
BIG DATA ECOSYSTEM WITH ALLUXIO
…
…
FUSE Compatible File
System Interface
Had...
© 2016 Mesosphere, Inc. All Rights Reserved.
WHY ALLUXIO
© 2017 Alluxio
Co-located compute and data with memory-speed acce...
© 2016 Mesosphere, Inc. All Rights Reserved.
ALLUXIO BENEFITS
© 2017 Alluxio
Unification
New workflows across
any data in ...
© 2017 Mesosphere, Inc. All Rights Reserved. 21© 2017 Alluxio
© 2016 Mesosphere, Inc. All Rights Reserved. 22
WHY DATA SERVICES ON DC/OS?
On-demand provisioning1
2
3
Simplified operati...
© 2016 Mesosphere, Inc. All Rights Reserved. 23
ALLUXIO ON MESOSPHERE DC/OS
Fast, On-demand Unified Data at Memory Speed f...
© 2016 Mesosphere, Inc. All Rights Reserved. 24
ALLUXIO ON MESOSPHERE DC/OS
Fast, On-demand Unified Data at Memory Speed f...
© 2016 Mesosphere, Inc. All Rights Reserved.
WHY ALLUXIO ON MESOSPHERE DC/OS?
● Without Mesosphere DC/OS, provisioning of ...
© 2016 Mesosphere, Inc. All Rights Reserved. 26
BIG DATA STACK WITH ALLUXIO ON MESOSPHERE DC/OS
Fast, On-demand Unified Da...
© 2017 Mesosphere, Inc. All Rights Reserved. 27© 2017 Alluxio
DEMO
© 2016 Mesosphere, Inc. All Rights Reserved.
WHAT HAPPENED?
● Alluxio scheduler (developed using the DC/OS SDK) launched a...
© 2016 Mesosphere, Inc. All Rights Reserved.
GET STARTED TODAY
Read:
● Mesosphere Blog: http://ow.ly/ou0530ax9aM
● Alluxio...

Upcoming SlideShare

Loading in …5

×

  1. 1. © 2016 Mesosphere, Inc. All Rights Reserved. From SMACK to SMAACK Alluxio meets DC/OS Jörg Schad, Mesosphere Adit Madan, Alluxio #smack @Alluxio @dcos @joerg_schad @madanadit
  2. 2. © 2017 Mesosphere, Inc. All Rights Reserved. 20% OFF MCDCOS20 September 13th - 15th ● Dedicated Tracks ● MesosCon University ● Town Halls ● Hackathon Accelerating Spark workloads in a Mesos environment with Alluxio, 09/15, 11AM
  3. 3. © 2017 Mesosphere, Inc. All Rights Reserved. 3 Fast Data Batch Event ProcessingMicro-Batch Days Hours Minutes Seconds Microseconds Solves problems using predictive and prescriptive analyticsReports what has happened using descriptive analytics Predictive User InterfaceReal-time Pricing and Routing Real-time AdvertisingBilling, Chargeback Product recommendations
  4. 4. © 2017 Mesosphere, Inc. All Rights Reserved. 4 The SMACK Stack EVENTS Ubiquitous data streams from connected devices INGEST Apache Kafka STORE Apache Spark ANALYZE Apache Cassandra ACT Akka Ingest millions of events per second Distributed & highly scalable database Real-time and batch process data Visualize data and build data driven applications Mesos/ DC/OS Sensors Devices Clients
  5. 5. © 2017 Mesosphere, Inc. All Rights Reserved. 5 Datacenter
  6. 6. © 2017 Mesosphere, Inc. All Rights Reserved. 6 NAIVE APPROACH Typical Datacenter siloed, over-provisioned servers, low utilization Industry Average 12-15% utilization mySQL microservice Cassandra Spark/Hadoop Kafka
  7. 7. © 2017 Mesosphere, Inc. All Rights Reserved. 7
  8. 8. © 2017 Mesosphere, Inc. All Rights Reserved. 8 MULTIPLEXING OF DATA, SERVICES, USERS, ENVIRONMENTS Typical Datacenter siloed, over-provisioned servers, low utilization Mesos/ DC/OS automated schedulers, workload multiplexing onto the same machines mySQL microservice Cassandra Spark/Hadoop Kafka
  9. 9. Datacenter Operating System (DC/OS) Distributed Systems Kernel (Mesos) DC/OS ENABLES MODERN DISTRIBUTED APPS Big Data + Analytics EnginesMicroservices (in containers) Streaming Batch Machine Learning Analytics Functions & Logic Search Time Series SQL / NoSQL Databases Modern App Components Any Infrastructure (Physical, Virtual, Cloud) 9
  10. 10. © 2017 Mesosphere, Inc. All Rights Reserved. 10 The SMACK Stack EVENTS Ubiquitous data streams from connected devices INGEST Apache Kafka STORE Apache Spark ANALYZE Apache Cassandra ACT Akka Ingest millions of events per second Distributed & highly scalable database Real-time and batch process data Visualize data and build data driven applications Mesos/ DC/OS Sensors Devices Clients
  11. 11. © 2017 Mesosphere, Inc. All Rights Reserved. 11 The SMACK Stack EVENTS Ubiquitous data streams from connected devices INGEST Apache Kafka STORE Apache Spark ANALYZE Apache Cassandra ACT Akka Ingest millions of events per second Distributed & highly scalable database Real-time and batch process data Visualize data and build data driven applications Mesos/ DC/OS Sensors Devices Clients
  12. 12. © 2016 Mesosphere, Inc. All Rights Reserved. BIG DATA ECOSYSTEM YESTERDAY © 2017 Alluxio 12
  13. 13. © 2016 Mesosphere, Inc. All Rights Reserved. BIG DATA ECOSYSTEM TODAY © 2017 Alluxio … … 13
  14. 14. © 2016 Mesosphere, Inc. All Rights Reserved. BIG DATA ECOSYSTEM ISSUES © 2017 Alluxio … … 14
  15. 15. © 2017 Mesosphere, Inc. All Rights Reserved. 15 The SMAACK Stack EVENTS Ubiquitous data streams from connected devices INGEST Apache Kafka STORE Apache Spark ANALYZE Apache Cassandra ACT Akka Ingest millions of events per second Distributed & highly scalable database Real-time and batch process data Visualize data and build data driven applications Mesos/ DC/OS Sensors Devices Clients Alluxio
  16. 16. © 2017 Mesosphere, Inc. All Rights Reserved. 16© 2017 Alluxio
  17. 17. © 2016 Mesosphere, Inc. All Rights Reserved. BIG DATA ECOSYSTEM WITH ALLUXIO … … FUSE Compatible File System Interface Hadoop Compatible File System Interface Native Key-Value Interface Native File System Interface HDFS Interface Amazon S3 Interface Swift Interface GlusterFS Interface © 2017 Alluxio 17
  18. 18. © 2016 Mesosphere, Inc. All Rights Reserved. BIG DATA ECOSYSTEM WITH ALLUXIO … … FUSE Compatible File System Interface Hadoop Compatible File System Interface Native Key-Value Interface Native File System Interface HDFS Interface Amazon S3 Interface Swift Interface GlusterFS Interface Enabling Application to Access Data from any Storage System at Memory-speed © 2017 Alluxio 18
  19. 19. © 2016 Mesosphere, Inc. All Rights Reserved. WHY ALLUXIO © 2017 Alluxio Co-located compute and data with memory-speed access to data Virtualized across different storage systems under a unified namespace Scale-out architecture File system API, software only 19
  20. 20. © 2016 Mesosphere, Inc. All Rights Reserved. ALLUXIO BENEFITS © 2017 Alluxio Unification New workflows across any data in any storage system Orders of magnitude improvement in run time Choice in compute and storage – grow each independently, buy only what is needed Performance Flexibility 20
  21. 21. © 2017 Mesosphere, Inc. All Rights Reserved. 21© 2017 Alluxio
  22. 22. © 2016 Mesosphere, Inc. All Rights Reserved. 22 WHY DATA SERVICES ON DC/OS? On-demand provisioning1 2 3 Simplified operations Elastic data infrastructure ● Single command install of services ● Runtime software upgrade ● Runtime application settings update ● Monitoring & metrics ● Managed persistent storage volumes ● Data services and containerized apps share resources ● Deploy instances with different versions on the same infrastructure ● Resize instances ● Add more instances © 2017 Alluxio
  23. 23. © 2016 Mesosphere, Inc. All Rights Reserved. 23 ALLUXIO ON MESOSPHERE DC/OS Fast, On-demand Unified Data at Memory Speed for Analytics Alluxio Mesosphere DC/OS Any Infrastructure Build apps once in DC/OS, and run anywhere Runs distributed apps anywhere as simply as running apps on your laptop Unify Data at Memory Speed Unify Data at Memory Speed © 2017 Alluxio
  24. 24. © 2016 Mesosphere, Inc. All Rights Reserved. 24 ALLUXIO ON MESOSPHERE DC/OS Fast, On-demand Unified Data at Memory Speed for Analytics © 2017 Alluxio
  25. 25. © 2016 Mesosphere, Inc. All Rights Reserved. WHY ALLUXIO ON MESOSPHERE DC/OS? ● Without Mesosphere DC/OS, provisioning of infrastructure is tedious ○ Mesosphere DC/OS automates app & cluster provisioning, management & elastic scaling ● Alluxio brings ○ A unified view of data across disparate storage systems ○ High performance & predictable SLA for analytics workloads ● Benefits include: ○ Process data in your existing cluster faster with Spark and other analytics frameworks ○ Process data from hybrid cloud storage systems (HDFS, S3, On-prem Object Stores etc) © 2017 Alluxio 25
  26. 26. © 2016 Mesosphere, Inc. All Rights Reserved. 26 BIG DATA STACK WITH ALLUXIO ON MESOSPHERE DC/OS Fast, On-demand Unified Data at Memory Speed for Analytics Mesos Container Orchestration Management & Monitoring Tools Apps Universe Security Advanced Operations Multitenancy Adv. Network & Storage Unifying Data at Memory Speed © 2017 Alluxio
  27. 27. © 2017 Mesosphere, Inc. All Rights Reserved. 27© 2017 Alluxio DEMO
  28. 28. © 2016 Mesosphere, Inc. All Rights Reserved. WHAT HAPPENED? ● Alluxio scheduler (developed using the DC/OS SDK) launched as a Marathon application ○ Marathon manages and restarts the scheduler in case of failures ○ Scheduler consists of YAML + scripting ● Alluxio scheduler launched master and worker processes ○ Scheduler manages the configured number of instances even w/ failures ● Configuration changes take effect on the fly ○ Scaled up the worker instances © 2017 Alluxio 28
  29. 29. © 2016 Mesosphere, Inc. All Rights Reserved. GET STARTED TODAY Read: ● Mesosphere Blog: http://ow.ly/ou0530ax9aM ● Alluxio Blog: http://ow.ly/ILOZ30ax8YE Try it out: ● Install Alluxio from DC/OS Universe Questions? © 2017 Alluxio 29

Related Articles

sstable
cassandra
spark

Spark and Cassandra’s SSTable loader

Arunkumar

11/1/2024

Checkout Planet Cassandra

Claim Your Free Planet Cassandra Contributor T-shirt!

Make your contribution and score a FREE Planet Cassandra Contributor T-Shirt! 
We value our incredible Cassandra community, and we want to express our gratitude by sending an exclusive Planet Cassandra Contributor T-Shirt you can wear with pride.

Join Our Newsletter!

Sign up below to receive email updates and see what's going on with our company

Explore Related Topics

AllKafkaSparkScyllaSStableKubernetesApiGithubGraphQl

Explore Further

mesos