Cassandra.Link

The best knowledge base on Apache Cassandra®

Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.

2/21/2019

Reading time:1 min

scylladb/scylla-migrator

by John Doe

Make sure sbt is installed on your machine, and run build.sh.Create a config.yaml for your migration using the template config.yaml in the repository root. Read the comments throughout carefully.The Scylla Migrator is built against Spark 2.3.1, so you'll need to run that version on your cluster.After running build.sh, copy the jar from ./target/scala-2.11/scylla-migrator-assembly-0.0.1.jar and the config.yaml you've created to the Spark master server.Then, run this command on the Spark master server:spark-submit --class com.scylladb.migrator.Migrator \ --master spark://<spark-master-hostname>:7077 \ --conf spark.scylla.config=<path to config.yaml> <path to scylla-migrator-assembly-0.0.1.jar>To run in the local Docker-based setup:First start the environment:docker-compose up -dLaunch cqlsh in Cassandra's container and create a keyspace and a table with some data:docker-compose exec cassandra cqlsh<create stuff>Launch cqlsh in Scylla's container and create the destination keyspace and table with the same schema as the source table:docker-compose exec scylla cqlsh<create stuff>Edit the config.yaml file; note the comments throughout.Run build.sh.Then, launch spark-submit in the master's container to run the job:docker-compose exec spark-master spark-submit --class com.scylladb.migrator.Migrator \ --master spark://spark-master:7077 \ --conf spark.driver.host=spark-master \ --conf spark.scylla.config=/app/config.yaml \ /jars/scylla-migrator-assembly-0.0.1.jarThe spark-master container mounts the ./target/scala-2.11 dir on /jars and the repository root on /app. To update the jar with new code, just run build.sh and then run spark-submit again.

Illustration Image

Read this article if you want to know more about scylladb/scylla-migrator

Make sure sbt is installed on your machine, and run build.sh.

Create a config.yaml for your migration using the template config.yaml in the repository root. Read the comments throughout carefully.

The Scylla Migrator is built against Spark 2.3.1, so you'll need to run that version on your cluster.

After running build.sh, copy the jar from ./target/scala-2.11/scylla-migrator-assembly-0.0.1.jar and the config.yaml you've created to the Spark master server.

Then, run this command on the Spark master server:

spark-submit --class com.scylladb.migrator.Migrator \
  --master spark://<spark-master-hostname>:7077 \
  --conf spark.scylla.config=<path to config.yaml>
  <path to scylla-migrator-assembly-0.0.1.jar>

To run in the local Docker-based setup:

First start the environment:

docker-compose up -d

Launch cqlsh in Cassandra's container and create a keyspace and a table with some data:

docker-compose exec cassandra cqlsh
<create stuff>

Launch cqlsh in Scylla's container and create the destination keyspace and table with the same schema as the source table:

docker-compose exec scylla cqlsh
<create stuff>

Edit the config.yaml file; note the comments throughout.
Run build.sh.
Then, launch spark-submit in the master's container to run the job:

docker-compose exec spark-master spark-submit --class com.scylladb.migrator.Migrator \
  --master spark://spark-master:7077 \
  --conf spark.driver.host=spark-master \
  --conf spark.scylla.config=/app/config.yaml \
  /jars/scylla-migrator-assembly-0.0.1.jar

The spark-master container mounts the ./target/scala-2.11 dir on /jars and the repository root on /app. To update the jar with new code, just run build.sh and then run spark-submit again.

Related Articles

GitHub - datastax/cql-proxy: A client-side CQL proxy/sidecar.

datastax

11/1/2024

GitHub - datastax/cql-proxy: A client-side CQL proxy/sidecar.

',d,a,t,a,s,t,a,x,'

11/1/2024

Migrate to Aiven for Apache Cassandra® with no downtime | Aiven docs

John Doe

11/1/2024

Migrate to Aiven for Apache Cassandra® with no downtime | Aiven docs

John Doe

11/1/2024

GitHub - datastax/zdm-proxy: An open-source component designed to seamlessly handle the real-time client application activity while a migration is in progress.

datastax

11/1/2024

GitHub - datastax/zdm-proxy: An open-source component designed to seamlessly handle the real-time client application activity while a migration is in progress.

',d,a,t,a,s,t,a,x,'

11/1/2024

Spark and Cassandra’s SSTable loader

Arunkumar

11/1/2024

GitHub - apache/cassandra-analytics: Apache cassandra

apache

9/4/2024

Build an Event-Driven Architecture with Apache Kafka, Apache Spark, and Apache Cassandra

DataStax

8/3/2024

Vald

John Doe

2/11/2024

Checkout Planet Cassandra

Claim Your Free Planet Cassandra Contributor T-shirt!

Make your contribution and score a FREE Planet Cassandra Contributor T-Shirt!  We value our incredible Cassandra community, and we want to express our gratitude by sending an exclusive Planet Cassandra Contributor T-Shirt you can wear with pride.

Join Our Newsletter!

Sign up below to receive email updates and see what's going on with our company

Explore Related Topics

AllKafkaSparkScyllaSStableKubernetesApiGithubGraphQl

Explore Further

scylladb

Vald

John Doe

2/11/2024

Our Top NoSQL Blogs of the Year: Rust, Raft, MongoDB, Books & Tablets

John Doe

1/5/2024

GitHub - eighty4/cquill: Versioned CQL migrations for Cassandra and ScyllaDB

eighty4

12/2/2023

Apache Cassandra 4.0 vs. ScyllaDB 4.4: Comparing Performance

Peter Corless

2/16/2023

Join Our Newsletter!

Sign up below to receive email updates and see what's going on with our company.

Contact Info

3 Washington Circle NW Suite 301 - Washington, D.C. 20037

support@anant.us

(855) 262-6526

Resources

Services

Careers

Events

Contact Us

Open Source Tools

Properties

Blog

Cassandra.Link

Cassandra.Tools

Anant Playbook

Awesome Cassandra

Follow Us

Github

Youtube

Twitter

Linkedin

Facebook

Join Our Newsletter!

Sign up below to receive email updates and see what's going on with our company.

Illustration Image

Illustration Image

© 2023 Anant Corporation

Apache, the Apache feather logo, Apache Cassandra, Cassandra, and the Cassandra logo, are either registered trademarks or trademarks of The Apache Software Foundation.