9/9/2020

Reading time:1 min

Understanding the architecture

by John Doe

Components of a DataStax Apache Kafka Connector implementation.The DataStax Apache Kafka™ Connector is deployed on the Kafka Connect Worker nodes and runs within the worker JVM. The Kafka Connect Worker Framework handles automatic rebalancing of tasks when new nodes are added and also ships with a built-in REST API for operator actions. Running the connector in this framework enables multiple DataStax connector instances to share the load and to scale horizontally when run in Distributed Mode. The diagram below shows how the DataStax Apache Kafka Connector fits into the Kafka ecosystem. The environment is comprised of the following components:Data sources - Original source of the data, such as databases, applications, and other services like Salesforce and Twitter. Kafka platformKafka brokers - Responsible for storing Kafka topics. Kafka connect workers - The nodes running the Kafka connect framework that run producer and consumer plug-ins (Kafka connectors). Source connectors - Push messages (data) from the original sources to Kafka brokers. Sink connectors - Workers running one or more instances of the DataStax Kafka Connector, which pulls messages from Kafka topics and writes them to a database table on the DataStax platform using the DataStax Enteprise Java driver. DataStax platform - DataStax Apache Kafka Connector writes to nodes in a cluster that are uniformly licensed to use the same subscription. For example, if a cluster contains five nodes, all five must be licensed to use one of the following technologies:Open source Apache Cassandra® 2.1 and later databases DataStax Astra cloud databases DataStax Enterprise (DSE) 4.7 and later databases

Read this article if you want to know more about Understanding the architecture

Components of a DataStax Apache Kafka Connector implementation.

The DataStax Apache Kafka™ Connector is deployed on the Kafka Connect Worker nodes and runs within the worker JVM. The Kafka Connect Worker Framework handles automatic rebalancing of tasks when new nodes are added and also ships with a built-in REST API for operator actions. Running the connector in this framework enables multiple DataStax connector instances to share the load and to scale horizontally when run in Distributed Mode. The diagram below shows how the DataStax Apache Kafka Connector fits into the Kafka ecosystem.

The environment is comprised of the following components:

Data sources - Original source of the data, such as databases, applications, and other services like Salesforce and Twitter.
Kafka platform
- Kafka brokers - Responsible for storing Kafka topics.
- Kafka connect workers - The nodes running the Kafka connect framework that run producer and consumer plug-ins (Kafka connectors).
  - Source connectors - Push messages (data) from the original sources to Kafka brokers.
  - Sink connectors - Workers running one or more instances of the DataStax Kafka Connector, which pulls messages from Kafka topics and writes them to a database table on the DataStax platform using the DataStax Enteprise Java driver.
DataStax platform - DataStax Apache Kafka Connector writes to nodes in a cluster that are uniformly licensed to use the same subscription. For example, if a cluster contains five nodes, all five must be licensed to use one of the following technologies:
- Open source Apache Cassandra® 2.1 and later databases
- DataStax Astra cloud databases
- DataStax Enterprise (DSE) 4.7 and later databases

Related Articles

migration

proxy

datastax

GitHub - datastax/zdm-proxy: An open-source component designed to seamlessly handle the real-time client application activity while a migration is in progress.

datastax

11/1/2024

migration

proxy

datastax

GitHub - datastax/zdm-proxy: An open-source component designed to seamlessly handle the real-time client application activity while a migration is in progress.

',d,a,t,a,s,t,a,x,'

11/1/2024

cassandra

event.driven

spark

Build an Event-Driven Architecture with Apache Kafka, Apache Spark, and Apache Cassandra

DataStax

8/3/2024

cloud

kubernetes

datastax

DataStax Hyper-Converged Database: The Future of Data Infrastructure Is Here | DataStax

Patrick McFadin

7/11/2024

cluster

troubleshooting

datastax

GitHub - arodrime/Montecristo: Datastax Cluster Health Check Tooling

arodrime

4/3/2024

analytics

streaming

visualization

Keen - Event Streaming Platform

John Doe

2/3/2024

mongo

cassandra

kafka

Top 10 Real-Time Databases to Use in 2024

K.sabreena

1/5/2024

node

hybrid.cloud

datastax

GitHub - IBM/datastax-cassandra-clickstream: Use DataStax Enterprise built on Apache Cassandra as a clickstream database

IBM

12/8/2023

examples

cassandra

datastax

GitHub - datastaxdevs/workshop-betterreads: Clone of Good Reads using Spring and Cassandra

datastaxdevs

12/2/2023

examples

cassandra

datastax

NoSQL Database Built on Apache Cassandra | DataStax

John Doe

12/2/2023

Checkout Planet Cassandra

Claim Your Free Planet Cassandra Contributor T-shirt!

Make your contribution and score a FREE Planet Cassandra Contributor T-Shirt!  We value our incredible Cassandra community, and we want to express our gratitude by sending an exclusive Planet Cassandra Contributor T-Shirt you can wear with pride.

Checkout Planet Cassandra

Claim Your Free Planet Cassandra Contributor T-shirt!

Contact Info

Resources

Properties

Follow Us