Illustration Image

Cassandra.Link

The best knowledge base on Apache Cassandra®

Helping platform leaders, architects, engineers, and operators build scalable real time data platforms.

4/9/2020

Reading time:5 min

strongloop/loopback-connector-cassandra

by strongloop

The official Cassandra Connector module for loopback-datasource-juggler.Please also see LoopBack Cassandra Connector in LoopBack documentation.InstallationIn your application root directory, enter this command to install the connector:npm install loopback-connector-cassandra --saveThis installs the module from npm and adds it as a dependency to the application's package.json file.If you create a Cassandra data source using the data source generator as described below, you don't have to do this, since the generator will run npm install for you.Creating a Cassandra data sourceUse the Data source generator to add a Cassandra data source to your application. Select Cassandra connector as follows:$ lb datasource? Enter the data-source name: mycass? Select the connector for mycass: IBM Cloudant DB (supported by StrongLoop) IBM DB2 for z/OS (supported by StrongLoop) IBM WebSphere eXtreme Scale key-value connector (supported by StrongLoop) ❯ Cassandra (supported by StrongLoop) Redis key-value connector (supported by StrongLoop) MongoDB (supported by StrongLoop) MySQL (supported by StrongLoop) (Move up and down to reveal more choices)The generator will then prompt for the database server hostname, port, and other settings required to connect to a Cassandra database. It will also run the npm install command for you.$ lb datasource? Enter the data-source name: mycass? Select the connector for mycass: Cassandra (supported by StrongLoop)Connector-specific configuration:? host: localhost? port: 9042? user: ? password: ? database: test? connectTimeout(ms): 30000? readTimeout(ms): 30000? Install loopback-connector-cassandra@^1.0.0 Yesloopback-connector-cassandra@1.0.0 node_modules/loopback-connector-cassandra...The entry in the application's /server/datasources.json will look like this:"mycass": { "host": "localhost", "port": 9042, "database": "test", "password": "", "name": "mycass", "user": "", "connectTimeout": 30000, "readTimeout": 30000, "connector": "cassandra"}Edit datasources.json to add any other additional properties supported by cassandra-driver.Type mappingsSee LoopBack types for details on LoopBack's data types.LoopBack to/from Cassandra typesIn addition to the standard data types such as String, Boolean, and Number, several Cassandra specific types are supported as shown in the table blow.LoopBack TypeCassandra TypeUuidUUIDTimeUuidTIMEUUIDTupleTUPLEPrimary KeysAuto generated partition keysIn case no id is defined, LoopBack adds id with Cassandra connector's default type: Uuid.LoopBack notation:zipCodes = db.define('zipCodes', { state: String, zipCode: Number, });Cql equivalent:CREATE TABLE zipCodes ( state TEXT, zipCode INT, id UUID, PRIMARY KEY (id));User defined partition keysLoopBack notation:When id: true is defined, LoopBack does not add id and uses it as a partition key.customers = db.define('customers', { name: String, state: String, zipCode: Number, userId: {type: 'TimeUuid', id: true}, });Cql equivalent:CREATE TABLE customers ( name TEXT, state TEXT, zipCode INT, userId TIMEUUID, PRIMARY KEY (userId));Compound partition keysLoopBack notation:id value can be either boolean or number (base 1). Compound partition key is created by combining the ones with number in ascending order then boolean. In case conflict, first-come first-served.customers = db.define('customers', { isSignedUp: {type: Boolean, id: 2}, state: String, contactSalesRep: {type: String, id: true}, zipCode: Number, userId: {type: Number, id: 1}, });Cql equivalent:CREATE TABLE customers ( isSignedUp BOOLEAN, state TEXT, contactSalesRep TEXT, zipCode INT, userId INT, PRIMARY KEY ((userId, isSignedUp, contactSalesRep)));Clustering keys and SortingCassandra stores data on each node according to the hashed TOKEN value of the partition key in the range that the node is responsible for. Since hashed TOKEN values are generally random, find with limit: 10 filter will return apparently random 10 (or less) rows. The Cassandra connector supports on-disk sorting by setting clustering key as ASCending or DESCending at table creation time. order filter is ignored. Since sorting is done on node by node basis, the returned result is property sorted only when the partition key is specified.For example, in case you want to find the most recently added row, create a table with time-based column as a clustering key with DESC property. Then, use find with limit: 1 or findOne.Concrete example is as follows assuming all the rows fall in the same partition range. Note that clusteringKeys is defined as an array because the order of the sorting keys is important:isSignedUpstatecontactSalesRepzipCodeuserIdtrueArizonaTed Johnson850032003trueArizonaDavid Smith8500216002trueArizonaMary Parker8500115001trueCaliforniaDavid Smith9000121002trueColoradoMary Parker800022010trueColoradoJane Miller8000112002trueNevadaTed Johnson7517328006LoopBack notation:Cassandra connector supports clustering key as a custom option. Sorting order can be associated with clustering keys as ASC or DESC.customers = db.define('customers', { isSignedUp: {type: Boolean, id: true}, state: String, contactSalesRep: String, zipCode: Number, userId: Number, }, { cassandra: { clusteringKeys: ['state', 'zipCode DESC'], }, });Cql equivalent:CREATE TABLE customers ( isSignedUp BOOLEAN, state TEXT, contactSalesRep TEXT, zipCode INT, userId INT, PRIMARY KEY (isSignedUp, state, zipCode)) WITH CLUSTERING ORDER BY (state ASC, zipCode DESC);Secondary IndexesAdditional searchable fields can be defined as secondary indexes. For example, in case the table customers below is defined with name as just {type: String}, then find with where: {name: "Martin Taylor"} filter will fail. However, find with where: {namee: "Martin Taylor"} filter will succeed on the table defined with index: true as follows:LoopBack notation:customers = db.define('customers', { name: {type: String, index: true}, userId: {type: Number, id: true}, });Cql equivalent:CREATE TABLE customers ( name TEXT, userId INT, PRIMARY KEY (userId));CREATE INDEX ON customers (name);V1 LimitationsBecause of the Cassandra architecture, Cassandra connector V1 supports where and limit. Other filter conditions are not supported.order filter not supportedUse clustering keys for sorting. The database side sorting determines the order or rows to be return when ordering matters such as where limit or findOne. Ad hoc sorting with sort filter is not supported.or filter not supportedand is supported, but or is not in where filter.offset is not supportedPagination is not supported in V1.Running testsOwn instanceIf you have a local or remote Cassandra instance and would like to use that to run the test suite, use the following command:CASSANDRA_HOST=<HOST> CASSANDRA_PORT=<PORT> CASSANDRA_KEYSPACE=<KEYSPACE> CI=true npm testWindowsSET CASSANDRA_HOST=<HOST>SET CASSANDRA_PORT=<PORT>SET CASSANDRA_KEYSPACE=<KEYSPACE>SET CI=truenpm testDockerIf you do not have a local Cassandra instance, you can also run the test suite with very minimal requirements.Assuming you have Docker installed, run the following script which would spawn a Cassandra instance on your local:source setup.sh <HOST> <PORT> <KEYSPACE>where <HOST>, <PORT> and <KEYSPACE> are optional parameters. The default values are localhost, 9042 and test respectively.Run the test:npm test

Illustration Image

The official Cassandra Connector module for loopback-datasource-juggler.

Please also see LoopBack Cassandra Connector in LoopBack documentation.

Installation

In your application root directory, enter this command to install the connector:

npm install loopback-connector-cassandra --save

This installs the module from npm and adds it as a dependency to the application's package.json file.

If you create a Cassandra data source using the data source generator as described below, you don't have to do this, since the generator will run npm install for you.

Creating a Cassandra data source

Use the Data source generator to add a Cassandra data source to your application. Select Cassandra connector as follows:

$ lb datasource
? Enter the data-source name: mycass
? Select the connector for mycass: 
  IBM Cloudant DB (supported by StrongLoop) 
  IBM DB2 for z/OS (supported by StrongLoop) 
  IBM WebSphere eXtreme Scale key-value connector (supported by StrongLoop) 
❯ Cassandra (supported by StrongLoop) 
  Redis key-value connector (supported by StrongLoop) 
  MongoDB (supported by StrongLoop) 
  MySQL (supported by StrongLoop) 
(Move up and down to reveal more choices)

The generator will then prompt for the database server hostname, port, and other settings required to connect to a Cassandra database. It will also run the npm install command for you.

$ lb datasource
? Enter the data-source name: mycass
? Select the connector for mycass: Cassandra (supported by StrongLoop)
Connector-specific configuration:
? host: localhost
? port: 9042
? user: 
? password: 
? database: test
? connectTimeout(ms): 30000
? readTimeout(ms): 30000
? Install loopback-connector-cassandra@^1.0.0 Yes
loopback-connector-cassandra@1.0.0 node_modules/loopback-connector-cassandra
...

The entry in the application's /server/datasources.json will look like this:

"mycass": {
  "host": "localhost",
  "port": 9042,
  "database": "test",
  "password": "",
  "name": "mycass",
  "user": "",
  "connectTimeout": 30000,
  "readTimeout": 30000,
  "connector": "cassandra"
}

Edit datasources.json to add any other additional properties supported by cassandra-driver.

Type mappings

See LoopBack types for details on LoopBack's data types.

LoopBack to/from Cassandra types

In addition to the standard data types such as String, Boolean, and Number, several Cassandra specific types are supported as shown in the table blow.

LoopBack Type Cassandra Type
Uuid UUID
TimeUuid TIMEUUID
Tuple TUPLE

Primary Keys

Auto generated partition keys

In case no id is defined, LoopBack adds id with Cassandra connector's default type: Uuid.

LoopBack notation:

zipCodes = db.define('zipCodes', {
  state: String,
  zipCode: Number,
  });

Cql equivalent:

CREATE TABLE zipCodes (
   state TEXT,
   zipCode INT,
   id UUID,
   PRIMARY KEY (id)
);

User defined partition keys

LoopBack notation:

When id: true is defined, LoopBack does not add id and uses it as a partition key.

customers = db.define('customers', {
  name: String,
  state: String,
  zipCode: Number,
  userId: {type: 'TimeUuid', id: true},
  });

Cql equivalent:

CREATE TABLE customers (
   name TEXT,
   state TEXT,
   zipCode INT,
   userId TIMEUUID,
   PRIMARY KEY (userId)
);

Compound partition keys

LoopBack notation:

id value can be either boolean or number (base 1). Compound partition key is created by combining the ones with number in ascending order then boolean. In case conflict, first-come first-served.

customers = db.define('customers', {
  isSignedUp: {type: Boolean, id: 2},
  state: String,
  contactSalesRep: {type: String, id: true},
  zipCode: Number,
  userId: {type: Number, id: 1},
  });

Cql equivalent:

CREATE TABLE customers (
   isSignedUp BOOLEAN,
   state TEXT,
   contactSalesRep TEXT,
   zipCode INT,
   userId INT,
   PRIMARY KEY ((userId, isSignedUp, contactSalesRep))
);

Clustering keys and Sorting

Cassandra stores data on each node according to the hashed TOKEN value of the partition key in the range that the node is responsible for. Since hashed TOKEN values are generally random, find with limit: 10 filter will return apparently random 10 (or less) rows. The Cassandra connector supports on-disk sorting by setting clustering key as ASCending or DESCending at table creation time. order filter is ignored. Since sorting is done on node by node basis, the returned result is property sorted only when the partition key is specified.

For example, in case you want to find the most recently added row, create a table with time-based column as a clustering key with DESC property. Then, use find with limit: 1 or findOne.

Concrete example is as follows assuming all the rows fall in the same partition range. Note that clusteringKeys is defined as an array because the order of the sorting keys is important:

isSignedUp state contactSalesRep zipCode userId
true Arizona Ted Johnson 85003 2003
true Arizona David Smith 85002 16002
true Arizona Mary Parker 85001 15001
true California David Smith 90001 21002
true Colorado Mary Parker 80002 2010
true Colorado Jane Miller 80001 12002
true Nevada Ted Johnson 75173 28006

LoopBack notation:

Cassandra connector supports clustering key as a custom option. Sorting order can be associated with clustering keys as ASC or DESC.

customers = db.define('customers', {
  isSignedUp: {type: Boolean, id: true},
  state: String,
  contactSalesRep: String,
  zipCode: Number,
  userId: Number,
  }, {
  cassandra: {
    clusteringKeys: ['state', 'zipCode DESC'],
    },
  });

Cql equivalent:

CREATE TABLE customers (
   isSignedUp BOOLEAN,
   state TEXT,
   contactSalesRep TEXT,
   zipCode INT,
   userId INT,
   PRIMARY KEY (isSignedUp, state, zipCode)
) WITH CLUSTERING ORDER BY (state ASC, zipCode DESC);

Secondary Indexes

Additional searchable fields can be defined as secondary indexes. For example, in case the table customers below is defined with name as just {type: String}, then find with where: {name: "Martin Taylor"} filter will fail. However, find with where: {namee: "Martin Taylor"} filter will succeed on the table defined with index: true as follows:

LoopBack notation:

customers = db.define('customers', {
  name: {type: String, index: true},
  userId: {type: Number, id: true},
  });

Cql equivalent:

CREATE TABLE customers (
   name TEXT,
   userId INT,
   PRIMARY KEY (userId)
);
CREATE INDEX ON customers (name);

V1 Limitations

Because of the Cassandra architecture, Cassandra connector V1 supports where and limit. Other filter conditions are not supported.

order filter not supported

Use clustering keys for sorting. The database side sorting determines the order or rows to be return when ordering matters such as where limit or findOne. Ad hoc sorting with sort filter is not supported.

or filter not supported

and is supported, but or is not in where filter.

offset is not supported

Pagination is not supported in V1.

Running tests

Own instance

If you have a local or remote Cassandra instance and would like to use that to run the test suite, use the following command:

CASSANDRA_HOST=<HOST> CASSANDRA_PORT=<PORT> CASSANDRA_KEYSPACE=<KEYSPACE> CI=true npm test
  • Windows
SET CASSANDRA_HOST=<HOST>
SET CASSANDRA_PORT=<PORT>
SET CASSANDRA_KEYSPACE=<KEYSPACE>
SET CI=true
npm test

Docker

If you do not have a local Cassandra instance, you can also run the test suite with very minimal requirements.

  • Assuming you have Docker installed, run the following script which would spawn a Cassandra instance on your local:
source setup.sh <HOST> <PORT> <KEYSPACE>

where <HOST>, <PORT> and <KEYSPACE> are optional parameters. The default values are localhost, 9042 and test respectively.

  • Run the test:
npm test

Related Articles

spring
angular
rest

GitHub - jhipster/jhipster-sample-app-cassandra: This is a sample application created with JHipster, with the Cassandra option

jhipster

3/7/2024

cassandra
rest

Checkout Planet Cassandra

Claim Your Free Planet Cassandra Contributor T-shirt!

Make your contribution and score a FREE Planet Cassandra Contributor T-Shirt! 
We value our incredible Cassandra community, and we want to express our gratitude by sending an exclusive Planet Cassandra Contributor T-Shirt you can wear with pride.

Join Our Newsletter!

Sign up below to receive email updates and see what's going on with our company

Explore Related Topics

AllKafkaSparkScyllaSStableKubernetesApiGithubGraphQl

Explore Further

cassandra