Skip to content
DataMiner Dojo

More results...

Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Search in posts
Search in pages
Log in
Menu
  • Updates & Insights
  • Questions
  • Learning
    • E-learning Courses
    • Tutorials
    • Open Classroom Training
    • Certification
      • DataMiner Fundamentals
      • DataMiner Configurator
      • DataMiner Automation
      • Scripts & Connectors Developer: HTTP Basics
      • Scripts & Connectors Developer: SNMP Basics
      • Visual Overview – Level 1
      • Verify a certificate
    • YouTube Videos
    • Solutions & Use Cases
      • Solutions
      • Use Case Library
    • Agility
      • Learn more about Agile
        • Agile Webspace
        • Everything Agile
          • The Agile Manifesto
          • Best Practices
          • Retro Recipes
        • Methodologies
          • The Scrum Framework
          • Kanban
          • Extreme Programming
        • Roles
          • The Product Owner
          • The Agile Coach
          • The Quality & UX Coach (QX)
      • Book your Agile Fundamentals training
      • Book you Kanban workshop
    • >> Go to DataMiner Docs
  • DevOps
    • About the DevOps Program
    • Sign up for the DevOps Program
    • DataMiner DevOps Support
    • Feature Suggestions
  • Downloads
  • Swag Shop
  • PARTNERS
    • Business Partners
    • Technology Partners
  • Contact
    • Sales, Training & Certification
    • DataMiner Support
    • Global Feedback Survey
  • >> Go to dataminer.services

Cassandra, consistencylevels and replication

52 views7 hours agoCassandra replication
2
Chris Glover [DevOps Advocate]1.87K 7 hours ago 0 Comments

Hi,

We have a DMS that spans two sites, with a fast private network between them. Our Cassandra cluster is configured with two Datacenters, each DC has two servers. Replication is configured such that every node has a full copy of the data.

We have a total of 20 DMAs, 10 on each site, split into two availablity zones on each site, 5 nodes in each. The cassandra hosts are also split across each availability zone. in DB.xml consistencyLevel = One. The DB.xml also lists all Cassandra nodes, not just their local ones. Any DMA can talk to any Cassandra node.

We've been experimenting with fault tolerance.

1.) Shutting down one node on a site

We found we were unable to create elements on the site that was missing one out of two nodes Dataminer produced the unable to save element dialogue. Creating elements on the other site was successful.

As all three other cassandra nodes were up and accessible, we was expecting this to work, as only one node has to report data written successfully

Restarting a DMA on the site that was missing a Cassandra node caused the DMA to connect to the other site successfully, and we were then able to create elements on the DMA that was restarted.

This error was also seen:-

Unexpected Exception [Unable to successfully create a storage with ID 6433f867-aec3-4c9a-9b7d-077826bb596d:
An exception occurred when creating a storage: Unable to perform authorization of login permission: Cannot achieve consistency level LOCAL_QUORUM]:
Parameter Notification Thread (   at Skyline.DataMiner.Core.StorageModule.Storages.StorageFactory.<GetStorage>d__11`1.MoveNext()
--- End of stack trace from previous location where exception was thrown ---
   at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
   at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
   at Skyline.DataMiner.Core.DMAObjects.Config.Elements.Storages.ElementStorageFactory.CreateDescriptionStorage(Guid storage)
   at Skyline.DataMiner.Core.DMAObjects.Config.Elements.Storages.ElementCRUD.Initialize()
   at Skyline.DataMiner.Net.MessageBuilder.BuildElementInfoEventMessage(Object[] elementInfo)
   at Skyline.DataMiner.Net.MessageHandler.ElementInfoToMessage(Object[] elementInfo, Dictionary`2 tmpCache)
   at Skyline.DataMiner.Net.MessageHandler.BuildSLNetCOMNotifications(SLNetRequestType type, Int32 subType, Int32 subType2, Object params1, Object params2)
   at Skyline.DataMiner.Net.Notifications.Parameters.ParameterNotificationManager.ExecuteItem(NotificationQueueItem itemToExecute))

I'm confused as to why it's trying for LOCAL QUORUM, with consistencyLevel is set to One.
2.) Shutting down both Cassandras at one site, prevented Datamienr from creating elements anywhere

We was expecting the other site to still allow element creation on the hosts that could communicate with the Cassandras on the other site.

The DB.xml specifies the hosts as follows on Site A:-

<DBServer>A1,A2,B1,B2</DBServer>

On Site B

<DBServer>B1,B2,A1,A2</DBServer>

Our understanding was that with consistencyLevel = One, that so long as a configured node was contactable reads and writes should be successful, with data replicated later, once the other nodes had rejoined the cluster.

In Cassandra, we have applied the following setting to the keyspaces

{'class' : 'NetworkTopologyStrategy','A' : 2,'B' : 2}

What can we do to make this system more fault tolerant, allowing for the scenario where two cassandra nodes are offline?

Simon Declerck [SLC] [DevOps Advocate] Answered question 7 hours ago

1 Answer

  • Active
  • Voted
  • Newest
  • Oldest
2
Simon Declerck [SLC] [DevOps Advocate]1.09K Posted 7 hours ago 1 Comment

Hi Chris,

The Cassandra driver does not allow switching between DataCenters during active operation. In this scenario Cassandra expects the calling software, DataMiner in this case, to be co-housed with it in the same DataCenter with each node pointing at its local Cassandra DC. So if the Cassandra node(s) in one DC go down, DataMiner should either Failover or Swarm functionality to DataMiner nodes in the second DC which point at their local Cassandra DC which is still up. Alternatively, with the DB.xml listed, a restart of the DataMiner Agent would have it connect to the other DC as the first would no longer respond.

Cassandra will typically override the consistency level if you connect with the built in Cassandra user, which is why we recommend creating a dedicated superuser for DataMiner so the consistency that is set is respected. Could you verify whether your DB.xml is configured with the cassandra user or a dedicated one?
See Cassandra Authentication.

There are quite a few details to keep in mind with fault tolerance in self managed storage clusters. If you'd like to have a more in depth sync on this and the possible architectures, feel free to reach out to me at simon.declerck@skyline.be.

Chris Glover [DevOps Advocate] Posted new comment 6 hours ago
Chris Glover [DevOps Advocate] commented 6 hours ago

Thanks Simon!

We create a dedicated dataminer user in Cassandra

Please login to be able to comment or post an answer.

My DevOps rank

DevOps Members get more insights on their profile page.

My user earnings

0 Dojo credits

Spend your credits in our swag shop.

0 Reputation points

Boost your reputation, climb the leaderboard.

Promo banner DataMiner DevOps Professiona Program
DataMiner Integration Studio (DIS)
Empower Katas
Privacy Policy • Terms & Conditions • Contact

© 2026 Skyline Communications. All rights reserved.

DOJO Q&A widget

Can't find what you need?

? Explore the Q&A DataMiner Docs

[ Placeholder content for popup link ] WordPress Download Manager - Best Download Management Plugin