Skip to content
DataMiner DoJo

More results...

Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Search in posts
Search in pages
Log in
Menu
  • Updates & Insights
  • Questions
  • Learning
    • E-learning Courses
    • Empower Replay: Limited Edition
    • Tutorials
    • Open Classroom Training
    • Certification
      • DataMiner Fundamentals
      • DataMiner Configurator
      • DataMiner Automation
      • Scripts & Connectors Developer: HTTP Basics
      • Scripts & Connectors Developer: SNMP Basics
      • Visual Overview – Level 1
      • Verify a certificate
    • Video Library
    • Books We Like
    • >> Go to DataMiner Docs
  • Expert Center
    • Solutions & Use Cases
      • Solutions
      • Use Case Library
    • Markets & Industries
      • Media production
      • Government & defense
      • Content distribution
      • Service providers
      • Partners
      • OSS/BSS
    • Agile
      • Agile Webspace
      • Everything Agile
        • The Agile Manifesto
        • Best Practices
        • Retro Recipes
      • Methodologies
        • The Scrum Framework
        • Kanban
        • Extreme Programming
      • Roles
        • The Product Owner
        • The Agile Coach
        • The Quality & UX Coach (QX)
    • DataMiner DevOps Professional Program
      • About the DevOps Program
      • DataMiner DevOps Support
  • Downloads
  • More
    • DataMiner Releases & Updates
    • Feature Suggestions
    • Climb the leaderboard!
    • Swag Shop
    • Contact
    • Global Feedback Survey
  • PARTNERS
    • All Partners
    • Technology Partners
    • Strategic Partner Program
    • Deal Registration
  • >> Go to dataminer.services

Cassandra cluster sync status?

Solved1.06K views18th July 2023Cassandra
0
Jeff Douglass860 27th October 2021 1 Comment

In a Cassandra based DM failover configuration, how can one tell if the 2 Cassandra node are %100 in sync, contain the exact same data? My understanding is that the nodetool status command’s “Load” column value for each node will match if they are in sync but mine never match and I have seen in them be considerably different, sometimes close to a factor of 2. For example currently it is showing DMA 1 = 1.38G and DMA 2 = 76.92M. This is a small test system so not a large database but that difference seams to be an indication that the 2 nodes do not contain the same data.

Thanks

Marieke Goethals [SLC] [DevOps Catalyst] Selected answer as best 18th July 2023
Marieke Goethals [SLC] [DevOps Catalyst] commented 18th July 2023

As this question has been inactive for a long time, we will now close it. If you still want further information, could you post a new question?

1 Answer

  • Active
  • Voted
  • Newest
  • Oldest
0
Bert Vervaele [SLC]50 Posted 28th October 2021 2 Comments

Hi Jeff we recentyl recieved a similar question and this was the Cassandra champions answer:

Having a large size difference could be normal as it highly depends on when the repair/compaction actions happen.

To give you an example you can see below graph.

When a compaction is performed (should be triggered on both nodes on different time) you will see the live space used going down.

When you perform a repair it could be that the size increases (if not all data was on both nodes).

If you would have a lot of RealTime trending (data kept for 1 day by default) you will see large drops when performing compaction.

That is hen you could have larger differences than 10% which could be perfectly normal.

I would say that it is more important to check if there are no hint files created and if there are no repairs failing.

Marieke Goethals [SLC] [DevOps Catalyst] Selected answer as best 18th July 2023
Jeff Douglass commented 28th October 2021

Looks like Jeroen also posted a response but it is no longer visible. In response to his post I received by email, my test system is a basic standard DM 10.1 CU07 failover configuration running Cassandra on both DMAs, not dedicated dbase servers. Nodetool status is still showing a similar difference in Load values this morning. I have never run the repair operation due to not seeing a need to. I am not aware of any conditions, such as a DMA going offline for any period of time, that could account for this difference and trigger the need to run repair

Jeff Douglass commented 28th October 2021

I just ran the nodetool repair operation and the status still returns a significant difference in the Loads. DMA 1 = 274M DMA 2=1.48G. Shouldn’t these match almost exactly after the repair? The repair operation returned with “completed successfully”

Please login to be able to comment or post an answer.

My DevOps rank

DevOps Members get more insights on their profile page.

My user earnings

0 Dojo credits

Spend your credits in our swag shop.

0 Reputation points

Boost your reputation, climb the leaderboard.

Promo banner DataMiner DevOps Professiona Program
DataMiner Integration Studio (DIS)
Empower Katas
Privacy Policy • Terms & Conditions • Contact

© 2025 Skyline Communications. All rights reserved.

DOJO Q&A widget

Can't find what you need?

? Explore the Q&A DataMiner Docs