Conflicting Local Cassandra Failover Schema

Solved | 1.10K views | 5th April 2023 | Cassandra, schema
5
Ryan Reuss [SLC] [DevOps Member] (490) | 31st March 2023 | 0 Comments

Hello Dojo,

We had an issue where a failover agent suffered a complete hardware failure and needed to be reinstalled on a new machine. The backup agent was reinstalled on new hardware, but there was no backup to restore, so we did a fresh 10.2 installation and upgraded to 10.2 CU11. After joining this new agent to failover, it was reported that elements were no longer working, and we noticed a schema mismatch: the previous schema had been lost and a new one created.

I have attempted to resolve this mismatch by doing a nodetool drain and rolling restarts on both nodes. When that did not work, we broke failover from the Failover Status window in Cube on the primary agent, reinstalled the backup agent again, set the primary node back to localhost, and ran a nodetool removenode of the backup node from the primary node, as it still appeared in nodetool status after breaking failover.

After rejoining the backup agent in failover, we have the same issue again. It seems as though the schema conflict resides somewhere in the primary node, but I am not sure where it could be stored or how to resolve it.

Thank you in advance for any insight and info!

Ryan Reuss [SLC] [DevOps Member] Selected answer as best 5th April 2023

3 Answers

7
Michiel Saelen [SLC] [DevOps Enabler] (5.63K) | Posted 3rd April 2023 | 3 Comments

Hi Ryan,

When DataMiner starts, it creates the tables if they do not exist yet. When a new Cassandra node is added to an existing cluster, you should point it to the seeds of the existing cluster so that the new node learns the schema. A node that is itself a seed can start without contacting other seeds. Most likely, the failover agent added a blank Cassandra node as a seed. When DataMiner communicates with that node just after it starts, the node does not have the schema yet, so DataMiner creates the needed tables with the same names, and these overwrite the existing tables.
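To check whether a node is configured as its own seed, you could inspect the seeds line in its cassandra.yaml. The following is only a minimal Python sketch: it assumes the standard single-line `- seeds: "..."` entry under `seed_provider`, IPv4 addresses or host names, and no trailing comment on that line. The function names and the example IPs are made up for illustration.

```python
import re

# Matches the uncommented "- seeds: ..." line of the seed_provider section.
SEEDS_LINE = re.compile(r'^\s*-\s*seeds:\s*"?([^"\n]*)"?\s*$', re.MULTILINE)


def parse_seeds(yaml_text):
    """Return the seed addresses listed in cassandra.yaml text (ports removed)."""
    match = SEEDS_LINE.search(yaml_text)
    if not match:
        return []
    seeds = []
    for entry in match.group(1).split(","):
        entry = entry.strip()
        if entry:
            # Drop an optional ":port" suffix (IPv4 or host name only).
            seeds.append(entry.split(":")[0])
    return seeds


def lists_itself_as_seed(yaml_text, own_address):
    """True if the node's own address appears in its seed list."""
    return own_address in parse_seeds(yaml_text)


# Hypothetical example content; the IPs are placeholders.
EXAMPLE_YAML = '''
seed_provider:
    - class_name: org.apache.cassandra.locator.SimpleSeedProvider
      parameters:
          - seeds: "10.0.0.1,10.0.0.2"
'''

if __name__ == "__main__":
    # 10.0.0.2 stands in for the freshly installed backup node.
    print(lists_itself_as_seed(EXAMPLE_YAML, "10.0.0.2"))
```

If this returns True for a blank node that is about to join an existing cluster, that node can start without learning the schema from the other seeds, which is the situation described above.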

If you end up with two folders for your tables, each with a different ID, and you want to restore the data, you first need to figure out which one the cluster is now using. You can see the ID in use in the tables table of the schema keyspace. If you still have one node using the correct tables, you could stop the Cassandra nodes that are using the incorrect ID, clean them (SSTables, commit logs, hint files and cache), and add them again with their seeds set to nodes that have the correct schema, without listing the node itself as a seed.
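To spot tables that have more than one data folder, you can rely on the way Cassandra names table folders: the table name, a hyphen, and the table ID without hyphens. Below is a rough Python sketch that works on a plain list of folder names. It assumes Cassandra 3.x or later, where the ID can be read from system_schema.tables; the table names used in the example are hypothetical.

```python
import re
from collections import defaultdict

# The active ID can be looked up in cqlsh, for example:
#   SELECT table_name, id FROM system_schema.tables WHERE keyspace_name = '<your keyspace>';

# Folder name layout: <table name>-<table ID as 32 hex characters>
TABLE_DIR = re.compile(r"^(?P<name>.+)-(?P<id>[0-9a-f]{32})$")


def dir_suffix(table_id):
    """Convert a table ID (UUID with hyphens) to the suffix used in the folder name."""
    return table_id.replace("-", "").lower()


def conflicting_tables(dir_names):
    """Return {table name: [IDs]} for tables that have more than one folder."""
    groups = defaultdict(list)
    for dir_name in dir_names:
        match = TABLE_DIR.match(dir_name)
        if match:
            groups[match.group("name")].append(match.group("id"))
    return {name: sorted(ids) for name, ids in groups.items() if len(ids) > 1}


def active_dir(table_name, active_id):
    """Folder name that belongs to the ID currently registered in the schema."""
    return f"{table_name}-{dir_suffix(active_id)}"


if __name__ == "__main__":
    # Hypothetical folder listing of one keyspace directory.
    listing = ["alarm-" + "a" * 32, "alarm-" + "b" * 32, "info-" + "c" * 32]
    print(conflicting_tables(listing))
```

Any table reported here has both an old and a new folder; comparing the IDs with the one in the schema tells you which folder the cluster is actually using.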

If the new tables are already accepted by all nodes, then you can restore the old data by copying the SSTables from the old folder to the new one and restarting Cassandra.
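The copy step could look roughly like the Python sketch below. It is an illustration only: it assumes Cassandra is stopped on the node while copying, that the two folders belong to the same table, and that no file names clash between them (the function refuses to overwrite anything). The function name is made up, and the paths are whatever your installation uses.

```python
import shutil
from pathlib import Path


def copy_sstables(old_dir, new_dir):
    """Copy the files of old_dir into new_dir without overwriting anything.

    Subfolders such as snapshots or backups are skipped on purpose.
    Returns the list of copied file names.
    """
    old_dir, new_dir = Path(old_dir), Path(new_dir)
    sources = sorted(p for p in old_dir.iterdir() if p.is_file())

    # Check for clashes first, so nothing is copied if a conflict exists.
    clashes = [p.name for p in sources if (new_dir / p.name).exists()]
    if clashes:
        raise FileExistsError(f"Files already exist in {new_dir}: {clashes}")

    for src in sources:
        shutil.copy2(src, new_dir / src.name)  # copy2 keeps timestamps
    return [p.name for p in sources]
```

Taking a copy of both folders before you start is a sensible precaution. If a clash is reported, it is safer to sort it out manually than to overwrite files of the table that is in use.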

Hope this helps.

Michiel Saelen [SLC] [DevOps Enabler] Posted new comment 4th April 2023
Michiel Saelen [SLC] [DevOps Enabler] commented 3rd April 2023

Running a nodetool status on the different nodes during the process can also help you to identify what is going on.
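If you want to compare the output across nodes or over time, a small parser can help. This Python sketch assumes the usual nodetool status layout, where each node line starts with a two-letter code (U or D for up/down, then N, L, J or M for normal, leaving, joining or moving) followed by the address. The sample output is invented for illustration.

```python
import re

# First letter: Up/Down. Second letter: Normal/Leaving/Joining/Moving.
NODE_LINE = re.compile(r"^([UD])([NLJM])\s+(\S+)")


def parse_status(output):
    """Extract up/down flag, state letter and address for every node line."""
    nodes = []
    for line in output.splitlines():
        match = NODE_LINE.match(line.strip())
        if match:
            nodes.append({
                "up": match.group(1) == "U",
                "state": match.group(2),
                "address": match.group(3),
            })
    return nodes


def nodes_needing_attention(output):
    """Addresses of nodes that are not both Up and Normal."""
    return [n["address"] for n in parse_status(output)
            if not (n["up"] and n["state"] == "N")]


# Invented sample output; addresses and host IDs are placeholders.
SAMPLE = """Datacenter: datacenter1
=======================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
--  Address   Load     Tokens  Owns (effective)  Host ID                               Rack
UN  10.0.0.1  1.2 GiB  256     100.0%            11111111-1111-1111-1111-111111111111  rack1
DN  10.0.0.2  ?        256     100.0%            22222222-2222-2222-2222-222222222222  rack1
"""

if __name__ == "__main__":
    print(nodes_needing_attention(SAMPLE))
```

A node that stays listed as down after failover has been broken, like the backup node in the question, would show up in this list.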

Ryan Reuss [SLC] [DevOps Member] commented 4th April 2023

Regarding “When a new Cassandra node is being added to an existing cluster, you should point it to the seeds of the existing cluster to ensure the schema is known to the new node.”: when you configure a failover agent from Cube, it automatically adds the IPs of both failover agents as seeds in the cassandra.yaml files of the primary and the backup agent. Is this a potential software issue, or is it handled differently when the node is first joined, with the software only adding both IPs to the YAML after the nodes have joined?

Michiel Saelen [SLC] [DevOps Enabler] commented 4th April 2023

It depends a bit on how the software handles adding a failover agent (for example, it could wait until the node is fully added to the Cassandra cluster before connecting to it). Ideally, you do not configure a new node as a seed, to prevent this situation, so you could see this as a bug.
