Skip to content
DataMiner DoJo

More results...

Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Search in posts
Search in pages
Log in
Menu
  • Blog
  • Questions
  • Learning
    • E-learning Courses
    • Open Classroom Training
    • Certification
      • DataMiner Fundamentals
      • DataMiner Configurator
      • DataMiner Automation
      • Scripts & Connectors Developer: HTTP Basics
      • Scripts & Connectors Developer: SNMP Basics
      • Visual Overview – Level 1
      • Verify a certificate
    • Tutorials
    • Video Library
    • Books We Like
    • >> Go to DataMiner Docs
  • Expert Center
    • Solutions & Use Cases
      • Solutions
      • Use Case Library
    • Markets & Industries
      • Media production
      • Government & defense
      • Content distribution
      • Service providers
      • Partners
      • OSS/BSS
    • DataMiner Insights
      • Security
      • Integration Studio
      • System Architecture
      • DataMiner Releases & Updates
      • DataMiner Apps
    • Agile
      • Agile Webspace
      • Everything Agile
        • The Agile Manifesto
        • Best Practices
        • Retro Recipes
      • Methodologies
        • The Scrum Framework
        • Kanban
        • Extreme Programming
      • Roles
        • The Product Owner
        • The Agile Coach
        • The Quality & UX Coach (QX)
    • DataMiner DevOps Professional Program
  • Downloads
  • More
    • Feature Suggestions
    • Climb the leaderboard!
    • Swag Shop
    • Contact
      • General Inquiries
      • DataMiner DevOps Support
      • Commercial Requests
    • Global Feedback Survey
  • PARTNERS
    • All Partners
    • Technology Partners
    • Strategic Partner Program
    • Deal Registration
  • >> Go to dataminer.services

Database sizing based upon alarms and alarm rate limits

Solved1.94K views14th July 2023alarm alarm rate Cassandra Elastic
1
Bing Herng Chong [SLC] [DevOps Advocate]1.06K 23rd June 2022 0 Comments

Hi, in a project that we're working on, DataMiner is being used to replace a legacy fault management system. We have been provided some numbers around alarming that we should expect:

1) the total number of alarms currently being stored and maintained by the legacy fault management system. Most recent number indicates that this is around 25 million alarms (both historical and active) out of which about 10 million are clear alarms.
2) an alarm rate of between 4000 - 12000 on a typical day maxing out at 83,000 on at least one day in the last 3 years.

Can we have some advise around how we can use these numbers to estimate the storage requirements (Elastic and Cassandra)?

We would also like some guidance around whether the rate of alarms provided is within the limits of what DataMiner has been validated to handle.

Marieke Goethals [SLC] [DevOps Catalyst] Selected answer as best 14th July 2023

2 Answers

  • Active
  • Voted
  • Newest
  • Oldest
1
Jeroen Nietvelt [SLC] [DevOps Advocate]1.34K Posted 23rd June 2022 1 Comment

Hi Bing,

Here are some alarm metrics i have from a production system which has been running for approx. 10 months on the architecture where the DataMiner cluster (6 nodes - Windows Server OS), the elasticsearch cluster (3 nodes, Red hat Linux) and cassandra cluster (3 nodes, Red hat Linux) is hosted in a vm environment where all of the nodes have their independent VM guest image:

Currently the elasticsearch cluster is holding approx. 176.000.000 unique documents. (a document compares to a row in a table based database) this is resulting in a total dataset size of approx. 330 GB's which is spread over 3 nodes, so each node is approx holding 110 GB's of data currently on disk.

There's quite a bit of alarm and info event activity on this particular platform. A last 24hr count of all activity tells me that there were 747312 alarm records / documents added:

The total stored index size on the cluster currently increases with approx. 1.7 GB's a day. which translates in an increase per node of approx. 600 MB's a day.

Marieke Goethals [SLC] [DevOps Catalyst] Selected answer as best 14th July 2023
Bing Herng Chong [SLC] [DevOps Advocate] commented 24th June 2022

Hi Jeroen, thanks for sharing what is being observed from an actual production system. It appears that the 6-node DMS is capable of handling ~750K alarms per day. That averages out to about ~125K alarms per day per node which is above the alarm rates in the scenario I provided.

1
Ive Herreman [SLC] [DevOps Enabler]13.52K Posted 23rd June 2022 1 Comment

Hi Bing,

Did you give the node calculator a try already?
It should be able to give you a general sense of what will be required.
Not only database-wise, but also the number of dataminer nodes you will need, based on the number of elements, expected trending and alarming.

Bing Herng Chong [SLC] [DevOps Advocate] Edited comment 24th June 2022
Bing Herng Chong [SLC] [DevOps Advocate] commented 24th June 2022

Hi Ive, yes I did look at the calculator. The inputs requires a number of ‘alarm updates’ which is a stat that we do not have. We do have new alarms raised per day but I think ‘alarm updates’ involves more than just new alarms.

Secondly I think the calculator assumes a fixed retention period which is currently not adjustable.

Given that we know the number of alarms that we have in the legacy system’s database (which is over a period of a number of years), I’m looking for a way to translate these into what it means in terms of Elastic DB and Cassandra DB storage size in DataMiner System. I’m thinking this method would give us a better estimate on what we’re looking at over a number of years.

Taking the example provided by Jeroen 747,312 alarms resulting in 1.7 GB implies a per alarm size of 2,443 Bytes.

Would it make sense to use this per alarm size to calculate the Elastic and Cassandra storage or is this over-simplification of the actual situation?:

– Cassandra storage = (total historical alarms + clear alarms) * 2,443 Bytes
– Elastic storage = total current alams * 2,443 Bytes

Are there any other considerations do we need to factor in (other than replication factor)?

Please login to be able to comment or post an answer.

My DevOps rank

DevOps Members get more insights on their profile page.

My user earnings

0 Dojo credits

Spend your credits in our swag shop.

0 Reputation points

Boost your reputation, climb the leaderboard.

Promo banner DataMiner DevOps Professiona Program
DataMiner Integration Studio (DIS)
Empower Katas

Recent questions

Web Applications exception in Cube due to invalid certificate 0 Answers | 0 Votes
Redundancy Groups and Alarming – Duplicate Alarms 0 Answers | 0 Votes
Correlation Engine: “Test rule” doesn’t result in a hit, despite functional rule 1 Answer | 3 Votes

Question Tags

adl2099 (115) alarm (62) Alarm Console (82) alarms (100) alarm template (83) Automation (223) automation scipt (111) Automation script (167) backup (71) Cassandra (180) Connector (109) Correlation (69) Correlation rule (52) Cube (151) Dashboard (194) Dashboards (188) database (83) DataMiner Cube (57) DIS (81) DMS (71) DOM (140) driver (65) DVE (56) Elastic (83) Elasticsearch (115) elements (80) Failover (104) GQI (159) HTTP (76) IDP (74) LCA (152) low code app (166) low code apps (93) lowcodeapps (75) MySQL (53) protocol (203) QAction (83) security (88) SNMP (86) SRM (337) table (54) trending (87) upgrade (62) Visio (539) Visual Overview (345)
Privacy Policy • Terms & Conditions • Contact

© 2025 Skyline Communications. All rights reserved.

DOJO Q&A widget

Can't find what you need?

? Explore the Q&A DataMiner Docs

[ Placeholder content for popup link ] WordPress Download Manager - Best Download Management Plugin