Dear community,
I would like to be able to analyze all the alarms that I have in a console for a defined period for the moment.
As there are a huge amount of alarms, I was wondering if it was possible to extract all those datas, or extract the data on a Splunk or a tool that treat big amount of data.
If you have any ideas or experience a project like this, I'm willing to see the different possiblities.
Thank you in advance,
Laurine
Hi Laurine,
Interesting question.
In our DataMiner Dashboards, we have a data source called 'Queries'. With Queries you can get access to all kinds of data in DataMiner and play with it. You can for example join it with other data sources, filter it, sort it, aggregate & group it, ... You can visualize it in tables & apply grouping/filtering/coloring to it, you can visualize it in charts, you can use it as a source for a node-edge visualization, ... (see also Queries - DataMiner Dojo)
Almost every month we are extending Queries with more capabilities and/or data sources. For 10.1.9, we are expecting the 'alarm' data to be in there. In this first version, you'll get access to all your alarms and then you can play with it in the DataMiner Dashboards application. Later on we will extend the alarm data source with extra 'calculated' columns like the alarm 'close time', ...
In DataMiner versions prior to 10.1.9, the alarm data source is in soft launch, so you need to enable it manually (see Soft-launch options - DataMiner Dojo). Although you can already take a look at this data source in soft launch, I must say that its easier to use and performs better in 10.1.9.
Feel free to share your use cases & user stories, so we can think along with you to build the right queries.
Hi Pieter, thanks a lot for this answer, it’s very promising and it will help a lot.
We have a huge amount of alarms in our dataminer cluster. We are trying to find a way to reduce the amount of alarms. We were thinking of putting an hysteresis of 1 minute for each alarms (except the for the critical services).
However, we need to study how often we have flapping alarms and on which service to avoid missing some important alarms.
Moreover, due to bad weather on some site we have sometimes hundreds of alarms. We would like to not display those alarms.
That’s why analyzing which services are often in alarms, the localization, the flapping event could be very helpful for creating correlation and reducing our workflow.
This feature seems very promising. I will take a look at this data source in soft launch. If you need any other information, do not hesitate.
Thanks a lot for your detailed answer
Laurine