Question

Solved2.13K views17th July 2023ben support

0

Aejith Antony [DevOps Member]92 1st March 2022 3 Comments

For Example : I have to monitor a switch or other network devices but i don’t have the list of parameters so, How can i know which parameter needs to be monitored. May be it could be client Decisions but skyline have any reference list for that?

Marieke Goethals [SLC] [DevOps Catalyst] Selected answer as best 17th July 2023

Alberto De Luca commented 1st March 2022

If there is protocol already built, I’d start from the parameters listed for the Alarm template and Trend Template – is this CISCO? Juniper?

Aejith Antony [DevOps Member] commented 1st March 2022

Hi Alberto,

Cisco Switch

Alberto De Luca commented 1st March 2022

I see, sharing my experience below.

2 Answers

1

Ben Vandenberghe [SLC] [DevOps Advocate]9.74K Posted 1st March 2022 1 Comment

Indeed Alberto.

I would personally add that the decision on which metrics to monitor will typically depend on different things. First of all, especially for an IP switch as referenced here, it will largely depend on the kind of environment. Managing an office network or an IP network that carries media flows for example is a completely different task, and that would be reflected also in your choice of metrics. Secondly it also could depend on your overall policies and strategies. E.g. I know that people sometimes work with a policy that a severity critical is only used for alarms that are guaranteed to be catastrophic for the services (i.e. your service is guaranteed to be down), major for events that are highly likely to be service affecting at least, etc. So your choice of metrics and the severity used might be impacted by such policies.

I note that you mention that you do not have an overview of the available metrics. But note that if you go to the default Alarm Template, it will show you all the metrics (note that there is a button on the top right that allows you to filter the list to the actual monitored metrics, or to see all of the metrics available – so you may have to switch that if not all metrics are available). Note that you can also go to the driver of the element, and if you right click on the version, you have an export option there to export all the metrics in CSV format, and this will give you all the metrics available for that type of element.

Also note that if you enable alarming on a specific metric, there could already be some default alarm thresholds that are suggested (that is the case if the driver does include default suggestions for that metric).

Let us know if this helps or feel free to further elaborate on what it is that you wonder about, and we can see if we can provide some more specific input.

Marieke Goethals [SLC] [DevOps Catalyst] Selected answer as best 17th July 2023

Aejith Antony [DevOps Member] commented 2nd March 2022

OK . I will go with this Opinion and I will update asap .

Thank you Ben

If there is protocol already built, I’d start from the parameters listed for the Alarm template and Trend Template – is this CISCO? Juniper?
OK . I will go with this Opinion and I will update asap .

Thank you Ben

score 1 · Answer 1 · 2022-03-01T11:55:27+00:00

There are several strategies you can follow – normally you’d need some input from the system owners (e.g. if a set of ports is known to be unwired / admin down, and if they want to ignore them in the monitoring criteria or if they want to be alerted when their state changes).

A first iteration could establish what are the relevant KPIs, which ones need alarming, which ones need trending – the following iterations will refine the monitoring criteria until you have a set of rules that meet your DoD “definition of done” and you can roll out in the production environment.

Other approaches also work, however multiple iterations (between DM admins and system owners) will allow you to have less false alarms in your environment. Once it’s done for one element, it’s releatively quick and easy to extend to all the other elements of the same type in your DMS (I’m saying type, not protocol, as a LEAF node would possibly have a different set of KPIs when compared to a SPINE, even if they are polled with the same DataMiner protocol, hence the importance of dealing with the system owners to establish your monitoring criteria).

So, I need to check with system owners or manufactures. As Well as if already created drivers against elements i can refer that right?
Yes, there are several parmaters you’ll find defined in a DM protocol, especially in complex ones for network equipment: bandwidth, discarded packets, state of the ports… are essentially something that could be monitored, but to what extent can be determined only with a good knowledge of your service architecture.

How we can find parameters for monitoring individual elements in a network?

2 Answers