Skip to content
DataMiner DoJo

More results...

Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Search in posts
Search in pages
Log in
Menu
  • Blog
  • Questions
  • Learning
    • E-learning Courses
    • Empower Replay: Limited Edition
    • Tutorials
    • Open Classroom Training
    • Certification
      • DataMiner Fundamentals
      • DataMiner Configurator
      • DataMiner Automation
      • Scripts & Connectors Developer: HTTP Basics
      • Scripts & Connectors Developer: SNMP Basics
      • Visual Overview – Level 1
      • Verify a certificate
    • Video Library
    • Books We Like
    • >> Go to DataMiner Docs
  • Expert Center
    • Solutions & Use Cases
      • Solutions
      • Use Case Library
    • Markets & Industries
      • Media production
      • Government & defense
      • Content distribution
      • Service providers
      • Partners
      • OSS/BSS
    • DataMiner Insights
      • Security
      • System Architecture
      • DataMiner Releases & Updates
    • Agile
      • Agile Webspace
      • Everything Agile
        • The Agile Manifesto
        • Best Practices
        • Retro Recipes
      • Methodologies
        • The Scrum Framework
        • Kanban
        • Extreme Programming
      • Roles
        • The Product Owner
        • The Agile Coach
        • The Quality & UX Coach (QX)
    • DataMiner DevOps Professional Program
  • Downloads
  • More
    • Feature Suggestions
    • Climb the leaderboard!
    • Swag Shop
    • Contact
      • General Inquiries
      • DataMiner DevOps Support
      • Commercial Requests
    • Global Feedback Survey
  • PARTNERS
    • All Partners
    • Technology Partners
    • Strategic Partner Program
    • Deal Registration
  • >> Go to dataminer.services

Some elements of various protocols start in error state after agent restart

Solved814 views11th August 20210x80040226 General database failure
2
Christhiam Aaron Calatayud Franco [DevOps Advocate]85 6th August 2021 0 Comments

I recently had to restart one of my agents for another reason

I then happened what I will describe below.
Once started I noticed that 13 of the 310 elements of various protocols started with the following common error message:
Initializing the protocol for ELEMENT_NAME failed. General database failure. (hr = 0x80040226)

When validating log of the elements I do not observe errors and even refers that the elements initialized normally, example:
CElement :: NotifySNMPManagers | DBG | 5 | Notified SNMPManagerV2 of active 26416/540. The operation completed successfully. (hr = 0x00000000)
**********

When trying to start the element I observe that the option to start is not available so I choose the only viable option “restart” and indeed the element starts normally.

I remember that this had happened before but I consider it an isolated case until today.

I would like to know if there is a reference to the error codes to move forward with a more in-depth investigation of the root cause.

I hope you can guide me in this regard.
Greetings

Christhiam Aaron Calatayud Franco [DevOps Advocate] Selected answer as best 11th August 2021

1 Answer

  • Active
  • Voted
  • Newest
  • Oldest
2
Brent Alleweireldt [SLC]1.53K Posted 11th August 2021 3 Comments

Hey Christhiam,

A comprehensive list of the error message and their meaning can be found Here.

Initializing the protocol for ELEMENT_NAME failed. General database failure. (hr = 0x80040226) is probably the culprit here and points towards issues with detching the correct data from the database.

A good point to start here might be to check SLDataGateway and SLDBConnection logfiles for issues.

Brent Alleweireldt [SLC] Posted new comment 12th August 2021
Christhiam Aaron Calatayud Franco [DevOps Advocate] commented 11th August 2021

Hi Brent,
Thank you very much for the list of error codes.

I’ll see what I find in the referred logs
Regards

Christhiam Aaron Calatayud Franco [DevOps Advocate] commented 11th August 2021

Hi Brent,
I had to restart the agent again but this time I took the opportunity to restart the DB.
On this occasion, the number of elements that started in an error state increased to 21
Look in the logs you recommended and I found the following message that is repeated many times in SLDBConnection:
2021/08/11 16:52:30.902|SLDBConnection|SLDBConnection|INF|0|76|CassandraConnection.ExecuteAsync (INSERT INTO datapoints(“d”,”e”,”p”,”w”,”i”)VALUES (?,?,?,?,?)USING TTL ?;) – Exception: DBGatewayException(SLCassandraClassLibrary.DBGateway.Cassandra.StorageManagers.SingleNode.CassandraConnection,,UNKNOWN) (Code: 0x80131500) SLDataGateway.Types.DBGatewayException: CassandraConnection.ExecuteAsync (INSERT INTO datapoints(“d”,”e”,”p”,”w”,”i”)VALUES (?,?,?,?,?)USING TTL ?;) – Exception: System.AggregateException: One or more errors occurred. —> Cassandra.NoHostAvailableException: All hosts tried for query failed (tried 127.0.0.1:9042: BusyPoolException ‘All connections to host 127.0.0.1:9042 are busy, 2048 requests are in-flight on each 2 connection(s)’)
at Cassandra.Requests.RequestHandler.GetNextValidHost(Dictionary`2 triedHosts)
at Cassandra.Requests.RequestExecution.d__13.MoveNext()
— End of inner exception stack trace —
—> (Inner Exception #0) Cassandra.NoHostAvailableException: All hosts tried for query failed (tried 127.0.0.1:9042: BusyPoolException ‘All connections to host 127.0.0.1:9042 are busy, 2048 requests are in-flight on each 2 connection(s)’)
at Cassandra.Requests.RequestHandler.GetNextValidHost(Dictionary`2 triedHosts)
at Cassandra.Requests.RequestExecution.d__13.MoveNext() Cassandra.NoHostAvailableException: All hosts tried for query failed (tried 127.0.0.1:9042: BusyPoolException ‘All connections to host 127.0.0.1:9042 are busy, 2048 requests are in-flight on each 2 connection(s)’)
at Cassandra.Requests.RequestHandler.GetNextValidHost(Dictionary`2 triedHosts)
at Cassandra.Requests.RequestExecution.d__13.MoveNext()
— End of inner exception stack trace —
at SLCassandraClassLibrary.DBGateway.ExceptionHandlers.ExceptionHandler.handle(DBGatewayException exception)
at SLCassandraClassLibrary.DBGateway.Cassandra.StorageManagers.SingleNode.CassandraConnection.c__DisplayClass175_0.b__0(Task`1 c)
Cleaned Stack !!!
**********
Can you guide me regarding: BusyPoolException?

As always I appreciate your guidance
Regards

Brent Alleweireldt [SLC] commented 12th August 2021

The BusyPoolException is an unexpected one. Essential a cassandra node can only handle so many concurrent requests (2048) when the driver notices a host is handling this amount it will try the next host according to the loadbalancing policy. If all the nodes in the setup are at the max, cassandra will throw the BusyPoolException.

So this leads us to believe that some element or protocol is currently spamming the database with too many read/write requests. A way to investigate this would be to stop the dma, use “C:Skyline DataminerToolsChange Element States Offline.exe” to stop all the elements. Start the dma, once the dma is started you can start the elements in small batches (10 or so). By doing this you can start narrowing down the elements that could cause issues.

Please login to be able to comment or post an answer.

My DevOps rank

DevOps Members get more insights on their profile page.

My user earnings

0 Dojo credits

Spend your credits in our swag shop.

0 Reputation points

Boost your reputation, climb the leaderboard.

Promo banner DataMiner DevOps Professiona Program
DataMiner Integration Studio (DIS)
Empower Katas
Privacy Policy • Terms & Conditions • Contact

© 2025 Skyline Communications. All rights reserved.

DOJO Q&A widget

Can't find what you need?

? Explore the Q&A DataMiner Docs