Unsolved

This post is more than 5 years old

31489

June 6th, 2014 17:00

partial hang on R520 with Server 2012 STD (not R2)

we are getting a weird partial hang on our R520 and wanted to see if anyone had any suggestions.

The system will lock up for about 10-15 minutes, and slowly network connections will start to fail. It seems MUCH more likely to happen while I am RDPd in. It also seems to tie in somehow with VSS. If I use Acronis (our backup agent) then when moving between one backup and another, the system will lock. About 10-15 min later, it frees up with the below error messages. It has also done this a few times while trying to query permissions on deep folders.

----------------

Error in Windows : 1006

Automatic System Recovery (ASR) action was performed
Action performed was: Unknown
Date and time of action: Fri Jun 06 04:07:56 2014

In iDrac:

ASR0000: The watchdog timer expired.
  2014-06-06T04:07:56-0500
Log Sequence Number: 1382
Detailed Description:
The operating system or potentially an application failed to communicate to the baseboard management controller (BMC) within the timeout period.
Recommended Action:
Check the operating system, application, hardware, and system event log for exception events.
Comment: 
--------------------------
Anyone have any suggestions on what to try? Or seen this issue?

12 Elder

 • 

6.2K Posts

June 7th, 2014 14:00

Hello

You need to perform more troubleshooting to isolate the problem. Based on this information there are a lot of possibilities. The BMC watchdog timer message can be ignored. If the system freezes/locks up then it will not send the heartbeat responses to the watchdog timer. This is just a symptom of the problem.

You need to try to isolate the issue. Something appears to be maxing out a resource. Check to see if the HDD, memory, CPU, or network are being highly utilized when this issue happens.

Find out if this is happening at certain intervals. It could be some type of indexing operation. What is the HDD configuration on the server - number of drives, RAID controller, RAID levels? Is any of the storage on a storage appliance of some kind or is it all local storage?

Thanks

June 8th, 2014 06:00

Unfortunately, the system does reach a literal hang. So monitoring resources is not possible. However, right up until the hang, there doesn't seem to be high IO, no HD flashes (they actually slow down right up until the hang, then go idle) the memory is around 10% utilization, and CPU is likewise less than 10%. The server is mainly files, so there isn't high CPU/Mem demand, and the load is primarily network shares. (2Gb max on a 4Gb 4x1Gb QP LAN with failover to 2 additional GB LAN ports, all tied into a Dell 4824PowerConnect)

The timing is also random, but can be induced to hit the lockup.

6 Operator

 • 

1.8K Posts

June 8th, 2014 11:00

Google                 Acronis  "windows 2012"  lock up                                  or similar wording.

Personally I have had lockups/blue screens issues on a few machines with Acronis services resident, disabled the services.

 

June 10th, 2014 10:00

I've had the services disabled for a while now, and the lockups still happen.

I will attempt a full uninstall of acronis in the next few days. But I don't think it is acronis causing the issue, especially since the issue still happens with acronis disabled.

No Events found!

Top