Start a Conversation

This post is more than 5 years old

Solved!

Go to Solution

12228

November 12th, 2010 20:00

Question about PowerPath Migration Enabler for Windows 2008 Clusters

I have been using PowerPath Migration Enabler Host Copy to perform migrations from a CX3-20 to a CX4-120.  It has worked great on my windows servers enabling me to do host-based migrations with zero downtime.

Now it comes to migrating the Windows 2008 clusters.  I'm reading through the PowerPath Migration Enabler 5.5 User Guide and on pages 51-52 it discusses using PPME for clusters.  The steps seem the same until it comes to step 8.

It says:

On Windows 2003 clusters when ready to commit the migration run "powermig commit" and you are finished.

On Windows 2008 clusters it says to first "bring the failover groups offline, stopping the applications", then run "powermig commit", and finally bring the failover groups back online.

Am I reading this correctly that I must bring the groups down in order to commit the migration, but I did not have to do so on 2003?  If so, that is disappointing, because PPME has been great for all of my other migrations.  I wonder why the groups have to be shut down in 2008 when they did not need to be in 2003.

154 Posts

November 15th, 2010 11:00

Hello.  PowerPath Migration Enabler (PPME) is a non-cluster application that under limited conditions can be used to migrate in clusters. It is not aware of the persistent reservation commands.  PPME does not transfer the registration from source to target during the "commit" process.

Once a migration is committed and the cluster starts using the target/new device, it expects to find a persistent reservation registration on that device.  However, it does not have the expected registration. For that reason we need to off-line the resource group, commit the migration, then restart the groups.  This will initialize the new disk with the appropriate registration on the new device.  This doesn't apply to Windows 2003 because it doesn't use the SCSI-3 persistent reservation that Windows 2008 uses.  There will be enhancements to PPME in a coming version that will not require this step.  I hope this helps.

54 Posts

November 13th, 2010 04:00

You are reading it correctly. The basic difference between Windows 2003 Clusters and Windows 2008 Clusters is that Windows 2003 Clusters are dependent on disk signatures - whereas Windows 2008 are dependent on GUIDs. This necessitates the difference between the 2 ways of commiting the changeover.

Thankyou

Sam Claret EMC OSAPI WIndows TSEIII

13 Posts

November 15th, 2010 11:00

Thanks Sam.

Another quick question for you.  In the same document it says to "move all failover groups to the active node" and says to shut down the other nodes in the cluster.

If my cluster has multiple cluster services running and I plan on doing a migration on only one, do I still have to fail all groups to a single node and shut the other nodes down?  And if I do, is it still required to restart all services during powermig commit, or just the one where the migration took place?

I think I can sell taking the momentary outage to restart the individual cluster service, but if I have to shut down every single one of them in this cluster just to migrate one, I don't think that will fly.  It would also be beneficial for me if I could leave the remaining services running on the other nodes while the single service is running on the node that is performing the migration.

154 Posts

November 15th, 2010 12:00

This step applies to PPME-migrated devices.

13 Posts

November 15th, 2010 12:00

That is helpful information.  The only other thing I need to know is if just the resource group being migrated needs to be restarted or if all resource groups must be restarted.

Thanks.

13 Posts

November 15th, 2010 12:00

Thanks for the information.  I will report back after my migrations to let the forums know how they went.  My steps will be as follows:

1.  Assign target disks to all cluster nodes

2.  Assign all resource groups to a single node and shut down the remaining nodes

3.  Use PowerPath Migration Enabler Host Copy to perform the migration on LUNs in a single resource group

4.  Shut down only the resource group whose drives were migrated

5.  Run powermig commit

6.  Restart the resource group that was part of the migration

7.  Bring up the other cluster nodes and reassign resource groups appropriately

154 Posts

November 15th, 2010 13:00

Please add step 5a.

Run powermig cleanup.

You want to ensure that the source device is no longer available in case of a cluster failover.

154 Posts

November 15th, 2010 14:00

If you have multiple applications running across the cluster hardware and you only want to migrate one set of devices for one application, you "could" unmask just those devices from all but one of the cluster nodes.  That way, you wouldn't necessarily have to shut everything down.  Today, that is not an officially supported migration method for PPME.  I would recommend speaking with your EMC representative about an RPQ for this.  The RPQ is a request for a non-standard configuration.

13 Posts

November 17th, 2010 06:00

Thanks for the info on the unsupported method, Brion.  In this case we're working on a Clariion where all three cluster nodes are in the same storage group, so it would be tricky to remove the LUNs only from two of the nodes, but I will keep that in mind for future reference.

13 Posts

November 29th, 2010 06:00

Everything was successful with our PPME migration on a SQL 2008 cluster.  The only outage was the momentary shutdown of the failover groups to run powermig commit.

We did have an issue on a subsequent migration on a non-cluster machine.  We decided to try a migration from a thick LUN to a thin LUN (Clariion thin pool).  When we ran the powermig setup command to put the source and target into a session, the target LUNs (the thin ones) went into an "unreadable" state according to Windows Disk Management.

So this new problem is probably for a new post, but the original SQL 2008 migrations were successful thanks to all of your help.

154 Posts

November 29th, 2010 08:00

Glad to hear that it went well.  With regards to the other item that you noted, this is a known issue when new disks are added to the host.  Here is an excerpt from the PowerPath for Windows Release Notes:

"On Windows Server 2008, disks are marked offline when first added to the host. The user must manually enable these offline disks before they can be used."

I hope this helps.

13 Posts

November 29th, 2010 08:00

It's actually not that the disks are "offline", but they are listed as "unreadable".

We assigned the new disks and manually enabled them.  Up until this point everything was okay, the operating system was seeing them fine.  We initialized them as basic disks.

The problem occurred when we ran powermig setup.  As soon as the PPME session was created, the disks reported as "unreadable" in Disk Management, even though PowerPath is reporting 4/4 paths available with no errors.

If the PPME session is deleted, the disks return to a readable state.

We have a case open with EMC to investigate.  I'm assuming it has something to do with them being thin LUNs.  This is the only difference from our previously successful migrations.

13 Posts

November 29th, 2010 08:00

This is what I thought as well, but when we started the migration the host crashed.

I was tying the "unreadable" to the crash, but perhaps the unreadable note is normal behavior when the target LUN is locked and the crash was related to something else.

154 Posts

November 29th, 2010 08:00

I see.  When you kick off a PPME session with the setup, the target device is not accessible to prevent device access.  I believe that this is what you are seeing.When the migration is complete, the device would be usable again.

30 Posts

December 27th, 2011 12:00

I have a simple 2008r2 two node cluster used for file sharing. I failed/migrated all processes to a single node and shut down the other. Now I have a single server running all the services. No problem. I plan on using PPME to move the data from one CLARiiON to another. I have successfully used PPME on standalone 2008r2 servers without any problems. But I'm confused about step #4 above. Please excuse my ignorance but what is a "resource group"? In the "Failover Cluster Manager" I can't find any "resource group" to shut down? I will attempt to attach a screen shot of what I see.

Thank you.

-Brian

1 Attachment

No Events found!

Top