Unsolved

This post is more than 5 years old

132449

January 22nd, 2016 11:00

Extremely slow SAN performance for VMs

I was hoping someone might be able to assist with some major performance issues we are having in our production environment. (NOTE: I inherited this setup, sans some recent changes to the swtich cabling, when I started with the company)

Setup:

1 EQL PS6100e iSCSI SAN (24x1TB NL-SAS 7200rpm drives) with one monolithic LUN

~50 VMs running off of the SAN with at lest 1 volume per VM, some with 2, spread across 10 VM hosts

iSCSI traffic is running through a PC7048P switch, on its own VLAN (FW 5.1.1.7)

VM traffic is running through a second PC7048, on 2 VLANs (FW 5.1.8.2)

VM hosts are running Free ESXi 5.5 update 2. Most have 2 NICs for VM traffic and 2 NICS/kernel ports for iSCSI with multipathing

The main issue we are seeing is that our hosted clients are experiencing extreme lag when running their primary application (2 applications, one runs against an Advantage database server that also houses the roaming profiles data directory, and the other runs against a separate POSTgreSQL database server). This has essentially rendered these applications nigh unusable.

Is this something that could be configuration based (i.e. not using a 1:1 LUN to Volume for the database servers) or some other inherent issue I could be looking for? At this point, we are probably going to migrate the database VMs to a local storage VM host setup (5 SATA 3Gb/s in RAID 5) to try and eliminate some of the severe latency issues while we look at and probably purchase new servers with sufficient 10K or 15K SAS drives to support our needs.

3 Posts

February 29th, 2016 10:00

Hi, we dont have a SAN attached to a VMware System , but in  a Hyper-V Enviroment , but 50 VM on one Member can bear some Problemes if it is a Near Line SAS with only 7200rpm Drives.  

If you look inside the SAN HQ you might not see the spikes , as the Frontend is not updated to often.

If you have Performance Issues on you mind , go to the SAN Headquarter and do a live Session Path : [tag:group]/I/O/Live View Sessions ... you can log up to 10 Minutes in Steps up to 1Sek .. not more.

But there you will see, that 50VM are not be meant to be put on a Near Line SAS Member , even if it is configured with Raid10 IOPS and latency will not work on the average for "normal" Server, if you put a SQL Server on it , it just wont work.  ( I would expect a maximum of up to 1400IOPs for the System "IF" Raid10 )

No Events found!

Top