Unsolved

This post is more than 5 years old

23 Posts

5785

February 7th, 2014 14:00

100 % Utilization on Pool LUNS

Hi All,

We are seeing almost all day with 100% utilization on couple of LUNS on our VNX 5700 . The two LUNS are part of SQL DB (windows 2008)  and I am seeing mostly Read throughput high.  The two LUNS are part of FAST VP pool which has 10 * 2 TB NL SAS + 5 * 200 GB Flash + 10 * 600 GB SAS drives . This is causing high response times and process are running slow.

Current Tiering on the LUN is 16.76/49.89/33.35 ( EP/P/C) and other LUN 2.98/34.97/62.04 (EP/P/C) .. There is some database update/reports processing  happening on those LUNS.

The policy is set as Auto .

can someone please let me know what option/tuning I can try from storage side and to help for better performance

11 Legend

 • 

20.4K Posts

 • 

87.4K Points

February 7th, 2014 14:00

how many read/write iops do you see to those LUNs ..as well as other LUNs. Did you look in analyzer to see how many IOPS your SAS/NL-SAS are handling ? High response times throughout the day or specific times. What kind of response times do you see ?

6 Operator

 • 

4K Posts

February 7th, 2014 17:00

Are you using Thin LUN? Thin LUN is not designed for performance. You can try to migrate SQL DB files to Thick LUN or RAID group LUN (traditional LUN). RAID 1/0 would be better.

Anyway, you can collect Analyzer logs (.nar/.naz files) and send to EMC support for analysis.

How to gather the necessary information for a CLARiiON or VNX performance Analysis ?

6 Operator

 • 

1.1K Posts

February 7th, 2014 18:00

LUN Utilization: The LUN Utilization metric of the LUN Performance Bottleneck indicator measures/displays how often the LUN is serving an incoming request. A high LUN utilization does not necessarily mean that the LUN is nearing capacity. A LUN is considered busy if any of its disks are busy, so LUN utilization usually presents a "conservative" view.  That is, a high LUN utilization value does not necessarily indicate that the LUN is approaching its maximum capacity, just that at least one of its disks are busy.

Values in the columns are calculated with the folowing:

"% Utilization (avg. over period)"

"% Utilization (current)"

So it is firstly necessary to figure out which disks in the problemetic LUNs were that busy, which types of file were placed in those disks and during which period the LUN utilization ran to 100%.

Suggest you start the analysis with both storage side and application side.

On the storage side, follow this EMC KB:https://support.emc.com/kb/9132 to gather the storage performance report, check what target disk type they are and pay attention to the iops on disk for OLTP and bandwidth on disk for data warehouse in the performance report. Check how the read cache set and if the FAST cache was enabled on the LUNs and pay attention to the cache hit ratio in the performance report.

On the application side, find out the file type in those disks and what kind of database operation was processing in the peak time. Since the utilization indicates disk busy, it is also better to tune the database SQL statements and file logical distribution.

You may want to read over the below doc to get a clearer understanding on SQL and VNX performance.

https://support.emc.com/docu50157_White-Paper:-Microsoft-SQL-Server-Best-Practices-and-Design-Guidelines-for-EMC-Storage-EMC-VNX-Family,-EMC-Symmetrix-VMAX-Systems,-EMC-Xtrem-Server-Products.pdf?language=en_US

23 Posts

February 10th, 2014 09:00

also the 10k SAS drives are doing around avg 300 - 350 IOPS and NL SAS drives are barely doing any IOPS I see around less than 30 IOPS on NL SAS drives.

23 Posts

February 10th, 2014 09:00

Hi Dynamox,

I am seeing the total iops on the LUN around 5000 iops in which 88 % is READ and the response times around 50ms.

Utilization is pegged at 100 % almost all day.

11 Legend

 • 

20.4K Posts

 • 

87.4K Points

February 10th, 2014 10:00

you are pushing those SAS drives really hard, you need to start thinking about adding more drives to the pool (i would get at least 10 or  15 SAS drives)

23 Posts

February 13th, 2014 12:00

Thanks Dynamox

11 Legend

 • 

20.4K Posts

 • 

87.4K Points

February 14th, 2014 06:00

if you have FAST license and running on flare 32, shouldn't it do in-tier FAST so you don't end up with graphs like yours ?

11 Legend

 • 

20.4K Posts

 • 

87.4K Points

February 14th, 2014 06:00

FAST does not move slices based on drive capacity, it moves based on slice performance statistics.

11 Legend

 • 

20.4K Posts

 • 

87.4K Points

February 14th, 2014 07:00

Agreed ..though doesn't look like fast it-tier is doing what it's supposed to do.

Sent from my Verizon Wireless 4G LTE smartphone

February 17th, 2014 10:00

I agree with what Kevin and Dynamox have said on this thread. LUN utilization is worth looking at but certainly doesn't tell the entire story. It simply indicates how frequently a LUN is being accessed but in and of itself cannot tell us if a LUN is having a problem.

The fact that you are seeing 10k SAS drives at 300-350 IOPS definitely indicates an overload on those drives. It would be interesting to see how many IOPS the EFDs are seeing in that same pool. Looking at performance in a mixed tier pool can be complicated at times but it does sound like that pool may be in need of some additional performance tier (SAS) drives to help handle that load.

If you are indeed seeing performance issues on those LUNs you may want to gather some nar files and open a case with EMC support for review and recommendations.

No Events found!

Top