Unsolved
This post is more than 5 years old
1 Message
0
741
March 3rd, 2010 07:00
Celerra File Level Reporting
We have ~25TB of data on our EMC Celerra NAS, it consists of millions of small files. I'd like to be able to scan the file system on a regular basis (live would be better) import that data to a SQL DB and run custom queries on the data for things such as trend analysis, storage usage charge backs, file types, duplicate files, etc.
We have a unique situation as this storage is used for our client data within our own application. The data is stored in directories for each client and a generic user ID is assigned to each client, this user authenticates to our application not AD in anyway. The reason I mention this is because the products we've seen on the market rely on AD to report on the data, great for a corporate file server, but not for us.
What I am really looking for is a scanning engine or a command line script/command I can run against the data movers to populate a SQL DB with file information. Then I could query this DB to understand growth trends per client, client storage usage (ie. duplicate files, large files, etc.)
Is anyone aware of a way to accomplish this from the Celerra CLI?
Is there a good file level scanning engine on the market that can handle millions of small files? (Products that have been tried to date fail due to the amount of data, not to mention they run for days)
0 events found


p033692
6 Posts
0
September 25th, 2013 07:00
Hello, did you ever find a solution to this? I'm trying to do roughly the same thing. Most info I've found on Powerlink or in the forums has talked about using FLR or VisualSRM or StorageScope via Control Center. I've done this in the past, but a couple years ago I moved from a DMX to a VMAX and in the process moved from ECC to SMC for management, so I don't have StorageScope any longer (I don't think). Thanks for any feedback.
john_grantman
3 Posts
0
May 8th, 2014 22:00
Have you looked at the query commands? Some CLI commands can have tags added and query the nas database and bring back resulted that could easily be redirected to a CSV file for easy import to database.
nas_fs -query:tags
nas_server -query:tags
nas_slice -query:tags
nas_disk -query:tags
If you type in the commands at CLI you will get full syntax.
It took me quite a bit of tweaking but I was able to get good information about file systems, quotas... etc. Have not tried to pull back full crawl of file system files but though this might help.
If you work out something useful, post the results.
Thanks,
John Grantman
Service & Operations Manager
Publicis Groupe (Re:Sources IT)