Unsolved
This post is more than 5 years old
31 Posts
0
9012
March 21st, 2014 05:00
Questions on DataDomain and DD Boost
Hello,
Our backup environment is tape-only at the moment. We are strongly encouraged to get rid of D2T backups and designate DataDomain as a primary backup target for all backups regardless of their nature. Tape would be used for cloning savesets from DataDomain for long-term retention. On top of that, we are strongly advised to get rid of incremental backups and use only full backups. Finally they insist on using Direct Client whenever possible.
I have a lot of words of criticism om this approach, but first I'd wish to know your thoughts on this "solution" to get if your counter-arguments match mine.
No Events found!


ble1
6 Operator
•
14.4K Posts
•
56.2K Points
1
March 21st, 2014 05:00
I still use incremental backups - nothing wrong there and I still see no point of replacing incrementals with fulls when it comes to file system backup. Client direct is really nice - it comes with price tag, but I can surely see benefits.
LaBounty1
10 Posts
0
March 21st, 2014 13:00
Client direct really sped up our backups, by eliminating the bottleneck thru the storage nodes.
We run FULLs on Friday, INCR the rest of the week....all to Data Domain. Keep 30 days of it all on DD and clone only the FULLs to tape, for long term retention.
Bancal
31 Posts
0
March 25th, 2014 22:00
My concerns about this solution are below:
The downsides of full backups are:
a) Long backup window. Backup window of full backup is much longer than one of incremental backup.
b) Negative performance impact on applications because full backups may not be completed till business hours.
c) LAN bandwidth is not unlimited: backups might take longer than expected due to network bottlenecks.
ble1
6 Operator
•
14.4K Posts
•
56.2K Points
2
March 26th, 2014 04:00
By reading what you said, well - what you copy and pasted as I see you have messed up order list - I assume you have 2 sides shouting to your ear "pick me, pick me!" and you do not know what to do at this point. Do a test. Get DD box, test it and see how it works for you. If you like it, buy it. If not, don't. I can't say what you have in your environment so any statement like "this is best for you" would be wrong, but generally you can't go wrong with de-dupe appliance and DD is really best in that area. And being integrated with NW comes as really nice bonus.
Bancal
31 Posts
0
March 26th, 2014 08:00
Actually, I'm the author of this document you criticised so vigourously. And yes, it is full of fears on designating DD as primary backup target. Please get me right, I'm not against DD, but it is unreasonable to use it as a backup target for all clients. I'd rather put archive logs on some AFTDs, despite of hampering dedupe ratio.
Turning to you questions,
1. No data classification is not implemented yet. I'm in the process of in-home audit + we asked for external audit from EMC.
Our environment is rather modest: we do about 250 Tb a month, 2000 backup sessions a day, all backups are D2T, Volume of incremental backup is about a tenths of volume of full backups. Most of storage nodes are Oracle servers. Nothing peculiar.
I'm just anxious about putting all backups to a smart applience I don't completely srust.
2. Don't know that BBB means in this context. Apparently, we don't use that.
3. Ok, we'll direct one of our "fat cats" to DD and see.
4. So you mean that rehidration overhead may be ignored when restoring/ cloning from DD.
5. OK, Iknow that DD Boost is built-into NW client code. But from functional standpoint it is a pluging that may be activated or deactivated. I wonder if you used Client Direct option extensively. EMC ads on youtube EMC NetWorker 8.0 Client Direct Demo - YouTube are promising, but I wonder how long would it take to dedupe 100000 files on the client side? Plus, EMC states Ditect Client options requires that both NW Client and DD be on the same network. Is that IP-network (VLAN). So if my environment is chopped with firewalls, I'll have install dedicated backup interfaces, use 802.1q etc.
ble1
6 Operator
•
14.4K Posts
•
56.2K Points
0
March 26th, 2014 13:00
I really don't see why? I have around 30000 sessions of archive logs daily (SQL, SAP and Oracle). When I compare previous ratio I had with VTL without de-dupe and what I have now (using DDBoost) I see benefit there. And this is without latest DDOS which brought some enhancements to Oracle multiplexing (which I also tested during POC) and have not seen any issues to be concerned. I was also skeptic, but the best way to address your questions is to actually - test it yourself.
LaBounty1
10 Posts
0
March 26th, 2014 14:00
I haven't read any discussion from the "restore" perspective.
Comparing restores from DD vs tape....you get to eliminate the time and hastle of waiting for the tape to come back from offsite storage, as well as faulty tape and tape drive issues
We're using approx 200TB of the total capacity of our DD. We're getting 10X deduplication. That equates to approx 2PB of backed up data.
Or....2,500 LTO4 tapes!
Another consideration....
Using the DD via a CIFS/NFS mount point to store SQL/RMAN generated copies of databases. We all know the DBAs will keep multiple copies or their DBs. Why store those copies on primary storage? DD space is cheaper PLUS you get to take advantage of dedupe*.
*Important note - make sure the DBA's don't compress their backups before writing to DD.
Bancal
31 Posts
0
March 26th, 2014 23:00
1. Actually I had IBM Protectier with dedupe option in another company I used to work. Basically we put on МЕД something like /, /opt, /var and drive C:| backups that is the sort of data that pudupes well. As for putting archive logs on DD, we'll give it a try, but for me the best destination for logs is AFTDs.
2. Or, I see. We don't use block-based backups here, but I'm quite familiar with this technique (NetBackup FlashBackup).
3. Ok, so use clone-controlled replication aka optimized duplication where NW server controls and DD does the real job. Now we are strongly encouraged to do cloning from DD to tape for long term retention and perform that cloning in the daytime. That is why I'm so concerned about negative effect of such cloning. Another issue I see now is how Client-direct backups and CIFS/NFS backups are cloned to tape. I know that DD might emulate VTL and see no problem with cloning from virtual tape to physical, but how client-direct backups are cloned to tape?
4. So having DD and Client direct-enabled client in different VLANs is not an issue. I suppose I will have to open some TCP port in order to enable Direct client to work?
ble1
6 Operator
•
14.4K Posts
•
56.2K Points
0
March 27th, 2014 02:00
Bancal
31 Posts
0
April 8th, 2014 09:00
Hello,
To those guys who clone from DD to tape. I have a volume of backups that should be cloned from DD to tape on the daily basis and cloning window (at the daytime). Hence, I have calculated cloning throughput. Now I want to know how many tape drives I need to clone all the data. I would be nice if someone share their cloning speed (per session) from DD to tape.
ble1
6 Operator
•
14.4K Posts
•
56.2K Points
0
April 8th, 2014 10:00
This will depend on DD model, which disks you use and which throughput DD can give you. In some instances it may depends on number of disk shelves you have as there is sweet spot there too sometimes. So, there is no straight forward answer unless you can provide those details plus what volume you wish to move over. Cloning is no different that restoring for DD in respect to speed (when writing to tape) so your question can be answered by anyone who ever did such measurement.
bingo.1
2.4K Posts
0
April 8th, 2014 14:00
To give you an idea how we do it ...
We actually clone fulls which were created during the first week of each month.
We have sorted this data to go to appropiate DDBoost devices. We have
2 SNs, each running
4 DDBoost devices (for the no-clone data pool)
6 DDBoost devices (for the clone data pool)
6 LTO5 tape drives (one for each DDBoost device)
Since we implemented DDBoost it seems that we can achieve the max data rate for the tape drives (about 150..160MB/s).
And we still run backups during the day.
DeaconZ28-2015
252 Posts
0
May 6th, 2014 05:00
LaBounty
You said "DD space is cheaper PLUS you get to take advantage of dedupe*". In my experience, ES30 shelf is twice as expensive as a VNX shelf of the same disks. And now they appear to be the same hardware. I know this because I just priced one for my DD670 and the cost was insanely high, so much so that we are looking at another solution.