April 15th, 2011 06:00

Maximising performance of time-based query

Greetings gentlemen,

I'm embarking on a project to migrate all data from a Centera cluster to A.N.Other storage medium, the first stage of which is simply to get a list of all C-Clips on the cluster (starting at time zero). I'm keen to do this in as efficient a manner as possible, especially after reading the API docs, which suggest that I could typically expect 100 results a second; for 100 million C-Clips that would take about 11.5 days.
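For reference, the duration estimate can be checked with a few lines of arithmetic. This is only a back-of-the-envelope sketch: the helper name `listing_days` is made up for illustration, and it assumes the query sustains a constant rate with no pauses or retries, which a real cluster may not.

```python
def listing_days(clip_count: int, results_per_second: float) -> float:
    """Days needed to enumerate clip_count C-Clips at a steady query rate.

    Assumes a constant rate for the whole run (hypothetical simplification).
    """
    seconds = clip_count / results_per_second
    return seconds / 86_400  # 86,400 seconds in a day


if __name__ == "__main__":
    # 100 million C-Clips at the 100 results/second quoted from the API docs
    print(f"{listing_days(100_000_000, 100):.1f} days")  # prints "11.6 days"
```

Because the result scales inversely with the rate, any speed-up in the query translates directly into a proportionally shorter listing run.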

So my question is simply this - if I just format my FPQueryExpression using setStartTime(0) and setType(FP_QUERY_TYPE_EXISTING), is this more efficient than if I also select particular fields using selectField()?

It seems possible to me that using selectField() might force the SDK to actually read the CDF into memory. Or does it do that anyway?

Any help appreciated,

Geoff.

April 18th, 2011 02:00

Answering my own question - it seems that querying runs about 20x faster if I don't use selectField().

April 25th, 2011 05:00

Hello Geoff -

My company is an EMC Velocity Partner that has developed a solution for this storage migration scenario which includes compliance support, full reporting and chain-of-custody documentation.  You can check us out at http://www.interlock-tech.com.

Best Regards,

Mike Horgan

Interlock Technology
