It is not easy to find out which data is on the Avamar server, because all data is deduplicated. In my experience, a good starting point is the DPN Summary report: export it to an Excel file, then use Excel's sorting and filtering to find which clients contribute the most new data.
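If you would rather script this than sort in Excel, here is a minimal sketch that totals new data per client from the report saved as CSV. The column names `Host` and `ModSent` are my assumptions about the export headers, so check them against your own file and adjust the parameters if they differ. The function name `top_contributors` is just something I made up for illustration.

```python
import csv
import io
from collections import defaultdict


def top_contributors(csv_file, client_col="Host", new_bytes_col="ModSent", n=10):
    """Return the n clients with the largest total in new_bytes_col.

    client_col and new_bytes_col are assumed header names; change them
    to match the headers in your exported report.
    """
    totals = defaultdict(int)
    for row in csv.DictReader(csv_file):
        # Excel exports may include thousands separators or blank cells.
        value = (row.get(new_bytes_col) or "").replace(",", "").strip()
        if not value:
            continue
        totals[row[client_col]] += int(float(value))
    # Largest contributors first.
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)[:n]


if __name__ == "__main__":
    # Made-up sample rows, only to show the expected input shape.
    sample = io.StringIO("Host,ModSent\nclientA,100\nclientB,900\nclientA,400\n")
    for host, sent in top_contributors(sample):
        print(host, sent)
```

For a real export, open the file with `open("dpn_summary.csv", newline="")` (a hypothetical file name) and pass the file object to the function instead of the sample.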
There are other possible causes as well. For example, garbage collection (GC) may not be cleaning up old data. We will need to run a health check on this grid.