Building an effective deduplication index in memory can reduce the number of disk accesses and speed up chunk fingerprint lookup, but doing so is a major challenge for deduplication algorithms in massive data environments. Because deduplication data sets contain many highly similar samples, a deduplication algorithm based on the condensed nearest neighbor rule is proposed.
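To illustrate the idea of condensing a set of highly similar samples, the following is a minimal sketch of the generic condensed nearest neighbor rule (Hart's rule), not the deduplication algorithm itself. The text does not say how chunks or fingerprints are turned into feature vectors and labels, so the `Sample` layout and the names `condense`, `nearest_label`, and `squared_distance` are assumptions made only for this example.

```python
from typing import List, Sequence, Tuple

# Hypothetical sample layout: (feature vector, class label).
Sample = Tuple[Tuple[float, ...], str]


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_label(store: List[Sample], point: Sequence[float]) -> str:
    """Label of the stored sample closest to `point` (1-NN)."""
    return min(store, key=lambda s: squared_distance(s[0], point))[1]


def condense(samples: List[Sample]) -> List[Sample]:
    """Return a subset that classifies every input sample correctly by 1-NN."""
    if not samples:
        return []
    store = [samples[0]]  # seed the condensed set with the first sample
    changed = True
    while changed:  # repeat passes until no sample is added
        changed = False
        for sample in samples:
            if sample in store:
                continue
            # Keep only samples the current condensed set misclassifies.
            if nearest_label(store, sample[0]) != sample[1]:
                store.append(sample)
                changed = True
    return store
```

When many samples are near-duplicates of one another, most of them are already classified correctly by a stored neighbor and are therefore left out, which is why the condensed set can be much smaller than the input and cheaper to keep in memory.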