Please take a look at Craig Ball’s informative blog post on deduplication and my comment. Craig explains why deduplication isn’t more widely used even though it makes ediscovery faster and cheaper, and he describes the main methods of deduplication. I add my view that reasonable counsel should agree on a bilateral deduplication method before any deduplication is done, and I include a few considerations about how counsel should use proportionality to determine what method is appropriate under the circumstances.
Search All Posts
algorithm black box collaboration collection competence computational linguistics confidentiality cooperation corpus linguistics Craig Ball de-duplication deduplication document review ediscovery ediscovery industry ediscovery protocol ediscovery software email esi FRCP FRE Gordon V. Cormack hashing Jason R. Baron key phrases key players key words lexical litigation Maura R. Grossman metadata Paul Grimm predictive coding privilege project management proportionality search Sedona seed concept seed sets statistics teamwork technology-assisted review workflow work product