Hash generation / duplicate detection

kugtre · April 1, 2014, 10:04am

Hi,

I can’t find any information on how duplicates are detected.

Do you use only a hash value for duplicate detection (and if so, what happens when two different files produce the same hash?), or do you combine the hash with additional information such as size and filename? (See http://preshing.com/20110504/hash-collision-probabilities/ for collision probabilities.)
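
To make the question concrete, here is a rough Python sketch of the kind of multi-stage check I have in mind: group files by size, hash only the files that share a size, then confirm each hash match with a byte-by-byte comparison so that a collision cannot produce a false duplicate. This is only my assumption of how such a process could work, not a description of what the software actually does; the names `file_digest` and `find_duplicates` and the choice of SHA-256 are mine.

```python
import filecmp
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Return lists of paths under root whose contents are byte-identical."""
    # Stage 1: files of different sizes can never be duplicates.
    by_size = defaultdict(list)
    for p in Path(root).rglob("*"):
        if p.is_file():
            by_size[p.stat().st_size].append(p)

    # Stage 2: hash only files that share their size with another file.
    by_key = defaultdict(list)
    for size, paths in by_size.items():
        if len(paths) > 1:
            for p in paths:
                by_key[(size, file_digest(p))].append(p)

    # Stage 3: confirm hash matches byte by byte to rule out collisions.
    groups = []
    for paths in by_key.values():
        remaining = paths
        while len(remaining) > 1:
            first, rest = remaining[0], remaining[1:]
            same = [first] + [
                p for p in rest if filecmp.cmp(first, p, shallow=False)
            ]
            if len(same) > 1:
                groups.append(same)
            remaining = [p for p in rest if p not in same]
    return groups
```

In this sketch the filename plays no role, since two files with different names can still have identical contents. Whether the real implementation stops after the hash stage or also does the final comparison is exactly what I would like to know.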

Please explain the detailed process of finding duplicates.

Thank you very much.