Randomness and Clustering

opacey · November 16, 2017, 1:16pm

Hi all, does anyone have information on what factors influenced the decision on how much redundancy gets built into network resources and storage? I was considering the nature of randomness and how elements of a truly random set will typically exhibit clustering over the dimensions of the set. Wikipedia has a good explanation…

https://en.wikipedia.org/wiki/Clustering_illusion

The illusion here is that the data isn’t truly random but in fact clustering does occur and to avoid it the distribution would have to be artificially managed and thus be LESS random.

As I understand it in SAFENet XOR is used to assign responsibility of data chunks to farmers in order to achieve random distribution (but remain deterministic). I’d guess that the number of copies should be set high enough to keep the probability of accidental clustering ultra low? Haven’t looked into it more yet but will add anything useful I find here.

Topic		Replies	Views
I feel like I should know this but I don't. How does farming and XOR Distance relationship work? Features	8	974	March 9, 2018
Data distribution related traffic taking into account the day-night-cycle Beginners	11	1154	May 30, 2015
Open questions on the security properties offered by the XOR Space closeness relationship Development	0	804	April 20, 2015
Can someone clarify please...? Features	7	867	February 29, 2016
Network efficiency and performance improvement algorithm proposal Beginners	25	756	June 3, 2021

Randomness and Clustering

Related Topics