Next step of safecoin algorithm design

dirvine · September 9, 2018, 2:08pm

It is However if you look at it like this. A section cannot be overtaken or the design should ensure this is the case, then it is no problem. Then we look at how a section can be under threat and how the neighbors can mitigate this and it becomes more interesting. Also if the client manager group for that particular wallet was also able to prove the account is valid (i.e. all last transactions (from) are maintained) then it adds more checking as per the data type.

This part is very interesting as we have always said coins should be secured every bit as much as data and if we have secured the data, then we will have secured the coins. So for data we do an extra step, we store the identifier in a data chain. If we did similar with wallet balances so they are secured and private then we may be able to say we have secured client manager account info and can make safecoin a balance item in those client managers, who must all agree.

It is quite subtle but the deeper you look the more likely it is to be solid. When you think of it, a put balance is held by client managers and if you could control that you could put a massive put balance in client accounts illegally.

oetyng · September 9, 2018, 2:20pm

This part.
If it can be done with put balance, it can be done with safecoin.
In the spirit of minimising complexity, it seems to me that the inherent problems of that approach, would be needing a less complicated solution than to solve divisibility with data item coins for example.

It was a rather elegant way to control issuance though. Any ideas as to how that could be kept, or would it be scrapped altogether?

oetyng · September 9, 2018, 2:31pm

dirvine:

What would be really nice is a way to have accounts very simple to set up, some kind of credit for storing stuff that is not junk, or perhaps some human provable interactions, like surveys or something. None of these are simple, but its always a thought.

In terms of vaults, it would be brilliant to allow them to just start automatically and earn quickly, I would think this will be possible though, if such vaults were handling lots of client interaction, getting old data that may be missing from the section, or confirming all members have the data they should and so on. I see these actions as value and therefor should be rewarded. So starting a vault that perhaps does not really join, but does a load of work for say 4 hours can earn enough safecoin to create a single account perhaps.

I was actually thinking very similar things when waking up this morning, and pondering interesting things in the bed

With probabilistic issuance of a whole coin, based on work, it’s going to take a long while for a new (and small) vault, to see results. (Well, maybe that part is up for change now?)
Optimal for uptake and gaining popularity, would be that they can join and do work and see results fast.

Maybe if the section would hold a pool from which they pay newcomers, who later return this as they start to earn more. That way we don’t need to involve this special initial case in the safecoin algo, but rather as a part of section joining algo.
Didn’t get past the normal objections of the susceptibility of gaming it and so on though, before going up and filling head with other things.

As I have mentioned before, it might well be that the very initial stage (of a network, or a vault…) needs another tooling,than the later. I think of it as leaving atmosphere. That first part is very different from the very very long travel that then goes on in space. Different conditions, needing different solutions.

JoeSmithJr · September 9, 2018, 2:51pm

That would make me sad because “coin as entity” is one of the primary differentiators of Safecoin, the property that makes it more like cash and less like a bank account. Whether this is an important property is a matter of another discussion but my personal opinion is that yes it is.

I surveyed the forum for proposals that used entity based coins with denominations:

YASDI - Bitwise Denominations – prefix based binary division (@mav, May 2017)
Yet Another Coin Idea – an overly complicated precursor of the above (@Tim87, Jan 2017)
Yet Another Safecoin Divisibility idea (YASDI) - Network has a wallet - Decimal coins – decimal divisibility (@wes, April 2017)
Safecoin divisibility - #304 by norimi – non-binary 1-2-5 denominations (@norimi, Dec 2017)
Is quasi-infinite divisibility useful? – short and meta (@jlpell, Jan 2018)
Safecoin Denominations – honorable mention as it doesn’t propose any solution just sets a direction (@betterthantrav, March 2015)

All in all, 2017 was a good year for coin denominations. Maybe it’s time to pick up pace again.

To maintain balance (such a clever pun!) here are two proposals that involve account balances:

Idea for divisible Safecoins (@polpolrene, Oct 2016)
Proposal for SAFEcoin division - read datastructure topic first - please discuss (@neo, Jan 2017)

jlpell · September 9, 2018, 3:00pm

How would it be any different than the situation where all vaults are of a fixed known size? Currently, once a vault is full and a new chunk arrives does it just fail? Who gets the new chunks to maintain 8 copies if that happens? It seems like the simplest way to manage things is to just redirect the chunk to a nearest neighbor in XOR. Sounds rather similar to our indirection defense we discussed a few months ago. So, intuitively it would seem that a redirect from the vault that is full to the vault that it next closest to the address would take care of things.

I don’t know the real answer to this so I’m just thinking out loud, it would be good to get some input/clarification from the experts.

JoeSmithJr · September 9, 2018, 3:06pm

Or, much simpler, the XOR distance can be divided by the size of the vault and then each vault will get the right number of chunks. If the section knew the size of each vault, that would simplify this and other things.

draw · September 9, 2018, 3:57pm

I assume a pre-(RFC) will follow in due time about this ‘safecoin as data items, but as integers in the client managers’-proposal, if it remains a valid idea of course.
One thing I’m curious about is how will be checked if the maximum number of Safecoins has been reached.

tfa · September 9, 2018, 4:39pm

And, to emphasize the implied scope, this includes not just communications with nodes in the current section but also with nodes in the neighboring sections, so a lot of nodes.

dirvine · September 9, 2018, 5:25pm

That can still be achieved by an array that is 32bits long, this array could split as sections split losing a leading bit at a time but then the whole array is section prefix + what is left of the array. A section would then be in charge of that part of the address space and can be queried if there are any spaces (0’s) left in the array. This is way oversimplified, but you get the idea. It’s just an array the sections need to be aware of and they will be given the array as current and they will populate or delete items (safecoins) from that array.

There are a few things can be done here to limit a sections ability to create coins etc. but also the neighbors will likely be able to also be aware of each neighbor’s array and which coins are farmed.

Even with safecoins as just integers or similar, they can still have an address if we wish. That allows further checking, but may not be required.

draw · September 9, 2018, 7:17pm

We’ll see, the devil is in the detail.
I suspect a ‘Client-Managers’-section/group can divide Safecoins the way it wants like this, if I understand correctly.
And reorder Safecoins with a section split if necessary.
You better don’t have a client with a lot of Safecoins. If a section becomes so small (after a lot of splits) that not all Safecoins of such a client with a big wallet fits in the new, smaller address space, you have to ‘split’ the client as well?
Or have a maximum number of Safecoins that 1 client can have.
Edit: or I misunderstand and the Safecoin integers are in the client managers, but the arrays are not.

dirvine · September 9, 2018, 9:52pm

This is OK, the safecoins will be from all over the network, but the array would hold what are available and what are not, if that makes sense?

Both would be, one represents the client balance and the array is only the safecoin used/available in that address range.

/brainstorm : remember : Just in case

neo · September 10, 2018, 12:06am

The issue that @happybeing post also brings up is an APP that *spends all* the users account balance. Once the APP has permission to PUT data then effectively the limit to the number of PUTs is the person’s account balance. With safecoin then when the current PUT balance is used up then permission has to be gained to spend another coin to get the PUT balance back again.

I know I heard about this idea a few months ago, but just thought when you said the balance is held in the account data (client manager), what happens to having multiple wallet IDs that an account could have if it was MDs for safecoins. Would you store this balance with the ID key pair in the account data, thus allowing multiple balance IDs (Wallet IDs)

I do agree with you here. It certainly a differentiator and nice to have a cash like quality

Transaction load was always the problem. And the ideas suggested for storing multiple coins (actual splitting of coin) in the one MD will mean huge transaction loads after a few years since it is nearly impossible to unsplit coins since so many people have “shares” in the one coin. These are basic problems that cannot be solved by different ways to do it, if you want micro payments (in fiat terms)

Here is another idea.

The safecoins are still MDs always owned by the section that looks after it.
Each coin has 10,000 fields. Thus the coin can be split into 10,000 parts of varying amounts. All parts add up to one safecoin
The fields hold the fraction (decimal format) of the coin that a person has
The wallet data structure now has coin address and field number for each portion the user has.
when payment is made it can be any amount of a coin and the appropriate PUT balance is given.
When payment is made the section will take the amount and add to the free/unallocated field.
the scarcity factor can use the address generated and allocate the remaining (unallocated) amount in the coin as the reward success. It is more likely to be a success too, just often less than a full safecoin worth.
sending 2.45632453434 safecoin is now easy since fields are used and if needed a field can be split into two fields.
There is still only 2^32 coins (MDs)
The wallet can be designed so that small values are recombined where possible. Payments to the network will be recombining within the coin whenever a sub coin value is spent. EDIT: it may even be desirable to have an API that allows the user to “spend” from one coin MD to another so that the receiving coin has the user’s two small parts of a coin combined.

Effectively there is 2^32 * 10,000 discrete amounts available and all discrete values are 1 coin or less. This allow micro, nano, even “nano nano” transactions. This solves the chicken-n-egg issue since a Android or IPhone APP can be created to accept fiat for very small coin amounts.

EDIT: also gifting enough for a person to create an account is not such an issue since its likely to thousand’s or much less of a full coin.

EDIT2: It would be possible using this to have reward amounts as say 1/10000 (or whatever fraction) of a safecoin and to give out rewards 10000 times more often.

And this requires very little change to how safecoins were envisioned to operate.

mav · September 10, 2018, 4:15am

Exploring ‘variation’ with respect to setting targets…

The network is designed to spread load out evenly (load is mainly storage and bandwidth). This is a natural consequence of using a hash to locate vaults and chunks on the network.

In an ideal world the distribution of data is perfectly equal. Every section stores and delivers exactly the same number of chunks as every other section.

However this isn’t the case in reality. It’s important to consider because the idea of ‘stress’ needs to relate back to the reality of ‘what is normal’ and ‘what is beyond normal’.

Using an example, storing 64K chunks across 64 sections, ‘normal’ would ideally be 1K chunks in each section.

But in reality the distribution is quite uneven (tested by scraping news.ycombinator.com and hashing 64K posts to get chunk names).

With 64 sections the smallest section stored 912 chunks and the largest stored 1080 chunks. The standard deviation for storage was 29.1. 90% of vaults stored between 950 and 1044 chunks. 50% stored between 984 and 1019.

So the variation (in this case) is 1000 chunks ± 8.8%. I didn’t expect there to be that much variation.

The distribution changes depending on the size of the network and the total number of chunks, but is always naturally bounded by the equality of hashing.

At what point is the network considered stressed? Or is variation simply not important when considering stress?

Variation in chunk count is just one aspect of overall variation on the network. We should also ask What is the degree of variation to be expected and accepted for:

supply of bandwidth (depends on ISPs?)
supply of storage (depends on laptop specs vs datacenter specs?)
demand for upload (depends on default smartphone camera resolution?)
demand for download (depends on … what factors … meme trends?!)
inter-vault latency (depends on geographical distribution?)
inter-section latency (depends on consensus speed?)
vaults per section (depends on xor names of vaults?)

Ideally the network algorithms manage all these fluctuations by an inherently clever design. But understanding the boundaries between normal vs stressful variations might be important. This post is a very basic start at trying to understand the magnitudes of normality.

To clarify my point about ‘variation’ within the context of the OP, if an attacker can cause a stressful fluctuation it should not preclude future participation of normal users (ie there should be a ‘return to normal’) otherwise it could cause irreversible exclusion / centralization. What is the ‘normal’ we are returning to? Is it a target value? Or is it simply ‘the balance’? How long does it take? Why? Seriously tough questions to answer…

There’s no way to avoid the network needing to operate within broad ranges of storage capacities, moderately slow and very fast bandwidths, sub millisecond to hundreds of millisecond latencies, etc… which is somewhat at odds with the ‘equal’ nature of XOR-space design and consensus design. What is the lowest acceptable common denominator and what is the impact? If we leave it entirely up to ‘the balance’ I fear vaults will become exclusive very rapidly.

Can we take parameter design out of our hands and automate them? I think so. But first we probably need some manual guidance that can steer the design of the automation. I’m sitting here feeling ‘this is damn tricky stuff’!

I can certainly see app developers saying “screw it we’ll just pay the safecoin for our customer uploads ourselves and put it in the ‘costs’ column” so the customers don’t have to engage in safecoin to get started. It’d be good to try and avoid that dynamic if possible. But it’s going to be a pretty tempting path for app developers I think. Mechanisms to avoid it (like what you suggest) are really interesting.

Yes the chunk just fails to store (to my knowledge). There’s no redirection of chunks. NotEnoughSpace error in vault:ChunkStore is a starting point down the rabbit hole, I can’t find the handler in routing but anyways that’s my understanding…

I imagine if vaults are full at a fixed size then the section adjusts itself to allow more vaults to join so the chunks are more thinly distributed.

neo · September 10, 2018, 4:27am

I guess the first attempt is to do a merge, since the most likely reason for no spare space is not enough nodes and probably needs to merge with another section.

Maybe we need a “help” message that a section can send its neighbours for a node to be relocated to that section. Then it might gain a few nodes.

Obviously these things should be done well in advance of critical shortage of space.

Of course the current proposed mechanism is to up the price of PUTs for storing in that section that is running low of spare space. Which is supposed to slow down people wanting to store files at that time.

JoeSmithJr · September 10, 2018, 4:03pm

It’s a multinomial distribution so the variance for each “bucket” is np(1-p), in this case 64000(1/64)(1-1/64)=984.375. The standard deviation (the square root of the variance) is about 31.375 which is very close to the 29.1 your measured. It’s also about 3.1375% of the number of chunks in a section.

In the more generic case (and unless I made a mistake), the ratio of the standard deviation and the average number of chunks in a section is sqrt(1-1/number_of_sections)/sqrt(average_number_of_chunks_in_a_section) and this converges to 1/sqrt(average_number_of_chunks_in_a_section) as the number of sections grow large, which is what we expect for the Safe Network.

If a chunk is 1MB, a vault stores 64 GB, and a section consists of 16 vaults (sorry if the numbers are off), then we have 1 Mi chunks in a section on average, so the standard deviation shrinks to about 0.1% of the section size. That’s not a bad number.

buttler654 · September 10, 2018, 11:32pm

Thanks for sharing this post

jlpell · September 11, 2018, 12:01am

But wouldn’t that mean that the after the new vaults join the section all the vaults would need to transfer chunks around to make sure the closest vault store the right chunks?

My impression is that as of right now when a vault fills up and throws an out of space error, it will churn. Simple and effective. This approach in of itself would appear to incentivise vault operators to start new vaults with as large capacity as possible to give them the best chance to become an elder. Not sure if this balances the needs/limitations of mobile users though…

Again, I’m just guessing. Experts?

neo · September 11, 2018, 12:03am

This was my impression too. Basically the request for a chunk goes to the section and the section either knows which vaults have the chunk or they request from all vaults the chunk and the ones with it respond.

mav · September 11, 2018, 1:50am

The current discussed options for handling the stress of low storage:

merge the section
churn (kill? relocate?) the vault with low storage
raise prices
allow more new vaults to join

Merging the section is quite a complex and stressful activity (much more than splitting). I would say using a merge to solve the problem is too hefty an action for the small magnitude of the problem. Merge should primarily be used to protect consensus, and simpler techniques should be used to manage storage stress.

Churning the vault with low storage is mildly troubling because it reduces the total storage of the section. It should be fine so long as other vaults in the section have a lot of spare space, but it seems intuitive that removing the vault adds stress in order to achieve the positive outcome of balancing the network. It’s not a bad solution but it also seems in the basket of short-term-pain for long-term-gain.

Raising prices is treating the symptom not the cause. It might slow the uploads and give the network some more time to resolve the issue, but raising prices does not itself resolve the stress. It’s a good tactic but imo only a short-term solution to employ while the slower and more difficult activities that remove the stress are happening in the background.

Adding new vaults is a slow process but it does solve the underlying issue of needing more storage. Of the solutions listed it’s the only one that allows all participants to end up net positive - cheap prices, more opportunity for rewards, larger network, more participants.

All the options are viable responses to the stress of low storage. However some are more appropriate than others depending on the prevailing circumstances, and hopefully the algorithm can use the appropriate response in right circumstances. How do we (ie the network algorithm) judge the differences?

neo · September 11, 2018, 2:14am

Remember that any action should be occurring long before critical point.

Also I remember that a vault is considered full long before all its space is used up to allow for merges and churns. It was at one point 1/2 used space was considered full

Any action has to be occurring long before any issues which allows the actions to work without needing to be super fast. So a churn event while stressful is being done while things are good still, but approaching an issue. Price rises begin long before space is an issue thus slowing down things and price has to only one of the measures.

Topic		Replies	Views
Current unsolved SAFE questions Beginners	21	2438	May 5, 2019
Is farming viable? Safe-Node	65	3683	July 22, 2019
Fraser's Safecoin Alternative Design (Postponed State) RFCs	162	5858	October 1, 2018
Vault size = usage? answer: usage (upload) = safecoin = GB of vault * time Beginners	7	584	May 26, 2019
What algorithm defines the value of vaults? Development	1	741	October 5, 2015

Next step of safecoin algorithm design

Related Topics