Fleming Testnet v6 Release - *OFFLINE*

If it happened even before split, that is sad.
Will see…

1 Like

Problem Nodes

I have two nodes which can’t join, one is mobile b/b and the other behind a router. Both fail almost immediately with the following error (nothing about IDG):

[sn_node] ERROR 2021-06-18T14:15:40.176609615+01:00 [src/bin/sn_node.rs:149] Cannot start node due to error: Routing(CannotConnectEndpoint { err: UnresolvedPublicIp }). If this is the first node on the network pass the local address to be used using --first. Exiting

Fixed IP Node

I have one cloud node with fixed IP which also repeatedly fails after a handful of attempts to connect:

[sn_node] INFO 2021-06-18T17:00:36.214840736+02:00 [src/bin/sn_node.rs:124] The network is not accepting nodes right now. Retrying after 3 minutes
[sn_node] ERROR 2021-06-18T17:06:36.219511294+02:00 [src/bin/sn_node.rs:143] Encountered a timeout while trying to join the network. Please try again later.

After the please try again later the node terminates without any indication this has happened so the above message could be made clearer and it isn’t clear why the node is exiting - presumably it is not getting a response from the network, but terminating seems undesirable.

Client

Using as client I had no problems creating test coins or downloading the MaidSafe image, both very snappy. Uploading was also much improved for me, previous to testnet v6 upload was often failing or taking a very long time but now I can upload a set of files recursively in 35 mins, total 11 MB.

Update

Following reports above that the network is dead I re-checked the following and can still download the MaidSafe image:

time safe cat safe://hygoyeyx768nkst7qqjjk8khjm67tr1n4otbctcts5j7dten43zae3ga83y > ~/safe-the-planet-t6a.png

real	0m3.430s
user	0m0.269s
sys	0m0.148s
5 Likes

We have some in place here, but along with making uploads much less network intensive I suspect download may already be good. The num clients per section can be huge, they all jump on to store as much as possible right now. So we are seeing the worst speeds right now. Solving upload speed will actually help downloads a lot IMO. We will soon see though

9 Likes

That is a catch all error that says this node cannot join as nobody can connect to it.

@yogesh has a fix incoming for this one. We are filtering the Join requests and should not be. So they are dropped silently.

Nice to hear.

5 Likes

Worth noting that I had errors with certain contacts from the config.
Some kick me out with IGD moaning, while others put me in line to join.

3 Likes

Is this still the case?

There is a bl**dy timeout in sn_client that will be gone by Monday :wink: Gotta love love love timeouts and magic numbers. So at busy times, you will think the network is not slow, but dead. We did a foot shoot putting the timeout in there. Good thing though is we know every single time we put in a timeout in an eventually consistent network it will break it in such fun-loving ways :angry:

6 Likes

Oh, that’s a new thing. Can you grab a log if at all possible?

2 Likes

For me, yes. With 1.flac.

If node fails to answer to a simple request for 3 minutes, then something much worse than just timeout happens.

1 Like

I started getting Unfortunately we are unable to establish a connection to your machine messages by the way.
Is it related to timeouts somehow?

where would it be?
using sn_node --public-addr <public ip>:<some port> --local-addr <lan IP>:<same port> --hard-coded-contacts '["xxx.xx.xxx.xxx:xxxxx"]' it does not appear to log anything. (where should I look?)
I can only see the standard responses from the terminal, unable to establish a connection , try again later or trying again in 3 minutes

1 Like

~/.safe/node/local-node/sn_node_rCURRENT.log

1 Like

Nothing there.
I only find the log there when I use safe node join.

1 Like

@Josh try running again with RUST_LOG=sn_client=trace,qp2p=debug in front, so:

RUST_LOG=sn_client=trace,qp2p=debug sn_node --public-addr <public ip>:<some port> --local-addr <lan IP>:<same port> --hard-coded-contacts '["xxx.xx.xxx.xxx:xxxxx"]'

(setting the log level may depend on OS - that should work on Linux)

2 Likes

How will we know when network splits - if no one can join :sweat_smile::sweat_smile:

2 Likes

From the logs, the network has split once so far :slight_smile:

8 Likes

Im still trying I have a virtual box set up and have port forwarding set up in router and vertual box but no dice so far is it tcp or udp i need to enable ?

$HOME/.safe/node/sn_node --public-addr xx.xx.xx.xx:4449 --local-addr 10.0.2.15:4449 --hard-coded-contacts=[\"46.101.59.28:12000\"]

Cannot start node due to error: Routing(CannotConnectEndpoint { err: Connection(TimedOut) }). If this is the first node on the network pass the local address to be used using --first. Exiting
[sn_node] ERROR 2021-06-18T17:10:30.564201784+01:00 [src/bin/sn_node.rs:149] Cannot start node due to error: Routing(CannotConnectEndpoint { err: Connection(TimedOut) }). If this is the first node on the network pass the local address to be used using --first. Exiting

I get he following:

sascha@Knut:~$ time safe cat safe://hy8ayqyp5z55pu9a7dur9mntgznm7jtihjk4se1pxqy3qg5gknc8fbgiior > Safe_Put_Cat/Waterfall_slo_mo.mp4

[qp2p::connections] ERROR 2021-06-18T18:42:25.576021087 [/home/runner/.cargo/registry/src/github.com-1ecc6299db9ec823/qp2p-0.12.3/src/connections.rs:235] Failed reading from a uni-stream for peer 188.166.150.89:49415 with error: StreamRead(ReadError(ConnectionClosed(Reset)))

[sn_client::connections::messaging] ERROR 2021-06-18T18:44:10.822428102 [/home/runner/.cargo/registry/src/github.com-1ecc6299db9ec823/sn_client-0.61.1/src/connections/messaging.rs:439] Timeout while waiting for response to client request w/ id 88f06b4d..: Elapsed(())

Error: NetDataError: Failed to GET Public Blob: SelfEncryption(Storage("Generic error(Failed to obtain any response)"))

real	4m24,309s
user	0m10,897s
sys	0m8,828s
2 Likes

Try 206.189.22.30:49067 as your hard-coded contact. The 46.101.59.28:12000 in the instructions didn’t work for me.

1 Like

Same here:

time safe cat safe://hy8ayqyp5z55pu9a7dur9mntgznm7jtihjk4se1pxqy3qg5gknc8fbgiior > Waterfall_slo_mo.mp4
Error: NetDataError: Failed to GET Public Blob: SelfEncryption(Storage("Generic error(Failed to obtain any response)"))

real	3m2.680s
user	0m7.333s
sys	0m5.291s
1 Like

It’s UDP for Safe.

2 Likes