Circuit Switching for libp2p, also known as TURN or Relay in Networking literature.
- Overview
- Dramatization
- Addressing
- Wire protocol
- Interfaces
- Implementation Details
- Removing existing relay protocol
The circuit relay is a means to establish connectivity between libp2p nodes (e.g. IPFS nodes) that wouldn't otherwise be able to establish a direct connection to each other.
Relay is needed in situations where nodes are behind NAT, reverse proxies, firewalls and/or simply don't support the same transports (e.g. go-ipfs vs. browser-ipfs). Even though libp2p has modules for NAT traversal (go-libp2p-nat), piercing through NATs isn't always an option. The circuit relay protocol exists to overcome those scenarios.
Unlike a transparent tunnel, where a libp2p peer would just proxy a communication stream to a destination (the destination being unaware of the original source), a circuit relay makes the destination aware of the original source and the circuit followed to establish communication between the two. This provides the destination side with full knowledge of the circuit which, if needed, could be rebuilt in the opposite direction.
Apart from that, this relayed connection behaves just like a regular connection would, but over an existing swarm stream with another peer (instead of e.g. TCP). A node asks a relay node to connect to another node on its behalf. The relay node short-circuits streams between the two nodes, enabling them to reach each other.
Relayed connections are end-to-end encrypted just like regular connections.
The circuit relay is both a tunneled transport and a mounted swarm protocol. The transport is the means of establishing and accepting connections, and the swarm protocol is the means to relaying connections.
+-----+ /ip4/.../tcp/.../ws/p2p/QmRelay +-------+ /ip4/.../tcp/.../p2p/QmTwo +-----+
|QmOne| <------------------------------------>|QmRelay|<----------------------------------->|QmTwo|
+-----+ (/libp2p/relay/circuit multistream) +-------+ (/libp2p/relay/circuit multistream) +-----+
^ +-----+ ^
| /p2p-circuit/QmTwo | | |
+-----------------------------------------+ +-----------------------------------------+
Notes for the reader:
- We're using the
/p2p
multiaddr protocol instead of/ipfs
in this document./ipfs
is currently the canonical way of addressing a libp2p or IPFS node, but given the growing non-IPFS usage of libp2p, we'll migrate to using/p2p
.
Cast:
- QmOne, the dialing node (browser).
- QmTwo, the listening node (go-ipfs).
- QmRelay, a node which speaks the circuit relay protocol (go-ipfs or js-ipfs).
Scene 1:
- QmOne wants to connect to QmTwo, and through peer routing has acquired a set of addresses of QmTwo.
- QmTwo doesn't support any of the transports used by QmOne.
- Awkward silence.
Scene 2:
- All three nodes have learned to speak the
/ipfs/relay/circuit
protocol. - QmRelay is configured to allow relaying connections between other nodes.
- QmOne is configured to use QmRelay for relaying.
- QmOne automatically added
/p2p-circuit/p2p/QmTwo
to its set of QmTwo addresses. - QmOne tries to connect via relaying, because it shares this transport with QmTwo.
- A lively and prolonged dialogue ensues.
/p2p-circuit
multiaddrs don't carry any meaning of their own. They need to encapsulate a /p2p
address, or be encapsulated in a /p2p
address, or both.
As with all other multiaddrs, encapsulation of different protocols determines which metaphorical tubes to connect to each other.
A /p2p-circuit
circuit address, is formated following:
[<relay peer multiaddr>]/p2p-circuit/<destination peer multiaddr>
Examples:
/p2p-circuit/p2p/QmVT6GYwjeeAF5TR485Yc58S3xRF5EFsZ5YAF4VcP3URHt
- Arbitrary relay node/ip4/127.0.0.1/tcp/5002/p2p/QmdPU7PfRyKehdrP5A3WqmjyD6bhVpU1mLGKppa2FjGDjZ/p2p-circuit/p2p/QmVT6GYwjeeAF5TR485Yc58S3xRF5EFsZ5YAF4VcP3URHt
- Specific relay node
This opens the room for multiple hop relay, where the second relay is encapsulated in the first relay multiaddr, such as:
<1st relay>/p2p-circuit/<2nd relay>/p2p-circuit/<dst multiaddr>
A few examples:
Using any relay available:
/p2p-circuit/p2p/QmTwo
- Dial QmTwo, through any available relay node (or find one node that can relay).
- The relay node will use peer routing to find an address for QmTwo if it doesn't have a direct connection.
/p2p-circuit/ip4/../tcp/../p2p/QmTwo
- Dial QmTwo, through any available relay node, but force the relay node to use the encapsulated
/ip4
multiaddr for connecting to QmTwo.
- Dial QmTwo, through any available relay node, but force the relay node to use the encapsulated
Specify a relay:
/p2p/QmRelay/p2p-circuit/p2p/QmTwo
- Dial QmTwo, through QmRelay.
- Use peer routing to find an address for QmRelay.
- The relay node will also use peer routing, to find an address for QmTwo.
/ip4/../tcp/../p2p/QmRelay/p2p-circuit/p2p/QmTwo
- Dial QmTwo, through QmRelay.
- Includes info for connecting to QmRelay.
- The relay node will use peer routing to find an address for QmTwo.
Double relay:
/p2p-circuit/p2p/QmTwo/p2p-circuit/p2p/QmThree
- Dial QmThree, through a relayed connection to QmTwo.
- The relay nodes will use peer routing to find an address for QmTwo and QmThree.
- We'll not support nested relayed connections for now, see Future Work section.
We start the description of the Wire format by illustrating a possible flow scenario and then describing them in detail by phases.
Every message in the relay protocol uses the following protobuf:
message CircuitRelay {
enum Status {
SUCCESS = 100;
HOP_SRC_ADDR_TOO_LONG = 220;
HOP_DST_ADDR_TOO_LONG = 221;
HOP_SRC_MULTIADDR_INVALID = 250;
HOP_DST_MULTIADDR_INVALID = 251;
HOP_NO_CONN_TO_DST = 260;
HOP_CANT_DIAL_DST = 261;
HOP_CANT_OPEN_DST_STREAM = 262;
HOP_CANT_SPEAK_RELAY = 270;
HOP_CANT_RELAY_TO_SELF = 280;
HOP_BACKOFF = 290;
STOP_SRC_ADDR_TOO_LONG = 320;
STOP_DST_ADDR_TOO_LONG = 321;
STOP_SRC_MULTIADDR_INVALID = 350;
STOP_DST_MULTIADDR_INVALID = 351;
STOP_RELAY_REFUSED = 390;
MALFORMED_MESSAGE = 400;
}
enum Type { // RPC identifier, either HOP, STOP or STATUS
HOP = 1;
STOP = 2;
STATUS = 3;
CAN_HOP = 4; // is peer a relay?
}
message Peer {
required bytes id = 1; // peer id
repeated bytes addrs = 2; // peer's known addresses
}
optional Type type = 1; // Type of the message
optional Peer srcPeer = 2; // srcPeer and dstPeer are used when Type is HOP or STOP
optional Peer dstPeer = 3;
optional Status code = 4; // Status code, used when Type is STATUS
}
Setup:
- Peers involved, A, B, R
- A wants to connect to B, but needs to relay through R
Assumptions:
- A has connection to R, R has connection to B
Events:
- phase I: Open a request for a relayed stream (A to R).
- A dials a new stream
sAR
to R using protocol/libp2p/circuit/relay/0.1.0
. - A sends a CircuitRelay message with
{ type: 'HOP', srcPeer: '/p2p/QmA', dstPeer: '/p2p/QmB' }
to R throughsAR
. - R receives stream
sAR
and reads the message from it.
- A dials a new stream
- phase II: Open a stream to be relayed (R to B).
- R opens a new stream
sRB
to B using protocol/libp2p/circuit/relay/0.1.0
. - R sends a CircuitRelay message with
{ type: 'STOP', srcPeer: '/p2p/QmA', dstPeer: '/p2p/QmB' }
onsRB
. - R sends a CircuitRelay message with
{ type: 'STATUS', code: 'OK' }
onsAR
.
- R opens a new stream
- phase III: Streams are piped together, establishing a circuit
- B receives stream
sRB
and reads the message from it - B sends a CircuitRelay message with
{ type: 'STATUS', code: 'OK' }
onsRB
. - B passes stream to
NewConnHandler
to be handled like any other new incoming connection.
- B receives stream
- We've defined a max length for the multiaddrs of arbitrarily 1024 bytes
- Multiaddrs are transfered on its binary packed format
- Peer Ids are transfered on its non base encoded format (aka byte array containing the multihash of the Public Key).
This is a table of status codes and sample messages that may occur during a relay setup. Codes in the 200 range are returned by the relay node. Codes in the 300 range are returned by the destination node.
Code | Message | Meaning |
---|---|---|
100 | OK | Relay was setup correctly |
220 | "src address too long" | |
221 | "dst address too long" | |
250 | "failed to parse src addr: no such protocol ipfs" | The <src> multiaddr in the header was invalid |
251 | "failed to parse dst addr: no such protocol ipfs" | The <dst> multiaddr in the header was invalid |
260 | "passive relay has no connection to dst" | |
261 | "active relay couldn't dial to dst: conn refused" | relay could not form new connection to target peer |
262 | "couldn't' dial to dst" | relay has conn to dst, but failed to open a stream |
270 | "dst does not support relay" | |
280 | "can't relay to itself" | The relay got its own address as destination |
290 | "temporary backoff" | The relay wants us to backoff and try again later |
320 | "src address too long" | |
321 | "dst address too long" | |
350 | "failed to parse src addr" | src multiaddr in the header was invalid |
351 | "failed to parse dst addr" | dst multiaddr in the header was invalid |
390 | "connection refused by stop endpoint" | The stop endpoint couldn't accept the connection |
400 | "malformed message" | A malformed or too long message was received |
These are go-ipfs specific
As explained above, the relay is both a transport (tpt.Transport
) and a mounted stream protocol (p2pnet.StreamHandler
). In addition it provides a means of specifying relay nodes to listen/dial through.
Note: the usage of p2pnet.StreamHandler is a little bit off, but it gets the point across.
import (
tpt "github.com/libp2p/go-libp2p-transport"
p2phost "github.com/libp2p/go-libp2p-host"
p2pnet "github.com/libp2p/go-libp2p-net"
p2proto "github.com/libp2p/go-libp2p-protocol"
)
const ID p2proto.ID = "/libp2p/circuit/relay/0.1.0"
type CircuitRelay interface {
tpt.Transport
p2pnet.StreamHandler
EnableRelaying(enabled bool)
}
fund NewCircuitRelay(h p2phost.Host)
Note that there is an existing swarm protocol colloqiually called relay. It lives in the go-libp2p package and is named /ipfs/relay/line/0.1.0
.
- Introduced in ipfs/kubo#478 (28-Dec-2014).
- No changes except for ipfs/go-ipfs@de50b2156299829c000b8d2df493b4c46e3f24e9.
- Changed to use multistream muxer.
- Shortcomings
- No end-to-end encryption.
- No rate limiting (DoS by resource exhaustion).
- Doesn't verify src id in ReadHeader(), easy to fix.
- Capable of accepting connections, and relaying connections.
- Not capable of connecting via relaying.
Since the existing protocol is incomplete, insecure, and certainly not used, we can safely remove it.
We have considered more features but won't be adding them on the first iteration of Circuit Relay, the features are:
- Multihop relay - With this specification, we are only enabling single hop relays to exist. Multihop relay will come at a later stage as Packet Switching.
- Relay discovery mechanism - At the moment we're not including a mechanism for discovering relay nodes. For the time being, they should be configured statically.