[pve-devel] has somebody already tested corosync3 alpha et new knet transport ?

Thomas Lamprecht t.lamprecht at proxmox.com
Wed Jun 27 08:44:51 CEST 2018


On 6/27/18 8:35 AM, Alexandre DERUMIER wrote:
>>> FYI, knet is a abstraction layer, it still uses udp (aka multicast) 
> 
> Are you sure ? 
> 

I was but now not anymore, no ^^ Need to take a look at the code...

> 
> http://build.clusterlabs.org/corosync/presentations/2017-Kronosnet-The-new-face-of-corosync-communications.pdf 
> 
> Currently supported
> ○ UDP (unicast)
> ○ SCTP (connection-oriented)
> ○ Loopback (for localhost only … obviously)
> ○ No multicast, but could be added if really wanted
> ○ No broadcast
> ■ We are no longer that insane
> 
> 
> so udp unicast and sctp.
> 
> 
> and 
> 
> http://people.redhat.com/ccaulfie/docs/KnetCorosync.pdf
> 
> 
> Other options in the interface section do just what you might expect.
> mcastport: <n> 
>   tells knet to use that port number <n> for communication,. The default remains the old one of 5405
>   +linknumber, but you can override it per link here. Even though knet doesn't do actual multicasting
>   the name remains for old time's sake
> 
> 
> If it's really working without multicast, with lower latencies, that's a big improvement :)

yes would be :)

> 
>>> Are there links to the presentation, could be interesting :) 
> I'll try to get it. But it's was more about casual consistency vs paxos.
> The guy is only begining to implement his container orchestrator (in rust )
> 

Thanks!

> 
> 
> ----- Mail original -----
> De: "Thomas Lamprecht" <t.lamprecht at proxmox.com>
> À: "pve-devel" <pve-devel at pve.proxmox.com>, "aderumier" <aderumier at odiso.com>
> Envoyé: Mercredi 27 Juin 2018 08:02:30
> Objet: Re: [pve-devel] has somebody already tested corosync3 alpha et new knet transport ?
> 
> Hi, 
> 
> On 6/26/18 10:54 PM, Alexandre DERUMIER wrote: 
>> I have found this presentation about coming corosync3 (seem to be alpha recently) 
>> http://build.clusterlabs.org/corosync/presentations/2017-Kronosnet-The-new-face-of-corosync-communications.pdf 
>> with the new kronosnet (knet) transport. 
>>
> 
> Yes, tracking it somewhat since over half a year, looks really 
> good on paper but didn't not have yet time to do much testing - 
> as it'd be PVE 6.X timeframe anyway. 
> 
>>
>> Latencies results are really impressive and no more multicast ! (users will be happy ;) 
> 
> FYI, knet is a abstraction layer, it still uses udp (aka multicast) 
> As you do not get to handle a lot of links with a lot of nodes without 
> multicast - i.e., multicast is a very good thing, even if some hosting 
> environments and switch default settings are against it :) 
> It can also uses SCTP as transport method, which is a layer 4 protocol, 
> on the same level as UDP or TCP - i.e., it's not encapsulated in those. 
> 
>> and a lot of others improvments (dynamic mtu, ifdown/iup without breaking cluster, and seem to be compatible with corosync2 (with udp, udpu transports) 
>>
>> I'm still looking to make bigger proxmox clusters in the future :) 
> 
> Yes, looks definitly nice and it's on our radar, I'll try to build 
> a corosync 3 package if got a bit time to spare. 
> 
>> BTW, I was at a kubernetes/container conference at Paris today, 
>> and a talk of a guy was about trying to create in own orchestrator instead kubernetes (because of problem with etcd, network lag brigging down k8s master,...), 
>> talking about clusters, paxos, strong consistency. 
>>
>> He's looking to use a causal consistency model instead strong consistency, I never heard about this, 
>> but this seem really great to be able to manage bigger cluster, and also geo clusters. 
> 
> You can do more in parallel with it. In strong consistency models all 
> events (for our case, write/read operations) are ensured to be ordered. 
> If node A sees write OP-A happen before write OP-B then this principle 
> guarantee that all other nodes see OP-B after OP-A. 
> Casual consistency does this too, but only if OP-A and OP-B are related, 
> i.e., they affect each other (like a write to the same file would). 
> 
> Are there links to the presentation, could be interesting :) 
> Seems they use a a protocol named "cure" for the update replication: 
> https://pages.lip6.fr/Marc.Shapiro/papers/Cure-final-ICDCS16.pdf 
> 
>> He have given a link to an opensource key value store using causal consistency, called "antidote" 
>> https://syncfree.github.io/antidote/ 
>>
>> Maybe for the future (proxmox 10 ;), it could be great to have this kind of model. 
>> (I'm not enough expert to say if it could work, and If it could be possible to reimplement pmxcfs with this kind of protocol, and manage others things like pve-crm/lrm) 
> 
> Hmm, an academic erlang project with a bit short whitepaper, 
> I'm a bit wary on such projects - but sounds definitively interesting. 
> 
> _______________________________________________
> pve-devel mailing list
> pve-devel at pve.proxmox.com
> https://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel
> 






More information about the pve-devel mailing list