ipfs-cluster

Author	SHA1	Message	Date
Hector Sanjuan	10fa7a13b5	Allow selecting pintracker with ipfs-cluster-service License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-08-14 19:57:14 +02:00
Hector Sanjuan	87f4fcf958	Make the adder modules ipld.DAGService modules. This removes a bunch of the channel dance and block forwarding by having the adder submodules be DAGServices themselves and take Add() directly from the ipfsAdder. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-08-07 20:12:05 +02:00
Hector Sanjuan	ffd36afbb6	fix a few TODOs and FIXMEs License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-08-07 20:12:05 +02:00
Hector Sanjuan	a96241941e	WIP License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-08-07 20:12:05 +02:00
Wyatt Daviau	504d08d06c	Addressing first round of comments License: MIT Signed-off-by: Wyatt Daviau <wdaviau@cs.stanford.edu>	2018-08-07 20:11:23 +02:00
Wyatt Daviau	4a5a613d94	rpc call and config added License: MIT Signed-off-by: Wyatt Daviau <wdaviau@cs.stanford.edu>	2018-08-07 20:11:23 +02:00
Hector Sanjuan	bfdd59735c	Fix #240 : Add test flags. Remove build tags for logging control. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-01-16 09:28:40 +01:00
Hector Sanjuan	2a6f91cd35	Fix #240 : Avoid duplicated list of facilities This puts some sanity in this. It's not super correct (name of facilities depend of the component and the main cluster component should not hard code them), but it's clear enough. Imho, better than over-engineering a more elegant approach. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-01-15 16:50:40 +01:00
Hector Sanjuan	6a243df4da	api: Support /ws/ and /dns/ multiaddresses parsing. Log all errors The multiaddresses protocols for websockets and dns are only registered with init() function when loading the modules. ipfs-cluster-ctl uses just the api, which did not load these modules so converting from serialized types caused bad panics. We have also ignored errors in the api library under the thinking that it would only parse things serialized by us, but this has made parsing errors to go unnoticed. From now, all errors are logged and some precautions are taking to better handle the possibility of nil objects. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2017-12-08 01:05:49 +01:00
Wyatt	47b744f1c0	ipfs-cluster-service state upgrade cli command ipfs-cluster-service now has a migration subcommand that upgrades persistant state snapshots with an out-of-date format version to the newest version of raft state. If all cluster members shutdown with consistent state, upgrade ipfs-cluster, and run the state upgrade command, the new version of cluster will be compatible with persistent storage. ipfs-cluster now validates its persistent state upon loading it and exits with a clear error in the case the state format version is not up to date. Raft snapshotting is enforced on all shutdowns and the json backup is no longer run. This commit makes use of recent changes to libp2p-raft allowing raft states to implement their own marshaling strategies. Now mapstate handles the logic for its (de)serialization. In the interest of supporting various potential upgrade formats the state serialization begins with a varint (right now one byte) describing the version. Some go tests are modified and a go test is added to cover new ipfs-cluster raft snapshot reading functions. Sharness tests are added to cover the state upgrade command.	2017-11-28 22:35:48 -05:00
Hector Sanjuan	2bbbea79cc	Issue #49 : Add disk informer The disk informer uses "ipfs repo stat" to fetch the RepoSize value and uses it as a metric. The numpinalloc allocator is now a generalized ascendalloc which sorts metrics in ascending order and return the ones with lowest values. License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2017-03-27 20:40:49 +02:00
Hector Sanjuan	03a931b8df	Fix logging License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2017-03-14 17:32:00 +01:00
Hector Sanjuan	c2faf48177	Issue #18 : Move Consensus and PeerMonitor to its own submodules License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2017-03-13 18:40:35 +01:00
Hector Sanjuan	9deb56b762	Improve raft log forwarder It was very idiotic poll a buffer/read it and then write rather than just have a custom writer. License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2017-02-09 22:31:04 +01:00
Hector Sanjuan	34fdc329fc	Fix #24 : Auto-join and auto-leave operations for Cluster This is the third implementation attempt. This time, rather than broadcasting PeerAdd/Join requests to the whole cluster, we use the consensus log to broadcast new peers joining. This makes it easier to recover from errors and to know who exactly is member of a cluster and who is not. The consensus is, after all, meant to agree on things, and the list of cluster peers is something everyone has to agree on. Raft itself uses a special log operation to maintain the peer set. The tests are almost unchanged from the previous attempts so it should be the same, except it doesn't seem possible to bootstrap a bunch of nodes at the same time using different bootstrap nodes. It works when using the same. I'm not sure this worked before either, but the code is simpler than recursively contacting peers, and scales better for larger clusters. Nodes have to be careful about joining clusters while keeping the state from a different cluster (disjoint logs). This may cause problems with Raft. License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2017-02-07 18:46:09 +01:00
Hector Sanjuan	89ecc1ce89	Encapsulate Raft functions better and simplify the Consensus component License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2017-02-02 13:51:49 +01:00
Hector Sanjuan	6c18c02106	Issue #10 : peers/add and peers/rm feature + tests This commit adds PeerAdd() and PeerRemove() endpoints, CLI support, tests. Peer management is a delicate issue because of how the consensus works underneath and the places that need to track such peers. When adding a peer the procedure is as follows: * Try to open a connection to the new peer and abort if not reachable * Broadcast a PeerManagerAddPeer operation which tells all cluster members to add the new Peer. The Raft leader will add it to Raft's peerset and the multiaddress will be saved in the ClusterPeers configuration key. * If the above fails because some cluster node is not responding, broadcast a PeerRemove() and try to undo any damage. * If the broadcast succeeds, send our ClusterPeers to the new Peer along with the local multiaddress we are using in the connection opened in the first step (that is the multiaddress through which the other peer can reach us) * The new peer updates its configuration with the new list and joins the consensus License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2017-02-02 13:51:49 +01:00

17 Commits