ipfs-cluster

Author	SHA1	Message	Date
Hector Sanjuan	9330ac82e2	Fix tests with latest libp2p License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-10-25 14:55:01 +02:00
Hector Sanjuan	cbdd075bb2	Merge pull request #551 from ipfs/fix/minor-version-compat Allow running peers with different cluster versions in the same cluster	2018-09-27 08:13:53 +02:00
Hector Sanjuan	59714f69d4	Allow running peers with different cluster versions in the same cluster This patch modifies the RPC protocol tag to use Major and Minor parts of the version and not all of it. This means all peers on the 0.5.x can run in the same cluster. As cluster has become more mature and I see less risks in letting peers from similar versions run together. This is useful when upgrading too. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-09-27 07:35:57 +02:00
Kishan Sagathiya	b76b65e446	Issue #446 Adding peername to PinInfo Removed unnecessary peername assignment Modified tests according to the changes made to add peername to PinInfo License: MIT Signed-off-by: Kishan Mohanbhai Sagathiya <kishansagathiya@gmail.com>	2018-09-24 23:04:16 +05:30
Hector Sanjuan	5bbc699bb4	Issue #340 : Fix some data races Unfortunately, there are still some data races in yamux https://github.com/libp2p/go-libp2p/issues/396 so we can't enable this by default. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-08-15 12:27:01 +02:00
Hector Sanjuan	b8194143c2	Test with stateless tracker. Make map tracker default for tests. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-08-14 19:18:26 +02:00
Adrian Lanzafame	33f56e8867	stateless: reduce num of methods License: MIT Signed-off-by: Adrian Lanzafame <adrianlanzafame92@gmail.com>	2018-08-14 13:33:39 +02:00
Adrian Lanzafame	df2753dfc6	implements a stateless pintracker Also updates to the optracker to make retrieving information easier. License: MIT Signed-off-by: Adrian Lanzafame <adrianlanzafame92@gmail.com>	2018-08-14 13:33:36 +02:00
Hector Sanjuan	e2e84dfad1	Remove some commented code. Addressing a couple of comments. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-08-07 20:12:05 +02:00
Hector Sanjuan	6b9000fe7b	Fix broken test License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-08-07 20:12:05 +02:00
Hector Sanjuan	623120fd50	Start cluster tests License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-08-07 20:12:05 +02:00
Hector Sanjuan	65dc17a78b	testfixing License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-08-07 20:12:05 +02:00
Wyatt Daviau	2bc295ca9f	address comments: rebase, cleanup and separate tests, AddParams type replaces map of strings License: MIT Signed-off-by: Wyatt Daviau <wdaviau@cs.stanford.edu>	2018-08-07 20:11:24 +02:00
Wyatt Daviau	fabf191883	AddFile tests tests cover local and sharded adds of files ipfs mock and ipfs block put/get calls cleaned up License: MIT Signed-off-by: Wyatt Daviau <wdaviau@cs.stanford.edu>	2018-08-07 20:11:24 +02:00
Wyatt Daviau	238f3726f3	Pin datastructure updated to support sharding 4 PinTypes specify how CID is pinned Changes to Pin and Unpin to handle different PinTypes Tests for different PinTypes Migration for new state format using new Pin datastructures Visibility of the PinTypes used internally limited by default License: MIT Signed-off-by: Wyatt Daviau <wdaviau@cs.stanford.edu>	2018-08-07 20:11:23 +02:00
Wyatt Daviau	9f74f6f47d	Addressing third round of comments License: MIT Signed-off-by: Wyatt Daviau <wdaviau@cs.stanford.edu>	2018-08-07 20:11:23 +02:00
Wyatt Daviau	edb38b2830	fixing make check errors License: MIT Signed-off-by: Wyatt Daviau <wdaviau@cs.stanford.edu>	2018-08-07 20:11:23 +02:00
Hector Sanjuan	01f7a9e4e8	Fix: maptracker race issues This commit attempts to fix race issues in the maptracker since the introduction of the OperationTracker. There were two main problems: * Duplicity tracking the state both in the state map and the opTracker * Non atomiciy of operations with different threads being able to affect other threads operations. A test performing random Track/Untracks on the same Cid quickly showed that items would sometimes stay as pin_queued or pin_unqueued. That happened because operations could be cancelled under the hood by a different request, while leaving the map status untouched. It was not simply to deal with this issues without a refactoring. First, the state map has been removed, and the operation tracker now provides status information for any Cid. This implies that the tracker keeps all operations and operations have a `PhaseDone`. There's also a new `OperationRemote` type. Secondly, operations are only created in the tracker and can only be removed by their creators (they can be overwritten by other operations though). Operations cannot be accessed directly and modifications are limited to setting Error for PhaseDone operations. After created, *Operations are queued in the pinWorker queues which handle any status updates. This means, that, even when an operation has been removed from the tracker, status updates will not interfere with any other newer operations. In the maptracker, only the Unpin worker Cleans operations once processed. A sucessful unpin is the only way that a delete() happens in the tracker map. Otherwise, operations stay there until a newer operation for the Cid arrives and 1) cancels the existing one 2) takes its place. The tracker refuses to create a new operation if a similar "ongoing" operation of the same type exists. The final change is that Recover and RecoverAll() are not async and play by the same rules as Track() and Untrack(), queueing the items to be recovered. Note: for stateless pintracker, the tracker will need to Clean() operation of type OperationPin as well, and complement the Status reported by the tracker with those coming from IPFS. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-05-28 11:59:26 +02:00
Hector Sanjuan	73aabfa8ec	Fix #408 : Race condition when StateSync runs When StateSync() runs and triggers Untrack() on items that have just been removed from the state but on which Untrack() is underway, the operation tracker would be reset to phase queued and in some cases stay so. Also happened for Track() This caused failures of TestClustersPin as SyncStatus() is triggered regularly while Tracks() and Untracks() happen. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-05-12 10:49:45 +02:00
Hector Sanjuan	5ca8ca39eb	Monitor/tests: Allow to run tests using the basic monitor. Do it in additional stage in Travis. Also, test fixes. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-05-09 11:39:21 +02:00
Hector Sanjuan	bb8c20b2fb	Enable pubsubmon in cluster e2e tests License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-05-07 18:47:05 +02:00
Hector Sanjuan	029cd77c27	Merge pull request #398 from ipfs/feat/promote-consensus Emancipate the consensus component	2018-05-07 08:29:14 +02:00
Hector Sanjuan	33d9cdd3c4	Feat: emancipate Consensus from the Cluster component This commit promotes the Consensus component (and Raft) to become a fully independent thing like other components, passed to NewCluster during initialization. Cluster (main component) no longer creates the consensus layer internally. This has triggered a number of breaking changes that I will explain below. Motivation: Future work will require the possibility of running Cluster with a consensus layer that is not Raft. The "consensus" layer is in charge of maintaining two things: * The current cluster peerset, as required by the implementation * The current cluster pinset (shared state) While the pinset maintenance has always been in the consensus layer, the peerset maintenance was handled by the main component (starting by the "peers" key in the configuration) AND the Raft component (internally) and this generated lots of confusion: if the user edited the peers in the configuration they would be greeted with an error. The bootstrap process (adding a peer to an existing cluster) and configuration key also complicated many things, since the main component did it, but only when the consensus was initialized and in single peer mode. In all this we also mixed the peerstore (list of peer addresses in the libp2p host) with the peerset, when they need not to be linked. By initializing the consensus layer before calling NewCluster, all the difficulties in maintaining the current implementation in the same way have come to light. Thus, the following changes have been introduced: * Remove "peers" and "bootstrap" keys from the configuration: we no longer edit or save the configuration files. This was a very bad practice, requiring write permissions by the process to the file containing the private key and additionally made things like Puppet deployments of cluster difficult as configuration would mutate from its initial version. Needless to say all the maintenance associated to making sure peers and bootstrap had correct values when peers are bootstrapped or removed. A loud and detailed error message has been added when staring cluster with an old config, along with instructions on how to move forward. * Introduce a PeerstoreFile ("peerstore") which stores peer addresses: in ipfs, the peerstore is not persisted because it can be re-built from the network bootstrappers and the DHT. Cluster should probably also allow discoverability of peers addresses (when not bootstrapping, as in that case we have it), but in the meantime, we will read and persist the peerstore addresses for cluster peers in this file, different from the configuration. Note that dns multiaddresses are now fully supported and no IPs are saved when we have DNS multiaddresses for a peer. * The former "peer_manager" code is now a pstoremgr module, providing utilities to parse, add, list and generally maintain the libp2p host peerstore, including operations on the PeerstoreFile. This "pstoremgr" can now also be extended to perform address autodiscovery and other things indepedently from Cluster. * Create and initialize Raft outside of the main Cluster component: since we can now launch Raft independently from Cluster, we have more degrees of freedom. A new "staging" option when creating the object allows a raft peer to be launched in Staging mode, waiting to be added to a running consensus, and thus, not electing itself as leader or doing anything like we were doing before. This additionally allows us to track when the peer has become a Voter, which only happens when it's caught up with the state, something that was wonky previously. * The raft configuration now includes an InitPeerset key, which allows to provide a peerset for new peers and which is ignored when staging==true. The whole Raft initialization code is way cleaner and stronger now. * Cluster peer bootsrapping is now an ipfs-cluster-service feature. The --bootstrap flag works as before (additionally allowing comma-separated-list of entries). What bootstrap does, is to initialize Raft with staging == true, and then call Join in the main cluster component. Only when the Raft peer transitions to Voter, consensus becomes ready, and cluster becomes Ready. This is cleaner, works better and is less complex than before (supporting both flags and config values). We also backup and clean the state whenever we are boostrapping, automatically * ipfs-cluster-service no longer runs the daemon. Starting cluster needs now "ipfs-cluster-service daemon". The daemon specific flags (bootstrap, alloc) are now flags for the daemon subcommand. Here we mimic ipfs ("ipfs" does not start the daemon but print help) and pave the path for merging both service and ctl in the future. While this brings some breaking changes, it significantly reduces the complexity of the configuration, the code and most importantly, the documentation. It should be easier now to explain the user what is the right way to launch a cluster peer, and more difficult to make mistakes. As a side effect, the PR also: * Fixes #381 - peers with dynamic addresses * Fixes #371 - peers should be Raft configuration option * Fixes #378 - waitForUpdates may return before state fully synced * Fixes #235 - config option shadowing (no cfg saves, no need to shadow) License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-05-07 07:39:41 +02:00
Hector Sanjuan	e186dbe2c2	Undo extra delays License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-05-02 15:24:26 +02:00
Adrian Lanzafame	f68c7f5354	ipfshttp: add pin/unpin specific timeouts and get the tests passing and add Pin/UnpinQueued tracker statuses back in. License: MIT Signed-off-by: Adrian Lanzafame <adrianlanzafame92@gmail.com>	2018-05-02 15:24:26 +02:00
Sina Mahmoodi	03cc809708	config: Add log and testcase for disable_repinning * Test case creates a bunch of clusters, assigns a pin with replica factor of n-1 to them, and removes one of the peers randomly. It then tests to check that the number of clusters pinning the cid is n-2. * Add warn log to let user know that due to disable_repinning option, the cluster won't attempt to re-assign the pin. License: MIT Signed-off-by: Sina Mahmoodi <itz.s1na@gmail.com>	2018-04-23 22:01:52 +02:00
Hector Sanjuan	0069c0062f	Fix metric expire type. Do not discard metrics in Allocate(). License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-04-05 17:57:24 +02:00
Hector Sanjuan	f5f56f2d11	Add some clarifications about delays License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-04-05 16:49:26 +02:00
Hector Sanjuan	c73e540b7b	Pre-create and pre-connect hosts in tests License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-04-05 16:49:26 +02:00
Hector Sanjuan	dd4128affc	Fix #339 : Reduce Sleeps in tests License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-04-05 16:49:26 +02:00
Hector Sanjuan	fb4bda8880	restapi: Expose the http endpoints on libp2p This commits allows restapi to serve/tunnel http on a libp2p stream. NewWitHost(...) allows to provide a libp2p host during initialization which is then used to obtain a listener with go-libp2p-gostream. Alternatively, if the configuration provides an ID, PrivateKey and Libp2pListenAddr, a host is created directly by us and used to get the listener. The protocol tag used is provided by the p2phttp library which will be used by the client. All tests now run against the libp2p node too. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-03-15 17:14:22 +01:00
Hector Sanjuan	41b17bf477	Cluster: add libp2p host parameter to constructor. NewCluster() now takes an optional Host parameter. The rationale is to allow to re-use an existing libp2p Host when creating the cluster. The NewClusterHost method now allows to create a host with the options used by cluster. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-03-15 00:04:54 +01:00
Hector Sanjuan	a180f1a5c5	Merge pull request #291 from ipfs/feat/connectivity-graph Feat/connectivity graph	2018-01-26 12:42:18 +01:00
Hector Sanjuan	4d26550e5c	Address comments License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-01-25 23:34:06 +01:00
Hector Sanjuan	ddb5da18c9	Tests: Bind testing clusters on random port Jenkins likes this very much. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-01-24 20:16:55 +01:00
ZenGround0	4b26ccd144	Merge branch 'master' into feat/connectivity-graph	2018-01-23 08:34:43 -05:00
Wyatt Daviau	d2ef32f48f	Testing and polishing connection graph Added go tests Refactored cluster connect graph to new file Refactored dot file printing to new repo Fixed code climate issues Added sharness test License: MIT Signed-off-by: Wyatt Daviau <wdaviau@cs.stanford.edu>	2018-01-22 10:03:37 -05:00
Hector Sanjuan	dcfc962f24	Feat #277 : Address review comments License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-01-19 22:24:03 +01:00
Hector Sanjuan	b013850f94	Fear #277 : Add test about wanted < 0 License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-01-19 22:24:03 +01:00
Hector Sanjuan	3c4b2ae27e	Fix #277 : Add tests for replication_min/max License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-01-16 16:36:06 +01:00
Hector Sanjuan	4549282cba	Fix #277 : Introduce maximum and minimum replication factor This PR replaces ReplicationFactor with ReplicationFactorMax and ReplicationFactor min. This allows a CID to be pinned even though the desired replication factor (max) is not reached, and prevents triggering re-pinnings when the replication factor has not crossed the lower threshold (min). License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-01-16 16:36:06 +01:00
Hector Sanjuan	bfdd59735c	Fix #240 : Add test flags. Remove build tags for logging control. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-01-16 09:28:40 +01:00
Wyatt	fc237b21d4	Feat: Enable Jenkins builds This enables support for testing in jenkins. Several minor adjustments have been performed to improve the probability that the tests pass, but there are still some random problems appearing with libp2p conections not becoming available or stopping working (similar to travis, but perhaps more often). MacOS and Windows builds are broken in worse ways (those issues will need to be addressed in the future). Thanks to @zenground0 and @victorbjelkholm for support! License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-01-11 18:11:46 +01:00
Hector Sanjuan	1e87fccf0e	Merge pull request #257 from te0d/feat/add-peer-identifier Added Hostname Property to Configuration	2017-12-04 14:25:49 +01:00
Tom O'Donnell (te0d)	7d43be33a4	Added Peername Configuration Test and Renamed to Peername I've modified the peer identifier to be 'peername'. I've also modified the TestLoadJSON to check that it is correctly read from config and set to a default if empty. Also added 'peername' fields to configurations for various tests.	2017-12-01 13:50:13 -05:00
Hector Sanjuan	11a8926236	MapPinTracker: support configuration section This also generates a default configuration section when it doesn't exist, so it's backwards compatible. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2017-11-29 14:42:50 +01:00
Hector Sanjuan	7a5f8f184b	Issue #131 : Improvements adding and removing This works on remove+shutdown procedure and fixes a few small issues. License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2017-11-01 13:00:32 +01:00
Hector Sanjuan	8f06baa1bf	Issue #162 : Rework configuration format The following commit reimplements ipfs-cluster configuration under the following premises: * Each component is initialized with a configuration object defined by its module * Each component decides how the JSON representation of its configuration looks like * Each component parses and validates its own configuration * Each component exposes its own defaults * Component configurations are make the sections of a central JSON configuration file (which replaces the current JSON format) * Component configurations implement a common interface (config.ComponentConfig) with a set of common operations * The central configuration file is managed by a config.ConfigManager which: * Registers ComponentConfigs * Assigns the correspondent sections from the JSON file to each component and delegates the parsing * Delegates the JSON generation for each section * Can be notified when the configuration is updated and must be saved to disk The new service.json would then look as follows: ```json { "cluster": { "id": "QmTVW8NoRxC5wBhV7WtAYtRn7itipEESfozWN5KmXUQnk2", "private_key": "<...>", "secret": "00224102ae6aaf94f2606abf69a0e278251ecc1d64815b617ff19d6d2841f786", "peers": [], "bootstrap": [], "leave_on_shutdown": false, "listen_multiaddress": "/ip4/0.0.0.0/tcp/9096", "state_sync_interval": "1m0s", "ipfs_sync_interval": "2m10s", "replication_factor": -1, "monitor_ping_interval": "15s" }, "consensus": { "raft": { "heartbeat_timeout": "1s", "election_timeout": "1s", "commit_timeout": "50ms", "max_append_entries": 64, "trailing_logs": 10240, "snapshot_interval": "2m0s", "snapshot_threshold": 8192, "leader_lease_timeout": "500ms" } }, "api": { "restapi": { "listen_multiaddress": "/ip4/127.0.0.1/tcp/9094", "read_timeout": "30s", "read_header_timeout": "5s", "write_timeout": "1m0s", "idle_timeout": "2m0s" } }, "ipfs_connector": { "ipfshttp": { "proxy_listen_multiaddress": "/ip4/127.0.0.1/tcp/9095", "node_multiaddress": "/ip4/127.0.0.1/tcp/5001", "connect_swarms_delay": "7s", "proxy_read_timeout": "10m0s", "proxy_read_header_timeout": "5s", "proxy_write_timeout": "10m0s", "proxy_idle_timeout": "1m0s" } }, "monitor": { "monbasic": { "check_interval": "15s" } }, "informer": { "disk": { "metric_ttl": "30s", "metric_type": "freespace" }, "numpin": { "metric_ttl": "10s" } } } ``` This new format aims to be easily extensible per component. As such, it already surfaces quite a few new options which were hardcoded before. Additionally, since Go API have changed, some redundant methods have been removed and small refactoring has happened to take advantage of the new way. License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2017-10-18 00:00:12 +02:00
dgrisham	25a910faad	BasicAuth implementation -- CLI, server, and tests.	2017-10-14 15:55:21 -06:00
dgrisham	b1356cd33a	removing name arg from NewInformer	2017-09-01 15:02:15 -06:00

1 2

97 Commits