ipfs-cluster

Author	SHA1	Message	Date
Hector Sanjuan	de40b2cd23	pintracker: metrics: convert pinning/queued/error metrics to gauges We were currently tracking this metrics as a counter (SUM). The number is good, but conceptually this is more a gauge (LastValue), given it can go down. Thus we switch it by tracking the aggregation numbers directy in the operation tracker.	2022-04-26 15:13:35 +02:00
Hector Sanjuan	3169fba9d1	metrics: track total pins, queued, pinning, pin error. This fixes #1470 and #1187.	2022-04-22 15:57:48 +02:00
Hector Sanjuan	a97ed10d0b	Adopt api.Cid type - replaces cid.Cid everwhere. This commit introduces an api.Cid type and replaces the usage of cid.Cid everywhere. The main motivation here is to override MarshalJSON so that Cids are JSON-ified as '"Qm...."' instead of '{ "/": "Qm....." }', as this "ipld" representation of IDs is horrible to work with, and our APIs are not issuing IPLD objects to start with. Unfortunately, there is no way to do this cleanly, and the best way is to just switch everything to our own type.	2022-04-07 14:27:39 +02:00
Hector Sanjuan	0d73d33ef5	Pintracker: streaming methods This commit continues the work of taking advantage of the streaming capabilities in go-libp2p-gorpc by improving the ipfsconnector and pintracker components. StatusAll and RecoverAll methods are now streaming methods, with the REST API output changing accordingly to produce a stream of GlobalPinInfos rather than a json array. pin/ls request to the ipfs daemon now use ?stream=true and avoid having to load the full pinset map on memory. StatusAllLocal and RecoverAllLocal requests to the pin tracker stream all the way and no longer store the full pinset, and the full PinInfo status slice before sending it out. We have additionally switched to a pattern where streaming methods receive the channel as an argument, allowing the caller to decide on whether to launch a goroutine, do buffering etc.	2022-03-22 15:38:01 +01:00
Hector Sanjuan	9b9d76f92d	Pinset streaming and method type revamp This commit introduces the new go-libp2p-gorpc streaming capabilities for Cluster. The main aim is to work towards heavily reducing memory usage when working with very large pinsets. As a side-effect, it takes the chance to revampt all types for all public methods so that pointers to static what should be static objects are not used anymore. This should heavily reduce heap allocations and GC activity. The main change is that state.List now returns a channel from which to read the pins, rather than pins being all loaded into a huge slice. Things reading pins have been all updated to iterate on the channel rather than on the slice. The full pinset is no longer fully loaded onto memory for things that run regularly like StateSync(). Additionally, the /allocations endpoint of the rest API no longer returns an array of pins, but rather streams json-encoded pin objects directly. This change has extended to the restapi client (which puts pins into a channel as they arrive) and to ipfs-cluster-ctl. There are still pending improvements like StatusAll() calls which should also stream responses, and specially BlockPut calls which should stream blocks directly into IPFS on a single call. These are coming up in future commits.	2022-03-19 03:02:55 +01:00
Hector Sanjuan	5e89c0ba41	Pintracker: set Name in operation tracker. Fixes #1212 .	2022-01-31 21:04:11 +01:00
Hector Sanjuan	809b7fbda5	Pintracker: add IPFS ID to Pin Information Fixes #1554 Fixes: peer names unset for remote peers This adds an IPFS field to pin status information (PinInfoShort). It has not been easy to add this, given that the IPFS ID is something that comes from outside of cluster (unlike the peer name). After several tries I have settled in the following things: - Use the ping metric to send out peer names and IPFS IDs to the peers in the cluster. - Cache the latest known IPFS ID (if IPFS dies we should still be setting the ID). - Provide an RPC method for the Pintracker to obtain IPFS ID from the cache. - Given we now know information for peernames and IPFS IDs from other peers, we can use that information even if the requests to them error or we are not contacting (i.e. peers allocated as remote are not queried for status). We can use the information from the last received ping metric. - This means we should keep metrics around even if peers go away, at least for a while rather than deleting them as soon as we detect that they have expired. Puting it all together we now have a system to gossip peer information around on top of the ping metrics.	2022-01-31 17:53:09 +01:00
Hector Sanjuan	c4ca5b7abe	pintracker: carry over attempt account only for operations of the same type	2021-11-30 04:20:35 +01:00
Hector Sanjuan	7cf40de354	pintracker: Fix attempt count not increasing. Expose priority queueing. "RetryCount" has been renamed to "AttemptCount", because it counts attempts.	2021-11-30 04:20:35 +01:00
Hector Sanjuan	29c277b67f	Pintracker: add and track retry counts in the operation manager. Report retry count in the PinStatus	2021-11-30 04:20:35 +01:00
Hector Sanjuan	88cfcf62fc	Pintracker: use cid.Cid as map keys. (#1322 ) * No reason not to. * Should save a non trivial amount of time when doing "Status" operation by not having to stringify Cids.	2021-03-16 15:47:44 +01:00
Hector Sanjuan	c026299b95	Include Name as GlobalPinInfo key and consolidate redundant keys GlobalPinInfo objects carried redundant information (Cid, Peer) that takes space and time to serialize. This has been addressed by having GlobalPinInfo embed PinInfoShort rather than PinInfo. This new types ommits redundant fields.	2020-05-16 02:27:24 +02:00
Hector Sanjuan	b3853caf36	Dependency ugprade: changes needed * Libp2p protectors no longer needed, use PSK directly * Generate cluster 32-byte secret here (helper gone from pnet) * Switch to go-log/v2 in all places * DHT bootstrapping not needed. Adjust DHT options for tests. * Do not rely on dissappeared CidToDsKey and DsKeyToCid functions fro dshelp. * Disable QUIC (does not support private networks) * Fix tests: autodiscovery started working properly	2020-03-22 14:50:25 +01:00
Kishan Sagathiya	5258a4d428	Remove map pintracker (#944 ) This removes mappintracker and sets stateless tracker as the default (and only) pintracker component. Because the stateless tracker matches the cluster state with only ongoing operations being kept on memory, and additional information provided by ipfs-pin-ls, syncing operations are not necessary. Therefore the Sync/SyncAll operations are removed cluster-wide.	2019-12-12 21:22:54 +01:00
Hector Sanjuan	b804e61ef0	Update deps along with go-libp2p-core refactor Lots of rewrites in imports...	2019-06-14 13:10:45 +02:00
Adrian Lanzafame	f1afce7644	add String method for Operation and OperationTracker types License: MIT Signed-off-by: Adrian Lanzafame <adrianlanzafame92@gmail.com>	2019-05-07 19:09:05 +10:00
Adrian Lanzafame	43fb2cf857	fix typo in comment License: MIT Signed-off-by: Adrian Lanzafame <adrianlanzafame92@gmail.com>	2019-05-07 12:01:20 +10:00
Hector Sanjuan	6447ea51d2	Remove *Serial types. Use pointers for all types. This takes advantange of the latest features in go-cid, peer.ID and go-multiaddr and makes the Go types serializable by default. This means we no longer need to copy between Pin <-> PinSerial, or ID <-> IDSerial etc. We can now efficiently binary-encode these types using short field keys and without parsing/stringifying (in many cases it just a cast). We still get the same json output as before (with minor modifications for Cids). This should greatly improve Cluster performance and memory usage when dealing with large collections of items. License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2019-02-27 17:04:35 +00:00
Adrian Lanzafame	3b3f786d68	add opencensus tracing and metrics This commit adds support for OpenCensus tracing and metrics collection. This required support for context.Context propogation throughout the cluster codebase, and in particular, the ipfscluster component interfaces. The tracing propogates across RPC and HTTP boundaries. The current default tracing backend is Jaeger. The metrics currently exports the metrics exposed by the opencensus http plugin as well as the pprof metrics to a prometheus endpoint for scraping. The current default metrics backend is Prometheus. Metrics are currently exposed by default due to low overhead, can be turned off if desired, whereas tracing is off by default as it has a much higher performance overhead, though the extent of the performance hit can be adjusted with smaller sampling rates. License: MIT Signed-off-by: Adrian Lanzafame <adrianlanzafame92@gmail.com>	2019-02-04 18:53:21 +10:00
Hector Sanjuan	74311c5969	Address review comments Remove unused code from tests License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-11-01 11:12:38 +01:00
Hector Sanjuan	a7787029cb	Fix #600 : Remote pins should not error License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-10-30 16:33:05 +01:00
Kishan Sagathiya	2cd4420ee8	Issue #446 Adding peername to PinInfo Fixing errors and better code placement License: MIT Signed-off-by: Kishan Mohanbhai Sagathiya <kishansagathiya@gmail.com>	2018-09-24 23:20:37 +05:30
Kishan Sagathiya	773b4de1f0	Issue #446 Adding peername to PinInfo Removed comments and code used for debugging License: MIT Signed-off-by: Kishan Mohanbhai Sagathiya <kishansagathiya@gmail.com>	2018-09-24 23:04:16 +05:30
Kishan Sagathiya	ef85ba8780	Issue #446 Adding peername to PinInfo This commit adds peername to PinInfo and GlobalPinInfo so that we have a nicer and more meaningfull output for `ipfs-cluster-ctl` queries like `status`, `sync` and `recover` License: MIT Signed-off-by: Kishan Mohanbhai Sagathiya <kishansagathiya@gmail.com>	2018-09-24 23:04:16 +05:30
Kishan Sagathiya	ff48fb319a	Issue #446 Adding peername to PinInfo This commit adds peername to PinInfo and GlobalPinInfo so that we have a nicer and more meaningfull output for queries like `ipfs-cluster-ctl status` License: MIT Signed-off-by: Kishan Mohanbhai Sagathiya <kishansagathiya@gmail.com>	2018-09-24 23:04:16 +05:30
Adrian Lanzafame	31474f6490	update go-cid and go-libp2p License: MIT Signed-off-by: Adrian Lanzafame <adrianlanzafame92@gmail.com>	2018-09-24 11:35:38 +10:00
Adrian Lanzafame	c6eada9db5	uses new gorpc method to distinguish err type License: MIT Signed-off-by: Adrian Lanzafame <adrianlanzafame92@gmail.com>	2018-08-14 13:34:02 +02:00
Adrian Lanzafame	33f56e8867	stateless: reduce num of methods License: MIT Signed-off-by: Adrian Lanzafame <adrianlanzafame92@gmail.com>	2018-08-14 13:33:39 +02:00
Adrian Lanzafame	bec77546ce	optracker: fix complexity of filter fn License: MIT Signed-off-by: Adrian Lanzafame <adrianlanzafame92@gmail.com>	2018-08-14 13:33:39 +02:00
Adrian Lanzafame	df2753dfc6	implements a stateless pintracker Also updates to the optracker to make retrieving information easier. License: MIT Signed-off-by: Adrian Lanzafame <adrianlanzafame92@gmail.com>	2018-08-14 13:33:36 +02:00
Hector Sanjuan	de56cf166e	Address comments License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-05-28 11:59:26 +02:00
Hector Sanjuan	01f7a9e4e8	Fix: maptracker race issues This commit attempts to fix race issues in the maptracker since the introduction of the OperationTracker. There were two main problems: * Duplicity tracking the state both in the state map and the opTracker * Non atomiciy of operations with different threads being able to affect other threads operations. A test performing random Track/Untracks on the same Cid quickly showed that items would sometimes stay as pin_queued or pin_unqueued. That happened because operations could be cancelled under the hood by a different request, while leaving the map status untouched. It was not simply to deal with this issues without a refactoring. First, the state map has been removed, and the operation tracker now provides status information for any Cid. This implies that the tracker keeps all operations and operations have a `PhaseDone`. There's also a new `OperationRemote` type. Secondly, operations are only created in the tracker and can only be removed by their creators (they can be overwritten by other operations though). Operations cannot be accessed directly and modifications are limited to setting Error for PhaseDone operations. After created, *Operations are queued in the pinWorker queues which handle any status updates. This means, that, even when an operation has been removed from the tracker, status updates will not interfere with any other newer operations. In the maptracker, only the Unpin worker Cleans operations once processed. A sucessful unpin is the only way that a delete() happens in the tracker map. Otherwise, operations stay there until a newer operation for the Cid arrives and 1) cancels the existing one 2) takes its place. The tracker refuses to create a new operation if a similar "ongoing" operation of the same type exists. The final change is that Recover and RecoverAll() are not async and play by the same rules as Track() and Untrack(), queueing the items to be recovered. Note: for stateless pintracker, the tracker will need to Clean() operation of type OperationPin as well, and complement the Status reported by the tracker with those coming from IPFS. License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-05-28 11:59:26 +02:00
Hector Sanjuan	acc8366f58	Rename optracker to pintracker/optracker License: MIT Signed-off-by: Hector Sanjuan <code@hector.link>	2018-05-28 11:59:26 +02:00

33 Commits