ipfs-cluster

Author	SHA1	Message	Date
Hector Sanjuan	755cebbe0d	Enable spell checking and fix spelling errors (using US locale)	2022-06-16 17:43:30 +02:00
Hector Sanjuan	508791b547	Migrate from ipfs/ipfs-cluster to ipfs-cluster/ipfs-cluster This performs the necessary renamings.	2022-06-16 17:43:30 +02:00
Hector Sanjuan	4daece2b98	Feat: add a new "pinqueue" informer component This new component broadcasts metrics about the current size of the pinqueue, which can in turn be used to inform allocations. It has a weight_bucket_size option that serves to divide the actual size by a given factor. This allows considering peers with similar queue sizes to have the same weight. Additionally, some changes have been made to the balanced allocator so that a combination of tags, pinqueue sizes and free-spaces can be used. When allocating by [<tag>, pinqueue, freespace], the allocator will prioritize choosing peers with the smallest pin queue weight first, and of those with the same weight, it will allocate based on freespace.	2022-06-16 17:43:29 +02:00
Hector Sanjuan	a97ed10d0b	Adopt api.Cid type - replaces cid.Cid everywhere. This commit introduces an api.Cid type and replaces the usage of cid.Cid everywhere. The main motivation here is to override MarshalJSON so that Cids are JSON-ified as '"Qm...."' instead of '{ "/": "Qm....." }', as this "ipld" representation of IDs is horrible to work with, and our APIs are not issuing IPLD objects to start with. Unfortunately, there is no way to do this cleanly, and the best way is to just switch everything to our own type.	2022-04-07 14:27:39 +02:00
Hector Sanjuan	1d98538411	Adders: stream blocks to destinations This commit fixes #810 and adds block streaming to the final destinations when adding. This should add major performance gains when adding data to clusters. Before, every time cluster issued a block, it was broadcast individually to all destinations (new libp2p stream), where it was block/put to IPFS (a single block/put http roundtrip per block). Now, blocks are streamed all the way from the adder module to the ipfs daemon, by making every block as it arrives a single part in a multipart block/put request. Before, block-broadcast needed to wait for all destinations to finish in order to process the next block. Now, buffers allow some destinations to be faster than others while sending and receiving blocks. Before, if a block put request failed to be broadcast everywhere, an error would happen at that moment. Now, we keep streaming until the end and only then report any errors. The operation succeeds as long as at least one stream finished successfully. Errors block/putting to IPFS will not abort streams. Instead, subsequent blocks are retried with a new request, although the method will return an error when the stream finishes if there were errors at any point.	2022-03-24 17:24:58 +01:00
Hector Sanjuan	eee53bfa4f	Streaming Peers(): make Peers() a streaming call This commit makes all the changes to make Peers() a streaming call. While Peers is usually a non problematic call, for consistency, all calls returning collections assembled through broadcast to cluster peers are now streaming calls.	2022-03-23 01:27:57 +01:00
Hector Sanjuan	0d73d33ef5	Pintracker: streaming methods This commit continues the work of taking advantage of the streaming capabilities in go-libp2p-gorpc by improving the ipfsconnector and pintracker components. StatusAll and RecoverAll methods are now streaming methods, with the REST API output changing accordingly to produce a stream of GlobalPinInfos rather than a json array. pin/ls request to the ipfs daemon now use ?stream=true and avoid having to load the full pinset map on memory. StatusAllLocal and RecoverAllLocal requests to the pin tracker stream all the way and no longer store the full pinset, and the full PinInfo status slice before sending it out. We have additionally switched to a pattern where streaming methods receive the channel as an argument, allowing the caller to decide on whether to launch a goroutine, do buffering etc.	2022-03-22 15:38:01 +01:00
Hector Sanjuan	9b9d76f92d	Pinset streaming and method type revamp This commit introduces the new go-libp2p-gorpc streaming capabilities for Cluster. The main aim is to work towards heavily reducing memory usage when working with very large pinsets. As a side-effect, it takes the chance to revamp all types for all public methods so that pointers to what should be static objects are not used anymore. This should heavily reduce heap allocations and GC activity. The main change is that state.List now returns a channel from which to read the pins, rather than pins being all loaded into a huge slice. Things reading pins have been all updated to iterate on the channel rather than on the slice. The full pinset is no longer fully loaded onto memory for things that run regularly like StateSync(). Additionally, the /allocations endpoint of the rest API no longer returns an array of pins, but rather streams json-encoded pin objects directly. This change has extended to the restapi client (which puts pins into a channel as they arrive) and to ipfs-cluster-ctl. There are still pending improvements like StatusAll() calls which should also stream responses, and especially BlockPut calls which should stream blocks directly into IPFS on a single call. These are coming up in future commits.	2022-03-19 03:02:55 +01:00
Hector Sanjuan	3029972750	pinsvcapi: test GetPin and DeletePin endpoints	2022-03-11 16:41:48 +01:00
Hector Sanjuan	fbc69ee3c6	pinsvcapi: fix several API test failures	2022-03-11 16:18:08 +01:00
Hector Sanjuan	cd5f9c869d	pinsvcapi: Test and fix bugs in Add Pin. Improve performance.	2022-03-11 14:01:15 +01:00
Hector Sanjuan	f8812b3af7	pinsvc: fixes and testing for List endpoint	2022-03-11 12:58:12 +01:00
Hector Sanjuan	5fed4a2c5e	types: include IPFSAddresses in pinInfo objects. pinsvcapi: do not cache peer information here as all the needed information is in the status objects. This adds ipfs_addresses as a field broadcasted with the ping metrics.	2022-03-10 23:49:01 +01:00
Hector Sanjuan	5b0d9d68e3	Merge branch 'master' into feat/pinning-api	2022-03-10 13:41:54 +01:00
Hector Sanjuan	d4073f9cfa	cluster: add PeersWithFilter option that only requests info for certain peer IDs (currently will be unused)	2022-02-02 00:43:00 +01:00
Hector Sanjuan	223b54cab6	Restapi: add "cids" query param to /pins This allows requesting status specifically for several CIDs as provided in the "cids" query parameter, instead of requesting status for all CIDs. In this case, the filter is ignored.	2022-02-02 00:39:09 +01:00
Hector Sanjuan	5e89c0ba41	Pintracker: set Name in operation tracker. Fixes #1212 .	2022-01-31 21:04:11 +01:00
Hector Sanjuan	809b7fbda5	Pintracker: add IPFS ID to Pin Information Fixes #1554 Fixes: peer names unset for remote peers This adds an IPFS field to pin status information (PinInfoShort). It has not been easy to add this, given that the IPFS ID is something that comes from outside of cluster (unlike the peer name). After several tries I have settled on the following things: - Use the ping metric to send out peer names and IPFS IDs to the peers in the cluster. - Cache the latest known IPFS ID (if IPFS dies we should still be setting the ID). - Provide an RPC method for the Pintracker to obtain IPFS ID from the cache. - Given we now know information for peernames and IPFS IDs from other peers, we can use that information even if the requests to them error or we are not contacting them (i.e. peers allocated as remote are not queried for status). We can use the information from the last received ping metric. - This means we should keep metrics around even if peers go away, at least for a while rather than deleting them as soon as we detect that they have expired. Putting it all together we now have a system to gossip peer information around on top of the ping metrics.	2022-01-31 17:53:09 +01:00
Hector Sanjuan	ea5e18078c	Informers: GetMetric() -> GetMetrics() Support returning multiple metrics per informer.	2021-09-15 20:07:37 +02:00
Hector Sanjuan	edfcfa3fb0	Fix #1360 : Efficient pinset status with filters This commit modifies the pintracker StatusAll call to take a status filter. This allows skipping a PinLs call to ipfs when checking status for items that are queued, pinning, unpinning or in error. Those statuses come directly from the operation tracker. This should result in a significant performance increase for those calls, particularly in nodes with several hundred thousand pins and more, where the call to IPFS is very expensive. A new TrackerStatusUnexpectedlyUnpinned status has been introduced to differentiate between pin errors (tracked by the operation tracker) and "lost" items (which before were pin errors too). This new status is handled by the Recover() operation as before.	2021-07-06 11:34:19 +02:00
Hector Sanjuan	90208b45f9	health/alerts endpoint: brush up old PR	2021-01-13 22:09:21 +01:00
Hector Sanjuan	4bcb91ee2b	Merge branch 'master' into feat/alerts	2021-01-13 21:08:49 +01:00
Hector Sanjuan	c026299b95	Include Name as GlobalPinInfo key and consolidate redundant keys GlobalPinInfo objects carried redundant information (Cid, Peer) that takes space and time to serialize. This has been addressed by having GlobalPinInfo embed PinInfoShort rather than PinInfo. This new type omits redundant fields.	2020-05-16 02:27:24 +02:00
Hector Sanjuan	fa762d78cf	Improve PinLsCid to check for pinned items Receive the full pin object so that it can decide whether to check for recursive or direct pins directly. Additionally, unpin will not check for the pin presence anymore and simply trigger unpins (ignoring errors)	2020-04-21 17:23:55 +02:00
Hector Sanjuan	717ed85823	gofmt -s fixes	2020-04-14 23:44:18 +02:00
Hector Sanjuan	f83ff9b655	staticcheck: fix all staticcheck warnings in the project	2020-04-14 20:16:10 +02:00
Kishan Mohanbhai Sagathiya	68abae9287	Merge branch 'master' into feat/alerts	2019-12-23 12:45:22 +05:30
Kishan Mohanbhai Sagathiya	a3b8767e87	Added tests for Alerts - tests for related cluster method, rest api, client method etc - clean expired alerts every time a new alert comes in	2019-12-23 12:42:38 +05:30
Hector Sanjuan	09d933fde1	pintracker: take care of tests Simplify the tests, remove things that are not used at all, align the behaviour of the mocks, add methods to test the correct behaviour of Status etc.	2019-12-13 12:03:01 +01:00
Kishan Sagathiya	5258a4d428	Remove map pintracker (#944 ) This removes mappintracker and sets stateless tracker as the default (and only) pintracker component. Because the stateless tracker matches the cluster state with only ongoing operations being kept on memory, and additional information provided by ipfs-pin-ls, syncing operations are not necessary. Therefore the Sync/SyncAll operations are removed cluster-wide.	2019-12-12 21:22:54 +01:00
Kishan Sagathiya	e1faf12bae	ipfsproxy: hijack repo/gc and trigger cluster-wide GC This adds hijacking of the repo/gc endpoint to the proxy to do cluster-wide gc.	2019-12-06 13:08:57 +01:00
Hector Sanjuan	249d9007d2	Merge branch 'master' into feat/cluster-gc	2019-11-07 18:35:42 +01:00
Kishan Sagathiya	31534a429b	Fix #374 : `health metrics` improvements - Human-sizes for freespace metrics. Display when a metric expires as something like "expires in 3m". - When no metric name is passed, `ipfs-cluster-ctl health metrics` hits the metrics endpoint, which returns a list of available metrics, and displays it to the user - Humanize metrics output - Sort metrics output	2019-10-24 16:37:26 +02:00
Kishan Mohanbhai Sagathiya	492b5612e7	Add ability to run Garbage Collector on all peers - cluster method, ipfs connector method, rpc and rest apis, command, etc for repo gc - Remove extra space from policy generator - Added special timeout for `/repo/gc` call to IPFS - Added `RepoGCLocal` cluster rpc method, which will be used to run gc on local IPFS daemon - Added peer name to the repo gc struct - Sorted with peer ids, while formatting(only affects cli results) - Special timeout setting where timeout gets checked from last update - Added `local` argument, which would run gc only on contacted peer	2019-10-22 11:13:19 +05:30
Kishan Mohanbhai Sagathiya	9cb1cdeaff	Merge branch 'master' into feat/recover-all	2019-09-08 17:06:43 +07:00
Hector Sanjuan	d63a7fd641	Merge pull request #877 from ipfs/fix/ipfs-to-p2p Use `p2p` protocol name over `ipfs` for multiaddr	2019-09-06 15:00:36 +02:00
Kishan Mohanbhai Sagathiya	46d77d9d50	Fixed tests, RecoverLocal and RecoverAllLocal rpc api not closed anymore	2019-09-05 18:29:34 +07:00
Kishan Mohanbhai Sagathiya	512bf6a13b	Pin recover on all peers - recover works without `--local` flag as well (recovers all pins on all peers) - remove extra space from rpc policy Fixes #763	2019-08-21 11:19:07 +05:30
Kishan Mohanbhai Sagathiya	6656b80a00	Some more occurrences of /ipfs and use SwapToP2pMultiaddrs (very helpful since ipfs still sends addresses with `/ipfs` tag)	2019-08-16 11:56:09 +05:30
Hector Sanjuan	b6b44f65f7	Adder: fix tests rpc mock returned 0 allocations and things started failing.	2019-08-13 17:26:02 +02:00
Kishan Sagathiya	c0b8301525	Fix #854 : 404 on deleting a pin that isn't part of pinset (#854 ) With this commit - If cid in `DELETE /pins/{cid}` isn't part of the pinset, it would return 404 - If path in `DELETE /pins/{keyType}/{path}` resolves to a cid that isn't part of the pinset, it would return 404	2019-07-29 13:26:53 +02:00
Hector Sanjuan	7c636061bd	Improve pin/unpin method signatures (#843 ) * Improve pin/unpin method signatures: This changes the following Cluster Go API methods: * -> Cluster.Pin(ctx, cid, options) (pin, error) * -> Cluster.Unpin(ctx, cid) (pin, error) * -> Cluster.PinPath(ctx, path, opts) (pin, error) Pin and Unpin now return the pinned object. The signature of the methods now matches that of the API Client, is clearer as to what options the user can set and is aligned with PinPath, UnpinPath, which returned pin methods. The REST API now returns the Pinned/Unpinned object rather than 204-Accepted. This was necessary for a cleaner pin/update approach, which I'm working on in another branch. Most of the changes here are updating tests to the new signatures * Adapt load-balancing client to new Pin/Unpin signatures * cluster.go: Fix typo Co-Authored-By: Kishan Sagathiya <kishansagathiya@gmail.com> * cluster.go: Fix typo Co-Authored-By: Kishan Sagathiya <kishansagathiya@gmail.com>	2019-07-22 15:39:11 +02:00
Hector Sanjuan	b804e61ef0	Update deps along with go-libp2p-core refactor Lots of rewrites in imports...	2019-06-14 13:10:45 +02:00
Hector Sanjuan	a0eeddfae7	Test: remove removed endpoints from mock RPC	2019-05-09 22:52:02 +02:00
Hector Sanjuan	3d49ac26a5	Feat: Split components into RPC Services I had thought of this for a very long time but there were no compelling reasons to do it. Specifying RPC endpoint permissions becomes however significantly nicer if each Component is a different RPC Service. This also fixes some naming issues like having to prefix methods with the component name to separate them from methods named in the same way in some other component (Pin and IPFSPin).	2019-05-04 21:36:10 +01:00
Hector Sanjuan	da24114ae0	Proxy: hijack pin/update The IPFS pin/update endpoint takes two arguments and usually unpins the first and pins the second. It is a bit more efficient to do it in a single operation than two separate ones. This will make the proxy endpoint hijack pin/update requests. First, the FROM pin is fetched from the state. If present, we set the options (replication factors, actual allocations) from that pin to the new one. Then we pin the TO item and proceed to unpin the FROM item when `unpin` is not false. We need to support path resolving, just like IPFS, therefore it was necessary to expose IPFSResolve() via RPC.	2019-04-29 16:36:40 +02:00
Kishan Mohanbhai Sagathiya	226953dd26	Make IPFSID pointer Make IPFSID pointer so that it can be made nil when empty	2019-03-18 18:24:56 +05:30
Hector Sanjuan	23db807b87	ipfsproxy: use PinPath to match IPFS behaviour License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2019-03-04 15:54:34 +00:00
Hector Sanjuan	ea85cf7805	Rename "test.Test" to "test." (test.TestCid1 -> test.Cid1) License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2019-02-27 20:19:10 +00:00
Hector Sanjuan	9df6344a07	Avoid using string testing CIDs and use cid.Cids directly License: MIT Signed-off-by: Hector Sanjuan <hector@protocol.ai>	2019-02-27 20:09:31 +00:00
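
The "pinqueue" informer commit (4daece2b98) describes dividing the queue size by weight_bucket_size so that peers with similar queue sizes get the same weight, and then breaking ties by free space. A minimal Go sketch of that idea follows. It is not the project's code: the peerMetrics type and the bucketWeight and sortCandidates functions are hypothetical names made up for illustration, and the handling of a zero bucket size is an assumption.

```go
package main

import (
	"fmt"
	"sort"
)

// peerMetrics is a hypothetical stand-in for the metrics a peer broadcasts.
type peerMetrics struct {
	Name      string
	QueueSize uint64 // current number of items in the pin queue
	FreeSpace uint64 // free space, in arbitrary units
}

// bucketWeight divides the queue size by bucketSize using integer division,
// so that peers with similar queue sizes end up with the same weight.
// Treating a zero bucketSize as "no bucketing" is an assumption of this sketch.
func bucketWeight(queueSize, bucketSize uint64) uint64 {
	if bucketSize == 0 {
		return queueSize
	}
	return queueSize / bucketSize
}

// sortCandidates orders peers by ascending pin queue weight and, among peers
// with the same weight, by descending free space.
func sortCandidates(peers []peerMetrics, bucketSize uint64) {
	sort.SliceStable(peers, func(i, j int) bool {
		wi := bucketWeight(peers[i].QueueSize, bucketSize)
		wj := bucketWeight(peers[j].QueueSize, bucketSize)
		if wi != wj {
			return wi < wj
		}
		return peers[i].FreeSpace > peers[j].FreeSpace
	})
}

func main() {
	peers := []peerMetrics{
		{Name: "a", QueueSize: 120, FreeSpace: 10},
		{Name: "b", QueueSize: 5, FreeSpace: 20},
		{Name: "c", QueueSize: 90, FreeSpace: 50},
	}
	sortCandidates(peers, 100)
	for _, p := range peers {
		fmt.Println(p.Name, bucketWeight(p.QueueSize, 100), p.FreeSpace)
	}
}
```

With a bucket size of 100, peers "b" and "c" fall in the same bucket even though their queue sizes differ, so free space decides between them.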

89 Commits