SPARK CID sampling alpha #43
Comments
Related work we may leverage later: data-preservation-programs/spade#6
How to find the list of Client IDs participating in the FIL+ LDN program for data that should be publicly retrievable:
Presumably, this list can be obtained by inspecting on-chain data; we don't necessarily have to use the […]. When inspecting the StorageMarketActor state for the list of deals, we can sample only deals made by LDN clients.
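Restricting the deal list to LDN clients could look roughly like the sketch below. It assumes the market state has already been fetched as a map of deal ID to deal, and that the LDN client IDs are available as a set. The `MarketDeal` shape is a simplified assumption, and `selectLdnDeals` is a hypothetical helper, not an existing API.

```typescript
// Assumed, simplified shape of one deal in the StorageMarketActor state.
interface MarketDeal {
  Proposal: {
    PieceCID: { "/": string };
    Client: string; // client ID address, e.g. "f0100"
    Provider: string; // storage provider ID address
    VerifiedDeal: boolean; // true for FIL+ deals
  };
}

interface LdnDeal {
  dealId: string;
  pieceCid: string;
  provider: string;
}

// Hypothetical helper: keep only verified deals made by known LDN clients.
function selectLdnDeals(
  deals: Record<string, MarketDeal>,
  ldnClients: ReadonlySet<string>,
): LdnDeal[] {
  const selected: LdnDeal[] = [];
  for (const [dealId, deal] of Object.entries(deals)) {
    const { Client, Provider, PieceCID, VerifiedDeal } = deal.Proposal;
    if (VerifiedDeal && ldnClients.has(Client)) {
      selected.push({ dealId, pieceCid: PieceCID["/"], provider: Provider });
    }
  }
  return selected;
}
```

The client set is passed in rather than hard-coded, so it can come from on-chain data or from any other source of LDN client IDs.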
We can do this programmatically, too:
API docs: https://api.datacapstats.io/docs
Until we have an IPNI endpoint for sampling Payload CIDs, we may want to lean on the approach based on analysing Piece data, as explored by RetrievalBot: data-preservation-programs/RetrievalBot#36
How to get an API key:
Next steps:
What's remaining:
We already have that data in InfluxDB as of filecoin-station/spark-evaluate#61, but I am reworking that part in filecoin-station/spark-evaluate#67, so I am holding off on the dashboards until the second PR has landed.
Visualisation in SPARK dashboard
eta: 2023-10-31
description: Remove the static list of job templates and replace it with dynamic (CID, SP) selection that samples data stored in FIL+ deals. Depending on the complexity of the “proper” CID sampling we envision, this milestone can implement a simplified version or a part of the grand solution.
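As an illustration of replacing a static template list with dynamic selection, here is a minimal sketch that draws a uniform random sample of (CID, SP) pairs from a candidate list. The `RetrievalTask` type and the `sampleTasks` function are hypothetical names, and uniform sampling is only one possible strategy, not necessarily the one this milestone will adopt.

```typescript
// Hypothetical task shape: which CID to retrieve from which storage provider.
interface RetrievalTask {
  cid: string;
  provider: string;
}

// Pick up to `count` distinct tasks uniformly at random using a partial
// Fisher-Yates shuffle. `random` is injectable so the selection can be
// made deterministic in tests.
function sampleTasks(
  candidates: readonly RetrievalTask[],
  count: number,
  random: () => number = Math.random,
): RetrievalTask[] {
  const pool = [...candidates];
  const n = Math.max(0, Math.min(count, pool.length));
  for (let i = 0; i < n; i++) {
    // Choose an index in [i, pool.length) and swap it into position i.
    const j = i + Math.floor(random() * (pool.length - i));
    const tmp = pool[i]!;
    pool[i] = pool[j]!;
    pool[j] = tmp;
  }
  return pool.slice(0, n);
}
```

Copying the candidates before shuffling leaves the input untouched, so the same deal-derived list can be sampled repeatedly across rounds.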
Current idea:
See also:
Dependencies: