
KV Limits #121

Open
0x4007 opened this issue Sep 30, 2024 · 15 comments

Comments

@0x4007
Member

0x4007 commented Sep 30, 2024

  • We hit 90% of the daily quota today. Given that we don't even have any partners using the system yet, this is looking grim.
  • I hope that we can find a way to optimize our KV usage (or find a cheaper alternative, but I am skeptical).
  • [screenshots: Cloudflare dashboard showing KV usage against the daily quota]

Projected Costs

  • The good news is that the paid plan raises our daily limit from 1,000 to 33,333 operations, so we get essentially 33x capacity for $5/month.
  • I estimate that once we get most of the planned plugins up and running, we'll be closer to 5k a day.
  • That works out to roughly 6,667 operations per $1 of cost.
  • I think each large partner will cost us approximately $1 a month on KV
    • Smaller ones will probably be closer to the 1k we are using now.
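A quick sketch of the arithmetic above (the daily limits and the $5/month figure are the ones quoted in this issue; the 5k writes/day per large partner is the estimate from this comment):

```typescript
// Cloudflare KV daily write limits as quoted above.
const freeDailyWrites = 1_000;
const paidDailyWrites = 33_333;
const monthlyCost = 5; // USD

// Roughly 33x the free capacity for $5/month.
const capacityMultiple = paidDailyWrites / freeDailyWrites;

// Operations per $1 of monthly cost (~6,667).
const writesPerDollar = paidDailyWrites / monthlyCost;

// Estimated: a large partner generating ~5k writes/day costs
// about $0.75/month of the budget, in line with the ~$1 estimate.
const largePartnerMonthlyCost = 5_000 / writesPerDollar;

console.log({ capacityMultiple, writesPerDollar, largePartnerMonthlyCost });
```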

Next Steps

Let's discuss how we can optimize the KV usage of the kernel.

@0x4007
Member Author

0x4007 commented Sep 30, 2024

@gentlementlegen @whilefoo rfc

@whilefoo
Contributor

First we need to figure out what's using KV so much. Can you see in the dashboard whether it's the kernel or some plugin?

@0x4007
Member Author

0x4007 commented Sep 30, 2024

Unfortunately, I can't find any useful information from the analytics. I included in the screenshots everything relevant I could find.

@gentlementlegen
Member

Lately we added https://github.com/ubiquity-os-marketplace/generate-vector-embeddings, which reacts to 7 different events. Each plugin run equals one KV put, so the more plugins and events we listen to, the more the usage will increase.

@gentlementlegen
Member

I think the root of the problem is that we need to persist data between runs, since the worker gets destroyed after its job is done; that's why we used KV in the first place. We should consider alternative stores for that data.

@0x4007
Member Author

0x4007 commented Oct 1, 2024

I got the 90% warning again today, and the day just started. I have a feeling it must be the vector embeddings plugin. Perhaps we need to optimize it.

@sshivaditya2019 as a heads up, let us know if you have ideas for optimizing your plugin. Cloudflare KV is used to manage state across "plugin chain" runs: when our config defines multiple plugins to be invoked by a specific webhook (such as issue_comment.created), the kernel uses KV to keep track of which plugin comes next for each job. It seems that this plugin is using more than all of our other plugins combined, by 2-3x.

Is it realistic to check if it's the last event in the plugin chain and stop keeping track at that point? That way we can put the heavy ones at the end, like vector embeddings.

For example, an issue comment is created and we have three plugins. The kernel executes the first and second normally, but it knows the last one is next, so it executes it without reading or writing KV. Once the last plugin has been executed, I don't see why the kernel needs to keep track anymore.
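The last-plugin optimization described above can be sketched as a dispatch step that only persists chain state while another plugin remains. The `PluginChainState` shape and function names are hypothetical, not the kernel's actual API, and a `Map` stands in for the async `KVNamespace` binding so the sketch is runnable:

```typescript
interface PluginChainState {
  eventId: string;      // e.g. the webhook delivery ID
  currentIndex: number; // position in the configured plugin chain
  pluginIds: string[];  // plugins configured for this event
}

// Hypothetical kernel step: invoke the current plugin, then persist the
// chain state only if another plugin still has to run. The real KV binding
// would use `await kv.put(...)` / `await kv.delete(...)`.
function dispatchNext(kv: Map<string, string>, state: PluginChainState): void {
  const isLast = state.currentIndex === state.pluginIds.length - 1;
  // ... invoke state.pluginIds[state.currentIndex] here ...
  if (isLast) {
    // Nothing comes after the last plugin, so skip the KV write and
    // drop any stale state for this event.
    kv.delete(state.eventId);
  } else {
    kv.set(state.eventId, JSON.stringify({ ...state, currentIndex: state.currentIndex + 1 }));
  }
}
```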

@sshivaditya2019

sshivaditya2019 commented Oct 1, 2024

> I got the 90% warning again today and today just started. Yes I have a feeling it must be the vector embeddings plugin. Perhaps we need to optimize it.
>
> @sshivaditya2019 as a heads up, let us know if you have ideas for optimizing your plugin. Cloudflare KV is used to manage state across "plugin chain" runs. Basically in our config when we define multiple plugins to be invoked by a specific webhook (such as issue_comment.created) the kernel keeps track of what comes next per job using KV. It seems that this plugin is using more than all of our plugins combined, and by 2-3x.
>
> Is it realistic to check if it's the last event in the plugin chain and not keep track of it anymore? That way we can put the heavy ones at the end, like vector embeddings?
>
> For example, an issue comment is created and we have three plugins. The kernel executes the first and second normally, but it knows the last one is next, so it executes it and does not read/write KV. If it executes it, I don't see why it needs to keep track anymore.

A straightforward way to optimize would be to divide this functionality into several plugins. For instance, "Issue Matching" could function as one action plugin, while "Issue Deduplication" could be another. We could further enhance efficiency by implementing batch processing for comments, rather than triggering actions every time a comment is edited or deleted.

Alternatively, we could maintain the current setup but use a Postgres connection URI instead of the Supabase key and URI. We could also implement the embedding generation as a Postgres function. We save close to 14 KV operations per invocation.

Another alternative would be to limit access to the anonymous key (Clear Text) as much as possible (RLS with Policy) and instead pass JWT tokens from the kernel. These tokens would then be used by the worker to make calls to the Supabase REST API.
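One way to read the Postgres-function suggestion above: the worker makes a single RPC round trip and the database computes and stores the embedding, instead of the plugin issuing many separate REST calls. A minimal sketch, assuming a hypothetical `generate_embedding` SQL function; the `RpcClient` interface is a stand-in for the matching slice of supabase-js:

```typescript
// Minimal shape of the client this sketch needs; supabase-js's `rpc` matches it.
interface RpcClient {
  rpc(fn: string, args: Record<string, unknown>): Promise<{ error: unknown }>;
}

// Hypothetical: `generate_embedding` is a Postgres function that computes and
// stores the embedding server-side in one round trip.
async function storeCommentEmbedding(db: RpcClient, commentId: string, body: string): Promise<void> {
  const { error } = await db.rpc("generate_embedding", {
    comment_id: commentId,
    comment_body: body,
  });
  if (error) throw new Error(String(error));
}
```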

@0x4007
Member Author

0x4007 commented Oct 1, 2024

> A straightforward way to optimize would be to divide this functionality into several plugins. For instance, "Issue Matching" could function as one action plugin, while "Issue Deduplication" could be another. We could further enhance efficiency by implementing batch processing for comments, rather than triggering actions every time a comment is edited or deleted.

Anything on a timer is a no-go. Can you make batch processing event-based?

How does breaking it apart into separate plugins help with this? Also, won't it cause a lot of code duplication? I always prefer breaking apart plugins wherever possible for enhanced modularity.

> Alternatively, we could maintain the current setup but use a Postgres connection URI instead of the Supabase key and URI. We could also implement the embedding generation as a Postgres function. We save close to 14 KV operations per invocation.

Saving 14 KV operations per invocation is massive. Let's do this immediately!

> Another alternative would be to limit access to the anonymous key (clear text) as much as possible (RLS with a policy) and instead pass JWT tokens from the kernel. These tokens would then be used by the worker to make calls to the Supabase REST API.

I don't understand how this helps.

@whilefoo
Contributor

whilefoo commented Oct 1, 2024

> Is it realistic to check if it's the last event in the plugin chain and not keep track of it anymore? That way we can put the heavy ones at the end, like vector embeddings?

Technically we don't need to keep track if it's the last plugin (or the only plugin) in the chain, but that also means we don't get to use the response from the plugin. We currently don't use it, but we might in the future, for example if a plugin returns rewards to the kernel or returns comment HTML for the kernel to post.

@0x4007
Member Author

0x4007 commented Oct 1, 2024

Seems janky to have a switch in the config to enable this feature (dropStateOfLastPlugin: boolean), but it might be useful in a pinch.

@gentlementlegen
Member

gentlementlegen commented Oct 2, 2024

There are plugins that run quite a lot, like https://github.com/ubiquity-os-marketplace/automated-merging and https://github.com/ubiquity-os-marketplace/disqualifier, when these would only need to run once a day (this would save hundreds of KV calls). I know you're against CRONs, but finding something that behaves similarly would be very helpful.

@0x4007
Member Author

0x4007 commented Oct 2, 2024

Let's focus on the most prominent problem (vector embeddings plugin) and then work our way down to optimize others as needed.

I have some half-baked ideas for handling these "cron suitable" events. I think there's potential for a solution using the dropStateOfLastPlugin: boolean feature mentioned above. I don't love it because it doesn't seem elegant, but it's simple to use and useful.

@gentlementlegen
Member

I think automated-merging and disqualifier consume more than the vector-embeddings plugin, without good reason. Vector embedding could maybe also run as one big batch a day.

@gentlementlegen
Member

Coming back to this: after the daemon-disqualifier ran yesterday, we used nearly 100% of the quota in one day. I believe adding some CRON capability to the kernel would be very useful for recurring tasks like these, because they could run only once a day and accomplish the same job (and also avoid the comment bombing that happened yesterday). We also now have 4 orgs running, which potentially multiplies the usage by 4.

My idea would be to add a cron item in the configuration, available to every plugin, where we could give a CRON-like value. The kernel would keep track of these and call each plugin only when necessary. That would also de-clutter the action runs, which are now sitting at 5k and would be nearly impossible to debug. With one run a day we could easily output a summary of which tasks were updated, like daemon-merging already does.
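The configuration idea could look like an optional cron field per plugin entry: the kernel would skip cron plugins on webhook events and invoke them from a scheduler instead. Field names and shapes here are hypothetical, not the kernel's actual config schema:

```typescript
interface PluginEntry {
  plugin: string; // e.g. "ubiquity-os-marketplace/daemon-disqualifier"
  cron?: string;  // e.g. "0 0 * * *" to run once a day instead of per event
}

// Hypothetical kernel-side check: plugins with a cron value are not invoked
// per webhook event, only by the scheduler when their interval elapses.
function shouldRunOnWebhook(entry: PluginEntry): boolean {
  return entry.cron === undefined;
}
```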

@0x4007
Member Author

0x4007 commented Oct 16, 2024

disqualifier

Is disqualifier ignoring bot comments? If not, then it's recursively invoking itself. It's poorly implemented and should be redone.

"daemons"

For any "daemon" class plugin, if we want the clock to be frequent but also want to be smart about our KV usage, here is a solution that builds on my previous proposal:

We have a "queue job plugin" at the end of our comment-event plugin chain.

All it does is act as a queue/buffer. It collects a queue of jobs to run, each with a job nonce, and we can set the number of recurring runs per time interval (like four times a day) in its configuration.

nonce

The job nonce essentially deduplicates what would be redundant jobs; for example, following up on a particular issue only needs to happen once per interval. It could also be called a job ID, describing the type of action (a plugin-developer-defined action class name) and where it occurs (perhaps the node ID of an issue or pull request):

actionClassName-nodeID
followUpIssue-1234

The benefit of this approach is that if nothing is in the queue, it should not attempt to run.

As a final optimization (although I realize now it might not be necessary), because this plugin sits at the end of the chain, we can stop tracking KV state for any subsequent "daemon" events dispatched from the buffer/queue.
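The queue/buffer with nonce deduplication sketched above could look like this (names are illustrative; a real plugin would persist the queue and drain it on the configured interval):

```typescript
// A job nonce is `actionClassName-nodeID`, e.g. "followUpIssue-1234".
interface QueuedJob {
  action: string; // plugin-developer-defined action class name
  nodeId: string; // where it occurs, e.g. an issue or pull request node ID
}

class JobQueue {
  private jobs = new Map<string, QueuedJob>();

  // Enqueueing the same nonce twice is a no-op: following up on a given
  // issue only needs to happen once per interval.
  enqueue(job: QueuedJob): void {
    this.jobs.set(`${job.action}-${job.nodeId}`, job);
  }

  // Called by the scheduled run (e.g. four times a day). An empty queue
  // means the run has nothing to do and can exit immediately.
  drain(): QueuedJob[] {
    const batch = [...this.jobs.values()];
    this.jobs.clear();
    return batch;
  }
}
```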

4 participants