Move AI Logging Of Execution Into VSCode Frontend and out of RunMe GRPC Server #211

jlewi · 2024-08-27T15:35:28Z

Background

Foyle relies on logs of cell execution for feedback and training. Cell execution is currently logged from the RunMe GRPC server and not the vscode frontend. See stateful/runme#585. I think the backend was chosen over the frontend because I know GoLang much better than type script.

Logging from the backend has a number of disadvantages.

Complicates user setup
Can't collect additional types of feedback.
Complicates event processing
Requires AIService and Executor to be co-located.

User setup

Right now the user has to configure Foyle with the location of the RunMe logs (docs). This is brittle and friction. Now that learning is working pretty well and critical to Foyle's value proposition I'd like to remove this friction.

Can't collect additional types of feedback.

As noted in Tech Note 008 there are valuable events that we'd like to log Accepting/Rejecting AI suggested cells. These events would need to be logged from the front end and not the RunMe gRPC server.

Complicates event processing

Processing AI events for learning is complicated by the fact that Foyle needs to be aware of log management in RunMe. RunMe can launch multiple instances of the RunMe server each with their own log file. Foyle has to monitor all the log files and doesn't necessarily know when a log file is complete and will no longer have events being written to it.

Requires AI Service and Executor to be co-located

Since Foyle needs access to the RunMe executor logs the services need to be colocated.

Proposal

The proposed solution is to introduce an RPC into the Foyle AI service to LogEvents.

rpc LogEvents(LogEventsRequest) returns (LogEventsResponse) {}

Events can then be logged from the frontend via RPC.

We can instrument _doExecute to log executions
We can instrument handleOnDidChangeActiveTextEditor to log ghost cell acceptances
We can instrument processResponse to log ghost cell deletions

With this change we can turn on logging by default without requiring any additional configuration from users. The only thing they would need to configure would be the Foyle endpoint.

@sourishkrout what do you think? Any suggestions or concerns; particularly about the changes to the frontend?

The text was updated successfully, but these errors were encountered:

This PR defines the protos and logging service. It doesn't update the Analyzer to learn from these events. Related to: #211

sourishkrout · 2024-08-28T17:34:04Z

@sourishkrout what do you think? Any suggestions or concerns; particularly about the changes to the frontend?

Yes, when we initially considered porting Foyle on top of Runme, I also considered a more UI-aware approach first. However, it was a good call to lean into strength to PoC the integration. I'll review stateful/vscode-runme#1589 for feedback.

One terminology I mildly object to is the term "frontend," although it's no objection to the proposal. I find that thinking about an IDE, even if a stripped-down version that could run a backend-less "frontend," minimizes the IDE's role and its importance. While it's a longer-term goal of Runme to have the ability to run as a web extension (in a way that makes sense in browser-only), referring to it as "frontend," factually, it is not "just" a frontend app. My web development knee-jerk is over-indexing on its meaning in the web world. Anyways, just a side note, really.

jlewi · 2024-08-28T18:17:09Z

One terminology I mildly object to is the term "frontend," although it's no objection to the proposal.
Ack. Do you have a preferred terminology to distinguish between running in the vscode extension vs. the grpc server?

sourishkrout · 2024-08-28T18:26:40Z

One terminology I mildly object to is the term "frontend," although it's no objection to the proposal.
Ack. Do you have a preferred terminology to distinguish between running in the vscode extension vs. the grpc server?

Usually use "extension" for the whole concept and UI/UX (notebook, editor, terminal, ...) for literally the chrome that's facing the user.

This PR defines the protos and logging service. It doesn't update the Analyzer to learn from these events. Related to: #211

jlewi · 2024-09-02T20:48:59Z

Merge Front end changes Use the Foyle API to report Log Events stateful/vscode-runme#1589
Merge backend LogAnalysis should not rely on cell execution being recorded by LogEvents #222
Merge RunMe gRPC server cleanup Remove changes to support logging for AI stateful/runme#661
Create PR to remove the logging option in RunMe VSCode extension
Create PR to remove the logging option in RunMe gRPC server
* We should probably do this after one version release of the VSCode extension with the aiLogging option removed in the extension so it won't try to set the flag when starting the server

As described in jlewi/foyle#211 we will no longer rely on processing RunMe grpc logs to train the AI. This means we can simplify the Runme logging code and revert some of the changes in #585 * Related to stateful/vscode-runme#1589

…ents (#222) Now that we no longer need to process RunMe logs we can simplify log processing * For the watermark we can just keep track of a single file and its offset * Only the latest file will be active. Therefore we don't need to watch the filesystem for modifications we can just periodically scan the logs. ## Changes to BlockLog Proto * We can remove ExecTraceIds * Since the AI doesn't handle executions we don't have traces for execution * We should rely on the Frontend sending details of execution (e.g. execute code) which it currently isn't * Since LogEvents are processed in order we know that the most recent cell execution will be the final one reported by a LogEvent * Add a field to record suggestion status * Right now we only get log events for cells being accepted we don't record cells being rejected but we can reasonably infer that unaccepted cells have been rejected ## Config Changes * We can deprecate the logDirs field in learner since users should no longer need to configure it to monitor RunMe Logs * Simplifying the setup process of the learner is one of the main motivations of this PR. ## Analyzer Changes *. We no longer need CombineTraces functions for RunMe and Execute traces * We just accumulate LogEvents on the BlockLog object as we sequentially process the logs. ## Other changes * Add an RPC method to check the logs status. * This will report the watermark * Add ZapProto function to handle logging protos that include RunMe objects * We can't rely on the custom go plugin to generate the MarshalObject function because the RunMe protos aren't using that plugin Related to #211

jlewi · 2024-09-13T23:01:55Z

Frontend should have released
https://github.com/stateful/vscode-runme/releases/tag/3.7.5
So now we just need to create a PR to remove the logging option.

* This effectively reverts #1380 * Per jlewi/foyle#211 we are no longer relying on RunMe logs * Rather RunMe now uses Foyle APIs to send events to Foyle as necessary.

* The bug is described in #215 (comment) * The bug is that `Config.getTrainingDirs` returns no training directories if config.learner == nil * Prior to #211 config.learner would be non-nil because we had to set the path of the RunMe logs * However, now that we no longer depend on RunMe logs config.learner could be nil and this would return no training directories. In which case learner.Reconcile would not attempt to save any examples * The fix is to allow config.Learner to be nil in GetTrainingDirs and return a suitable default. We also need to ensure the training directory gets created if it doesn't exist. * This is fixed by jlewi/monogo#23 which updates LocalFileHelper to create the directory if it doesn't exist. * My suspicion is that I never hit this bug because i originally created my ~/.foyle/training directory using a version of the code which wasn't using FileHelper and explicitly checked and created the directory. I suspect when I refactored the code to support saving examples to GCS thats when the code to ensure the directory exists got dropped.

* This flag is deprecated and no longer used. * The frontend should have stopped using this flag starting in 3.7.5 * The frontend is now up to 3.8.5 * So we should be good to remove this flag because the frontend should no longer be setting it. * Related to jlewi/foyle#211

jlewi added a commit that referenced this issue Aug 27, 2024

Define an RPC to log events from the RunMe frontend.

e0702bc

This PR defines the protos and logging service. It doesn't update the Analyzer to learn from these events. Related to: #211

This was referenced Aug 27, 2024

Define an RPC to log events from the RunMe frontend. #213

Merged

Use the Foyle API to report Log Events stateful/vscode-runme#1589

Merged

jlewi added a commit that referenced this issue Aug 29, 2024

Define an RPC to log events from the RunMe frontend. (#213)

c5117ac

This PR defines the protos and logging service. It doesn't update the Analyzer to learn from these events. Related to: #211

This was referenced Aug 30, 2024

LogAnalysis should not rely on cell execution being recorded by LogEvents #222

Merged

Remove changes to support logging for AI stateful/runme#661

Merged

jlewi mentioned this issue Sep 2, 2024

Simplify Getting Started with Foyle and RunMe #224

Open

6 tasks

This was referenced Sep 14, 2024

Remove option to enable aiLogs in the server stateful/vscode-runme#1661

Merged

User Report: Learning Isn't Happening #215

Closed

jlewi mentioned this issue Sep 18, 2024

Bug Fix: Learning doesn't happen because exampleDirs is empty #245

Merged

jlewi mentioned this issue Oct 2, 2024

Cleanup: Remove the ai-logs flag. stateful/runme#679

Merged

jlewi closed this as completed Oct 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move AI Logging Of Execution Into VSCode Frontend and out of RunMe GRPC Server #211

Move AI Logging Of Execution Into VSCode Frontend and out of RunMe GRPC Server #211

jlewi commented Aug 27, 2024

sourishkrout commented Aug 28, 2024

jlewi commented Aug 28, 2024

sourishkrout commented Aug 28, 2024

jlewi commented Sep 2, 2024 •

edited

Loading

jlewi commented Sep 13, 2024

Move AI Logging Of Execution Into VSCode Frontend and out of RunMe GRPC Server #211

Move AI Logging Of Execution Into VSCode Frontend and out of RunMe GRPC Server #211

Comments

jlewi commented Aug 27, 2024

Background

Logging from the backend has a number of disadvantages.

User setup

Can't collect additional types of feedback.

Complicates event processing

Requires AI Service and Executor to be co-located

Proposal

sourishkrout commented Aug 28, 2024

jlewi commented Aug 28, 2024

sourishkrout commented Aug 28, 2024

jlewi commented Sep 2, 2024 • edited Loading

jlewi commented Sep 13, 2024

jlewi commented Sep 2, 2024 •

edited

Loading