Fix Brittleness In Logs Processing With Source Detection #232

jlewi · 2024-09-12T19:14:08Z

Our logs processing pipeline is based on using the source of the logging statement to figure out how to parse/post process the logging line. This is brittle because if the code is refactored we change the source location. Its also brittle because its easy to get the function name wrong.

For example here the log is inside an anonymous function

foyle/app/pkg/agent/agent.go

Line 530 in 9d44e44

    
           _, span := tp.Start(ctx, "LogEvent", trace.WithAttributes(attribute.String("eventType", event.Type.String()), attribute.String("contextId", event.ContextId), attribute.String("selectedCellId", event.SelectedId)))

As a result the function name in the logging statement ends up being

github.com/jlewi/foyle/app/pkg/agent.(*Agent).LogEvents.func1

One approach we've taken to starting to deal with this is

Defining constants for the function name
Defining unittests that use reflection to get the name of the function and ensure its equal to the constant

https://github.com/jlewi/foyle/pull/230/files#diff-66abb051aad86c8af4c128d3fa1e65c47197c44e23d821a75346e59c833baa92R21

This is imperfect because it doesn't actually check that there is a logging statement there. For example, the test won't catch the case where the logging statement is nested inside an anonymous function.

I wonder if we could fix this by doing more source code analysis? For example, could we use the same tooling used by linters to find the lines associated with a log statement and get the function name associated with it?

The text was updated successfully, but these errors were encountered:

jlewi · 2024-09-12T19:19:05Z

Some suggestions from Claude
https://gist.github.com/jlewi/6058026b421867bf72febdead72e70b3

* Learning was broken because cell executions were not being properly extracted from LogEvents and added to the BlockLog. * The problem is that the logging of LogEvents happens inside an anonymous function. Therefore the function name generated by GoLang has a suffix like "func1"; therefore our code in the Analyzer was not correctly matching it and therefore log events were not being processed. * The fix is to match by using `HasPrefix`. This is still brittle. * Per #232 we'd like to start to fix this brittleness by using reflection and source code analysis in unittests to ensure the correctness of matchers for LogEntries. * This PR introduces a matchers package which verifies correctness of the function name but doesn't properly catch the anonymous function which broke learning here * Update the troubleshooting guide for learning * Move the troubleshooting guide into the docs page * Update the instructions for fetching the blocklogs; we now use the connect protocol ## Strip metadata from markdown * Create a simple tool to strip cell ids from markdown. * This is hack to make it easy to use RunMe markdown docs with Hugo; see stateful/runme#663

jlewi mentioned this issue Sep 12, 2024

Fix Learning: Cell Executions Not Being Added to BlockLogs #233

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Brittleness In Logs Processing With Source Detection #232

Fix Brittleness In Logs Processing With Source Detection #232

jlewi commented Sep 12, 2024

jlewi commented Sep 12, 2024

Fix Brittleness In Logs Processing With Source Detection #232

Fix Brittleness In Logs Processing With Source Detection #232

Comments

jlewi commented Sep 12, 2024

jlewi commented Sep 12, 2024