Option to dump OpenLineage events that correspond to dataset/namespace from web #1927
Comments
I would add a
It would also be nice if the Marquez UI had a small button at the bottom that, when pressed, reveals a console panel showing all the raw OL events it has received, acting as a kind of debug console. Users typically use Marquez to visualize events after they are stored in its backend DB, but they may also want to monitor how the OL messages are actually being received.
Perhaps there could be a sortable/filterable/paginated table in the UI for lineage events. There have been many times when I just wanted to see "what got emitted" or "did my pipeline do it right?" It would be an excellent debugging tool, and could help people build OL integrations more quickly. Does an API exist that could support this? If not, a single new one could possibly serve both a) data export and b) debugging use cases.
@wslulciuc @howardyoo @rossturk Do we want to
I think the API would look different depending on the decision here. Of course, the first option is the simplest.
@rossturk, BTW, there is now a workaround for this issue: put OL proxy between the client side and Marquez to eavesdrop on the raw events that are received. The setup would be to have OL proxy in front, and set up its
Rather, in my opinion it would be more useful to be able to filter on event types (like COMPLETE, FAIL, etc.) or on a particular time period.
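To make the filtering idea concrete, here is a minimal sketch in Python. It is not an existing Marquez API: `filter_events` and the sample data are hypothetical, and it only assumes that each raw event is a dict carrying the standard OpenLineage `eventType` and `eventTime` fields.

```python
from datetime import datetime, timezone


def filter_events(events, event_types=None, start=None, end=None):
    """Return events matching the given event types and [start, end) window.

    Hypothetical helper: `events` is a list of raw OpenLineage events as dicts.
    """
    selected = []
    for event in events:
        if event_types is not None and event.get("eventType") not in event_types:
            continue
        # eventTime is an ISO 8601 timestamp; map a trailing "Z" to an offset
        # so datetime.fromisoformat accepts it on older Python versions.
        event_time = datetime.fromisoformat(event["eventTime"].replace("Z", "+00:00"))
        if start is not None and event_time < start:
            continue
        if end is not None and event_time >= end:
            continue
        selected.append(event)
    return selected


# Made-up sample events, for illustration only.
sample_events = [
    {"eventType": "START", "eventTime": "2022-03-01T10:00:00Z"},
    {"eventType": "COMPLETE", "eventTime": "2022-03-01T10:05:00Z"},
    {"eventType": "FAIL", "eventTime": "2022-03-02T09:00:00Z"},
]

finished_on_march_1 = filter_events(
    sample_events,
    event_types={"COMPLETE", "FAIL"},
    start=datetime(2022, 3, 1, tzinfo=timezone.utc),
    end=datetime(2022, 3, 2, tzinfo=timezone.utc),
)
```

In a real implementation the same two predicates would more likely be pushed down into the backend query than applied in memory.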
I think the answer to @mobuchowski is: yes to all three. As a user, I want all three of those things. What I'm imagining is a table with filter/sort controls at the top and page controls at the bottom. The columns could more or less match the underlying DB table. I agree that filtering on dataset or job is more difficult and marginally less interesting 👍
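As a rough illustration of what the sort and page controls would need from a backend, here is a small Python sketch. The function name, parameters, and response shape are all assumptions for discussion, not an existing Marquez endpoint.

```python
def page_of_events(events, sort_key="eventTime", descending=True, page=0, page_size=25):
    """Sort events by one column and return a single page plus the total count.

    Hypothetical helper. Sorting eventTime as a string is only correct when all
    timestamps share the same ISO 8601 format and timezone.
    """
    ordered = sorted(events, key=lambda e: e.get(sort_key, ""), reverse=descending)
    first = page * page_size
    return {
        "total": len(ordered),
        "page": page,
        "events": ordered[first:first + page_size],
    }


# Made-up rows, for illustration only.
rows = [
    {"eventType": "START", "eventTime": "2022-03-01T10:00:00Z"},
    {"eventType": "COMPLETE", "eventTime": "2022-03-01T10:05:00Z"},
    {"eventType": "FAIL", "eventTime": "2022-03-02T09:00:00Z"},
]

newest_first = page_of_events(rows, page=0, page_size=2)
second_page = page_of_events(rows, page=1, page_size=2)
```

Returning the total alongside each page is what lets the UI draw the page controls at the bottom of the table.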
Having this feature would make debugging and replicating errors much faster.