-
Notifications
You must be signed in to change notification settings - Fork 715
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Event blocks #546
Event blocks #546
Conversation
This simplifies the V3 format. I also spit out the code for the old format so it would be easier to remove it later.
…d it's lenght does not come first
# Conflicts: # src/FastSerialization/FastSerialization.cs
src/TraceEvent/TraceEvent.csproj
Outdated
@@ -3,6 +3,7 @@ | |||
|
|||
<PropertyGroup> | |||
<TargetFrameworks>net45;netstandard1.6</TargetFrameworks> | |||
<LangVersion>7.0</LangVersion> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Including @sharwell
Please remove this and update the declaration in src\Directory.Build.props instead. That way all projects benefit from it uniformly.
I am OK moving to C# 7. It means a dependency on VS 2017, but realistically we have probably already many dependencies on 2017 (msbuild files, nuget support, bug fixes, ...)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@vancem done
|
||
var entryObj = _deserializer.GetEntryObject(); // this call invokes FromStream and reads header data | ||
#endif | ||
_deserializer.RegisterFactory("!EventTrace.", delegate { return this; }); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why do the names of the types begin with ! and end with .? If it is just to avoid collision, we should not worry about this, as the namespace is just THIS FILE and no more. I would prefer the names 'EventTrace' and 'EventBlock' (I could even be talked into simply 'Trace' and 'EventBlock' as these are the names you would use if you had to come up with names that need only be unique in this file. (We may end up with other kinds of block, and frankly 'Block' is not descriptive enough on its own'.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am not very proud of this, but..
Every object is serialized as:
<BeginObject> size is 1
<BeginObject> size is 2
<NullReference> size is 3
<objectVersion> size is 7
<minReaderVersion> size is 11
<stringLength> size is 15
<string> size is 15 + stringLength
</EndObject> size is 16 + stringLength
<Content /> <-- this must be aligned, so stringLength has to be aligned too ;)
</EndObject>
So I decided to enforce a rule that type name length must be a multiplication of 4. This + padding the content ensures that everything is aligned. I know that it sounds stupid, but I did not have better idea. The other idea for names I had was EventBlock4B
@vancem is that acceptable?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What you described is SUPER subtle and fragile (any change will break alignment). You don't want one piece code (what goes in the headers) to be responsible for a subtle invariant needed later on (the alignment of a block).
In general, whenever you write a piece of code, you should be thinking about some programmer (who might by you), a year later trying to figure out your code, and WANTING TO LEARN AS LITTLE AS POSSIBLE about it to get his job done. Thus you want everything to be 'obvious' and 'explicit' as possible. Being 'smart' is a BAD idea unless there is a compelling reason to (e.g. it improves perf in a non-trivial way, and then you have to pay that 'smartness' by documenting it) Sometimes we fall short, but this is the goal.
Given that you added a new 'Padding' tag, I figured what you did was add these to the 'tail' of the object before. But thinking about it, I don't like that either, because it places the burden of alignment on the PREVISOUS object, and it is also pretty subtle (you need to know that there will be an end-object byte afterward and adjust for that).
All of that is too subtle. The 'obvious' thing to do is to 'align when you need it' INSIDE objects. Thus where we need alignment is INSIDE the EventBlock. Thus after emitting the block size, the writer will simply emit 0 to 3 bytes until the output pointer is 4 byte aligned (this counts as part of the payload). Similarly, the reader, after reading the block size will read bytes until its input pointer is 4 byte aligned. This is just a few lines of code, and makes alignment very clear and explicit (frankly you don't need asserts, since you FORCED the alignment at that point.
Notice this approach is robust (no matter what is happening OUTSIDE this object, you get the alignment you want. It is LOCAL (only code associated with EventBlock needs to care, and it is EXPLICIT (there is code (near where you would care), that has the 'job' of providing alignment and programmers can see it (it is not a subtle ramification of something that seems to have another purpose).
I would like us to use this approach (or something equally 'obvious') and then you can rip out the subtle names ! and . in the names).
Seems like you also don't need to change IFastSerializable either.
|
||
// Because we told the deserialize to use 'this' when creating a EventPipeFile, we | ||
// expect the entry object to be 'this'. | ||
Debug.Assert(entryObj == this); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why did we remove the assert? (of course it goes after 'entryObj' is defined, but it is still correct and valuable (as well as the comment.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I removed this assert because I implemented unit tests to verify the correct behavior of this code. (CanParseHeaderOfV1EventPipeFile and CanParseHeaderOfV3EventPipeFile)
imho we should have more unit tests and fewer debug.asserts (I understand that we use Debug.Assert here as a debug-only code contracts) to make the code more easy to understand and maintain.
I reverted this change.
{ | ||
// Guess that the event is < 1000 bytes or whatever is left in the stream. | ||
int eventSizeGuess = Math.Min(1000, _endOfEventStream.Sub(reader.Current)); | ||
nint eventSizeGuess = |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As a rule I don't like to use ?: except in cases where it clearly fits on a line. For everything else use if statements.
However I think I was trying to be too smart and just made the code complex. Time to simplify. For the first call to Get Pointer simply use EventPipeHeaderSize. Then make the call to GetPointer down below unconditional. This is simpler and probably as efficient as what is there now.
@@ -8,11 +10,15 @@ | |||
using System.Diagnostics; | |||
using System.Runtime.InteropServices; | |||
|
|||
using nint = System.Int32; // as of today the deserializer is limited to 32 bit, in case we change in the future we don't want to miss any cast |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't bother with nint. While we might relax the serialization so that the TOTAL size of the file is > 4Gig (in that case StreamLabel needs to be bigger), nint today is only used to measure the size of single event, and I think we can safely assume we are not likely to make THAT bigger than 4Gig.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
you are right, it was over-engineering
Debug.Assert(0 < eventData->TotalEventSize && eventData->TotalEventSize < 0x20000); // TODO really should be 64K but BulkSurvivingObjectRanges needs fixing. | ||
|
||
nint eventDataEndIncludingAlignmentPadding = (nint)reader.Current + eventData->TotalEventSize; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be
StreamLabel eventDataEndIncludingAlignmentPadding = reader.Current.Add(eventData->TotalEventSize);
For what it is worth, I think you could just call it eventDataEnd be clear. This is especially true because there is no padding in the code for an individual event (each event should start on a 4 byte boundary, and everything in the event is also a multiple of 4 bytes, so you can ignore alignment issues).
EventPipeEventHeader* eventData = (EventPipeEventHeader*)reader.GetPointer(eventSizeGuess); | ||
|
||
// Basic sanity checks. Are the timestamps and sizes sane. | ||
Debug.Assert(sessionEndTimeQPC <= eventData->TimeStamp); | ||
Debug.Assert(sessionEndTimeQPC == 0 || eventData->TimeStamp - sessionEndTimeQPC < _QPCFreq * 24 * 3600); | ||
Debug.Assert(0 <= eventData->PayloadSize && eventData->PayloadSize <= eventData->TotalEventSize); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would also assert that that the Version < 3 or
eventData->Payload is 4 byte aligned. and eventData->PayloadSize is a multiple of 4.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good idea!
{ | ||
deserializer.Read(out pointerSize); | ||
deserializer.Read(out _processId); | ||
deserializer.Read(out numberOfProcessors); | ||
deserializer.Read(out _expectedCPUSamplingRate); | ||
// TODO: alig |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can remove the TODO.
_eventRecord = (TraceEventNativeMethods.EVENT_RECORD*)Marshal.AllocHGlobal(sizeof(TraceEventNativeMethods.EVENT_RECORD)); | ||
ClearMemory(_eventRecord, sizeof(TraceEventNativeMethods.EVENT_RECORD)); | ||
|
||
if (pointerSize == 4) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is another case where I prefer using if than ?:. (because it does not fit on a line) In general ?: is hard to read unless it is really short. Yes, you are cloning the _eventRecord->EventHeader.Flags part but that is OK.
-
_eventRecord->EventHeader.Flags = TraceEventNativeMethods.EVENT_HEADER_FLAG_64_BIT_HEADER;
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have simplified it to: (ushort)(pointerSize * 8)
// Dispatch through all the events. | ||
PinnedStreamReader deserializerReader = (PinnedStreamReader)deserializer.Reader; | ||
|
||
// Align to a 4 byte boundary |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I assume that the way this works is that you aligned the EventPipeEventBlock as a whole so there was no need to do alignment inside the block.
You can simply delete the *ptr variable however (since it is not used except for asserts), and add an assert that _startEventData is 4 byte aligned above).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I put a few relatively minor requests for change, but on the whole it looks fine.
namespace Microsoft.Diagnostics.Tracing | ||
{ | ||
/// <summary> | ||
/// EventPipeEventSource knows how to decode EventPipe (generated by the .NET | ||
/// core runtime). | ||
/// core runtime). Please see |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nit, looks like the 'Please see' is incomplete.
_eventRecord->EventHeader.Flags = TraceEventNativeMethods.EVENT_HEADER_FLAG_32_BIT_HEADER; | ||
else | ||
_eventRecord->EventHeader.Flags = TraceEventNativeMethods.EVENT_HEADER_FLAG_64_BIT_HEADER; | ||
_eventRecord->EventHeader.Flags = (ushort)(pointerSize * 8); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please don't set the flags this way (I prefer the original). The issue is that the code is now COMPLETELY obscure (you have to know the specific values of EVENT_HEADER_FLAG_*_BIT_HEADER, to understand why this work. This is the kind of thing you let optimizer do.
But more to the point, it is my JOB to do performance investigations/improvements and the very first thing I tell people, is that generally speaking 90+% of all your code is NOT performance sensitive and you should be writing your code to maximize clarity, and maintainability and other metrics. Only in that last < 10% (it is usually more like 1%) of your code does it matter.
So is this the hot 10%? The answer is easy: no. This code only gets called when metadata events are seen, and those should be probably less than 1% of the trace. Thus without even doing a measurement I can tell you that this code is NOT the hot 10% and thus we should be optimizing for clarity.
Now you can debate me whether the 'if' or the ?: is clearer, but I would generally argue that if you have the choice between using a rare construct/pattern and using a common one/cliche, you should use the common one. 'if' is MUCH more common that ?:, and thus should be preferred unless there is a clear advantage (e.g. shrinking 4 lines to 1 is reasonable advantage, but that only works if resulting line is short. If you have to break it up again, you lose the advantage.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I like micro-optimizations, but not that much ;)
For me, it was the just simplest way to write it.
I reverted the change.
@vancem done |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The only changes I want are
- Make the type names normal names.
- Make the alignment explicit in EventBlock. (or some equally simple/explicit way of doing it).
|
||
#endif | ||
_deserializer.RegisterFactory("!EventTrace.", delegate { return this; }); | ||
_deserializer.RegisterFactory("!EventBlock.", delegate { return new EventPipeEventBlock(this); }); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These names should just be probably 'Trace' and 'EventBlock'.
@adamsitnik I want to thank you for your work on this PR. I know that I can be a bit of a pain, and probably more pedantic than is good, but I am very pleased with the result, and want to thank you for your work on this. Good Job! |
@vancem Big thanks for the review. And patience! I can see that I can learn a LOT from you. |
Support for dotnet/coreclr#16107