Small convenience improvements for Instrumentation API #24

smarr · 2016-01-29T17:35:57Z

No description provided.

This is a concenience method to correct the tags of a source section at a later point, for instance during paring. In my handwritten recursive descent parser, I don't know the complete context during parsing (i could pass it in, but that would make things more complicated), so, I would like to be able to change the tags of a subexpression when I am back in the parent expression where I assemble the AST. Signed-off-by: Stefan Marr <git@stefan-marr.de>

chumer · 2016-01-29T17:44:30Z

truffle/com.oracle.truffle.api/src/com/oracle/truffle/api/source/SourceSection.java

                            (identifier != null ? " identifier=" + identifier : "") + " code=" + getCode();
        }
+        if (tags != null && tags.length > 0) {
+            result = " tags: " + Arrays.toString(tags);


this should be a += ,right?

How did that ever work? hm, probably it didn't. yes, you are right. will fix.

Show the tags in the string representation, helpful for debugging. Signed-off-by: Stefan Marr <git@stefan-marr.de>

smarr · 2016-01-31T18:02:12Z

I think this needs more discussion before merging.
Points raised elsewhere:

should tags be kept here as a mutable field?
should source sections be unique?

For the last point, this is something I wonder, but doesn't seem to make a lot of sense. Unique source sections, i.e., only one object and one set of tags for a give range. This seems problematic because it might be language dependent or specific to parser/AST structure. Also overlapping source sections are easily constructed, also for identical range. Would be hard to guarantee uniqueness.

jtulach · 2016-02-01T09:42:01Z

truffle/com.oracle.truffle.api/src/com/oracle/truffle/api/source/SourceSection.java

@@ -292,6 +299,16 @@ public boolean equals(Object obj) {
    }

    /**


Missing Javadoc.

chumer · 2016-02-01T11:15:50Z

should tags be kept here as a mutable field?

No. SourceSection must be immutable. It is used in HashMaps as key a lot. Since nodes are mutable already its not a big deal if Node#sourceSection is mutable instead.

should source sections be unique?

Unique for a particular Source, or globally unique? I don't think we need that. A proper Equality check is good enough I believe.

mlvdv · 2016-02-01T22:42:10Z

@chumer you mentioned using SourceSection in HashMaps, but haven't responded to my question in the instrumentation review about how tags affect equality. The current overrides of hashcode() and equals() ignore tags.

smarr · 2016-02-01T22:46:17Z

For my use cases, I actually added tags to the hash (prime * result + Arrays.hashCode(tags)) and to the quality (return Arrays.equals(tags, other.tags);).

With respect to the 'uniqueness' discussion above, I feel that makes most sense.

mlvdv · 2016-02-01T23:13:52Z

Might adding tags to the hash make SourceSection unsuitable for building maps of source locations that ignore tags? In the new Instrumentation code there is no longer any way to "remember" locations in guest language code other than SourceSection querying. So if the hashcode includes tags, then changing tags causes existing queries stop matching places where they originally matched, and this is true for all clients.

I continue to see evidence that forcing storage of tags into the SourceSection is inappropriate. A "tag" is conceptually a different kind of information, as I've described in a Skype chat about Instrumentation. It doesn't say anything fundamental about the program, only that some tool has been instructed to behave a certain way in this particular situation. Some clients will care about this, others will not. We're already discovering that the binding times for the two kinds of information are quite different; the need to "replace" information that has already been recorded is a clue.

smarr · 2016-02-02T09:28:29Z

Hm, source section querying? Do you have an example for what that is used?

Building the DynamicMetrics tool, I actually found a couple of things that surprised me.

For instance, I annotated message sends, and I annotated field reads. In Newspeak (SOMns) there is however no separate syntax for field reads. So, I end up with two different source sections for the same piece of source code. One annotated as message send, and one annotated as field read.

In the tool that actually works nicely. I render field reads as blue, and sends as italic. That combines nicely. And, as a reader of this highlighted code, I get a lot of information.

However, some might say, I abuse the instrumentation framework by exposing such dynamic information. But I tend to disagree. It works very nicely for me.

But back to the question, in such a scenario, what would you be querying for, and which of the source sections would you want? or perhaps both?

mlvdv · 2016-02-05T18:26:44Z

Source section querying:

The user sets a breakpoint (e.g. "the statement on line 42 in some source") and expects feedback whether the specified location actually exists. "Don't know yet" is sometimes the right answer. The debugger must (and already does) do extra work to manage "unresolved" breakpoints for files that haven't been loaded. But the feedback is an important usability issue. If the user thinks the location exists, but incorrectly, behavior is mysterious: an instance of the well-known "hidden state" usability failure.

mlvdv · 2016-02-05T18:41:44Z

For instance, I annotated message sends, and I annotated field reads. In Newspeak (SOMns) there is however no separate syntax for field reads. So, I end up with two different source sections for the same piece of source code. One annotated as message send, and one annotated as field read.

This came up when I was working with the Irvine Python people: two nodes can (correctly) have the same SourceSection. In your case, querying with one tag or the other gets you the right location.
Getting multiple "locations" is also a reasonable result that can reveal something in the case you mention, for example if your query had both tags or just specified "any" tag.

I'll propose a stronger requirement: a client must be able to distinguish between the case where (when all SourceSections are the same) a single node has two tags and where two separate node each have one of the tags.

mlvdv · 2016-02-05T18:53:38Z

However, some might say, I abuse the instrumentation framework by exposing such dynamic information. But I tend to disagree. It works very nicely for me.

I strongly disagree with what "some might say"! I've been trying to get information out of source code and in front of programmers usefully for many years; that's the whole point of the Instrumentation framework. It is also why I argue constantly for flexibility and openness in the APIs: I named it a "Framework" and I refer to it as a "tool kit". Anybody should be able to create, as easily as possible, developer tools we have not yet imagined.

chumer · 2016-02-09T10:53:33Z

I will add hashCode, toString and equals implementation to SourceSection for tags. I missed that, thanks.
A cloneWithTags makes sense as well will add that. But with a little more documentation and testing.

Ok if I close this pull-request?

@mlvdv with the EventNodeFactory you can map mulitple "AST location" to one and the same SourceSection. If you care about same SourceSections beeing different in a Map you can use an IdentityHashMap in your tool implementation.

smarr · 2016-02-09T11:00:18Z

Ok, closing it.

jtulach · 2016-02-09T15:11:32Z

Veto: I am against the name cloneWithTags, I am OK with withTags.

chumer · 2016-02-09T15:17:34Z

Ok. Will add withTags.

mlvdv · 2016-02-17T20:00:18Z

@chumer

@mlvdv with the EventNodeFactory you can map mulitple "AST location" to one and the same SourceSection. If you care about same SourceSections beeing different in a Map you can use an IdentityHashMap in your tool implementation.

If I were to use an IdentityHashMap in that fashion, what guarantee would I be depending on about SourceSection identity:

where would this guarantee be documented?
how would this guarantee be enforced for language implementations, since they are responsible for creating and assigning SourceSection objects?

…dary to master * commit 'f5dac94d8934f7d0425e3ebf469d31dff82cefb5': Call ThreadLocal<ContextStore>.get() via TruffleBoundary

* Fix writing of unnamed module * Ignore empty package; serialize null-class-loader as 'bootstrap' * Fix checkstyle complaint * Handle null string in JfrSymbolRepository as 0 ID/reference * Don't special-case 'null' module name, serialize it as 0

Mark suite files for 23.1.5 release

smarr added the feature label Jan 29, 2016

smarr assigned chumer Jan 29, 2016

chumer reviewed Jan 29, 2016
View reviewed changes

Add tags to SourceSection.toString()

3a51395

Show the tags in the string representation, helpful for debugging. Signed-off-by: Stefan Marr <git@stefan-marr.de>

smarr force-pushed the feature/instrumentation_api branch from 6694614 to 3a51395 Compare January 29, 2016 18:19

jtulach reviewed Feb 1, 2016
View reviewed changes

thomaswue added the oracle-emp label Feb 1, 2016

smarr added the on hold label Feb 2, 2016

smarr closed this Feb 9, 2016

dougxc pushed a commit that referenced this pull request Apr 22, 2016

Merge pull request #24 in G/truffle from paw/fix--missing-truffle-bun…

4ee22d1

…dary to master * commit 'f5dac94d8934f7d0425e3ebf469d31dff82cefb5': Call ThreadLocal<ContextStore>.get() via TruffleBoundary

smarr deleted the feature/instrumentation_api branch June 22, 2016 13:26

brunoborges unassigned chumer Jan 8, 2018

simonis pushed a commit to simonis/graal that referenced this pull request Nov 20, 2024

Merge pull request oracle#24 from zakkak/2024-10-17-october-cpu

566dab1

Mark suite files for 23.1.5 release

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Small convenience improvements for Instrumentation API #24

Small convenience improvements for Instrumentation API #24

smarr commented Jan 29, 2016

chumer Jan 29, 2016

smarr Jan 29, 2016

smarr commented Jan 31, 2016

jtulach Feb 1, 2016

chumer commented Feb 1, 2016

mlvdv commented Feb 1, 2016

smarr commented Feb 1, 2016

mlvdv commented Feb 1, 2016

smarr commented Feb 2, 2016

mlvdv commented Feb 5, 2016

mlvdv commented Feb 5, 2016

mlvdv commented Feb 5, 2016

chumer commented Feb 9, 2016

smarr commented Feb 9, 2016

jtulach commented Feb 9, 2016

chumer commented Feb 9, 2016

mlvdv commented Feb 17, 2016

		@@ -292,6 +299,16 @@ public boolean equals(Object obj) {
		}

		/**

Small convenience improvements for Instrumentation API #24

Small convenience improvements for Instrumentation API #24

Conversation

smarr commented Jan 29, 2016

chumer Jan 29, 2016

Choose a reason for hiding this comment

smarr Jan 29, 2016

Choose a reason for hiding this comment

smarr commented Jan 31, 2016

jtulach Feb 1, 2016

Choose a reason for hiding this comment

chumer commented Feb 1, 2016

mlvdv commented Feb 1, 2016

smarr commented Feb 1, 2016

mlvdv commented Feb 1, 2016

smarr commented Feb 2, 2016

mlvdv commented Feb 5, 2016

mlvdv commented Feb 5, 2016

mlvdv commented Feb 5, 2016

chumer commented Feb 9, 2016

smarr commented Feb 9, 2016

jtulach commented Feb 9, 2016

chumer commented Feb 9, 2016

mlvdv commented Feb 17, 2016