Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Limit address sets to include at most one string pointer #808

Merged
merged 10 commits into from
Aug 3, 2022

Conversation

jerhard
Copy link
Member

@jerhard jerhard commented Aug 1, 2022

This PR changes the abstraction of a string pointer from StrPtr of string to StrPtr of string option. Now, a StrPtr None abstracts any possible string pointer.

As there is the possibility that there may the same string in the binary multiple times, the function is_definite was changed so that is_definite (StrPtr _) now returns false.

Closes #807

@michael-schwarz michael-schwarz self-requested a review August 1, 2022 14:00
@sim642 sim642 self-requested a review August 1, 2022 14:29
Copy link
Member

@michael-schwarz michael-schwarz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this behavior should be configurable via some option. Apart from that, it looks good!

@jerhard
Copy link
Member Author

jerhard commented Aug 1, 2022

I think this behavior should be configurable via some option. Apart from that, it looks good!

Should this option be active by default or not? Probably makes sense to enable it for the defaults configurations for large and mid-sized programs.

@sim642
Copy link
Member

sim642 commented Aug 2, 2022

I think this behavior should be configurable via some option.

Having an option is good indeed since we still have some use cases for distinguishing string pointers. Given the choice now, it looks to me like a choice between unit domain and flat domain (of strings) for StrPtr. Maybe it could even be implemented as such with an option like ana.base.strings.domain?

@jerhard
Copy link
Member Author

jerhard commented Aug 2, 2022

Given the choice now, it looks to me like a choice between unit domain and flat domain (of strings) for StrPtr.

The flat domain of strings would, as-is, result in the behavior that limits address sets to contain at most one string pointer: The join of two different non-bottom elements yields Top in the flat lattice. The unit lattice would not be able to track any string pointer.

In case we want to implement the tracking of string pointers via other domains, one would require a flat-domain that raises a Lattice.Uncomparable when joining different non-bottom elements, if we want to keep the current behavior as an option.

@sim642
Copy link
Member

sim642 commented Aug 2, 2022

The flat domain of strings would, as-is, result in the behavior that limits address sets to contain at most one string pointer: The join of two different non-bottom elements yields Top in the flat lattice. The unit lattice would not be able to track any string pointer.

In case we want to implement the tracking of string pointers via other domains, one would require a flat-domain that raises a Lattice.Uncomparable when joining different non-bottom elements, if we want to keep the current behavior as an option.

True, it'd be a flat lattice without lifted bottom and top. Lattice.Fake would be like that.

Copy link
Member

@michael-schwarz michael-schwarz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think as it currently is is a good state. Replacing the string components with a further lattice only makes the code harder to read IMO.

@michael-schwarz michael-schwarz added unsound performance Analysis time, memory usage labels Aug 2, 2022
@jerhard
Copy link
Member Author

jerhard commented Aug 2, 2022

This assert currently succeeds, but in the concrete there's nothing saying the compiler must use the same constant.

In this PR, this asserts is unknown, as it should be, as I changed the is_definite (StrPtr _) to return false; as I mentioned above.

@michael-schwarz
Copy link
Member

Yes, sorry, I did not read your comment above closely enough!

@michael-schwarz michael-schwarz merged commit 26fd6ce into master Aug 3, 2022
@michael-schwarz michael-schwarz deleted the limit_string_pointers branch August 3, 2022 07:20
michael-schwarz added a commit that referenced this pull request Aug 3, 2022
@jerhard
Copy link
Member Author

jerhard commented Aug 3, 2022

With this option activated (by default), and the following command line, goblint terminates in 11:45h:

/goblint ../../benchmarks/sqlite-amalgamation-3370200/sqlite3.c ../../benchmarks/sqlite-amalgamation-3370200/sqlite3.h ../../benchmarks/sqlite-amalgamation-3370200/sqlite3ext.h ../../benchmarks/sqlite-amalgamation-3370200/shell.c -v --set pre.cppflags[+] -DSQLITE_DEBUG --disable ana.base.context.non-ptr --disable ana.int.def_exc --disable sem.unknown_function.spawn --set ana.thread.domain plain --enable exp.earlyglobs --set ana.base.privatization none --set pre.cppflags[+] -DGOBLINT_NO_BSEARCH --set pre.cppflags[+] -DGOBLINT_NO_ASSERT --set result json-messages --set ana.activated "['base', 'mallocWrapper']" --set ana.ctx_insens[+] base --set ana.ctx_insens[+] mallocWrapper

With the option deactivated, Goblint still has not terminated after running 17:51h.

@sim642 sim642 mentioned this pull request Aug 3, 2022
6 tasks
@sim642 sim642 added this to the v2.0.0 milestone Aug 12, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
performance Analysis time, memory usage unsound
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Only allow one StrPtr in addressets
3 participants