debuginfo: Composite type reform and enum support. #7710

michaelwoerister · 2013-07-11T10:59:47Z

This pull request includes various improvements:

Composite types (structs, tuples, boxes, etc) are now handled more cleanly by debuginfo generation. Most notably, field offsets are now extracted directly from LLVM types, as opposed to trying to reconstruct them. This leads to more stable handling of edge cases (e.g. packed structs or structs implementing drop).
debuginfo.rs in general has seen a major cleanup. This includes better formatting, more readable variable and function names, removal of dead code, and better factoring of functionality.
Handling of VariantInfo in ty.rs has been improved. That is, the type VariantInfo = @VariantInfo_ typedef has been replaced with explicit uses of @VariantInfo, and the duplicated logic for creating VariantInfo instances in ty::enum_variants() and typeck::check::mod::check_enum_variants() has been unified into a single constructor function. Both function now look nicer too :)
Debug info generation for enum types is now mostly supported. This includes:
- Good support for C-style enums. Both DWARF and gdb know how to handle them.
- Proper description of tuple- and struct-style enum variants as unions of structs.
- Proper handling of univariant enums without discriminator field.
- Unfortunately gdb always prints all possible interpretations of a union, so debug output of enums is verbose and unintuitive. Neither LLVM nor gdb support DWARF's DW_TAG_variant which allows to properly describe tagged unions. Adding support for this to LLVM seems doable. gdb however is another story. In the future we might be able to use gdb's Python scripting support to alleviate this problem. In agreement with @jdm this is not a high priority for now.
The debuginfo test suite has been extended with 14 test files including tests for packed structs (with Drop), boxed structs, boxed vecs, vec slices, c-style enums (standalone and embedded), empty enums, tuple- and struct-style enums, and various pointer types to the above.

~~What is not yet included is DI support for some enum edge-cases represented as described in trans::adt::NullablePointer.~~

Cheers,
Michael

PS: closes #7819, fixes #7712

jdm · 2013-07-11T14:02:58Z

\o/

I'll work my way through this today, hopefully.

jdm · 2013-07-12T19:58:03Z

One general naming comment: it seems like some functions now use a get_or_create prefix, while others just use create. Is this reflecting the caching nature of their implementation? That doesn't seem important to distinguish to me, and I think names like block_metadata and file_metadata would be descriptive of their purpose.

jdm · 2013-07-12T20:09:31Z

Style nits: I'd like to move away from multi-line unsafe blocks opening on the same line as existing code.

michaelwoerister · 2013-07-13T09:17:31Z

Style nits: I'd like to move away from multi-line unsafe blocks opening on the same line as existing code.

You mean something like this:

do member_name.as_c_str |member_name| { unsafe {
    llvm::LLVMDIBuilderCreateMemberType(
        DIB(cx),
        file_metadata,
        ...,
        0,
        member_type_metadata[i])
}}

would turn into the following?

do member_name.as_c_str |member_name| { 
    unsafe {
        llvm::LLVMDIBuilderCreateMemberType(
            DIB(cx),
            file_metadata,
            ...,
            0,
            member_type_metadata[i])
    }
}

michaelwoerister · 2013-07-13T09:33:32Z

...and I think names like block_metadata and file_metadata would be descriptive of their purpose.

Yeah, I definitely went for verbosity in the naming style. This was certainly influenced by my recent experiences of having to try to understand this code without much prior knowledge or documentation. My thinking behind this was that one should be able to get some insight into what's going on by just reading part of the code, because often a newcomer (or someone doing a quick fix in some code otherwise not known to them) will not be aware of the internal workings of these functions (such as that LLVM will merge duplicate metadata anyway and it is thus no semantic difference whether caching happens or not).

If you find this exceedingly ugly, we can change it of course. But otherwise I would still vote for some redundancy in the naming if it improves readability (even if it impedes writeability a bit).

That being said, I'm not completely satisfied with the naming in the module either. Maybe your approach with just file_metadata (with neither create_ nor get_or_create_ prefix) plus some good documentation in the form of comments would be the best solution.

jdm · 2013-07-15T19:50:06Z

   return create_composite_type_metadata(cx, Type::nil(), enum_name, &[], &[], &[], span);

Are the &s necessary here?

jdm · 2013-07-15T19:52:23Z

With regards to the stage0 thing, I'm inclined to just hold off merging this until we get a new snapshot that makes it irrelevant.

jdm · 2013-07-15T20:26:33Z

There are various instances of code like let variant_name : &str = cx.sess.str_of(variant_info.name);. The rustc style is to put the : immediately after the name.

jdm · 2013-07-15T21:33:05Z

src/librustc/middle/trans/debuginfo.rs

+    // For empty enums there is an early exit. Just describe it as an empty struct with the
+    // appropriate type name
+    if ty::type_is_empty(cx.tcx, enum_type) {
+        return create_composite_type_metadata(cx, Type::nil(), enum_name, &[], &[], &[], span);


Are the &s necessary here?

There is no more StructContext now. Better support for boxed vectors in there too.

…a().

michaelwoerister · 2013-07-19T07:41:52Z

Rebased :)

michaelwoerister · 2013-07-19T09:34:47Z

Sorry about that. Compilation seems to break on mac because some new C functions from RustWrapper.cpp cannot be found by the linker. I added their names to rustllvm.def.in in michaelwoerister@b52eb4a. I hope this fixes the issue. Please re-approve.

…predict for all possible platforms and configurations.

michaelwoerister · 2013-07-20T11:00:31Z

Commit michaelwoerister@a1303cc should make the test cases pass on 32 bit machines. The tests removed were no particularly good idea in the first place.

jdm · 2013-07-20T14:28:28Z

That makes sense.

@jdm

This pull request includes various improvements: + Composite types (structs, tuples, boxes, etc) are now handled more cleanly by debuginfo generation. Most notably, field offsets are now extracted directly from LLVM types, as opposed to trying to reconstruct them. This leads to more stable handling of edge cases (e.g. packed structs or structs implementing drop). + `debuginfo.rs` in general has seen a major cleanup. This includes better formatting, more readable variable and function names, removal of dead code, and better factoring of functionality. + Handling of `VariantInfo` in `ty.rs` has been improved. That is, the `type VariantInfo = @VariantInfo_` typedef has been replaced with explicit uses of @VariantInfo, and the duplicated logic for creating VariantInfo instances in `ty::enum_variants()` and `typeck::check::mod::check_enum_variants()` has been unified into a single constructor function. Both function now look nicer too :) + Debug info generation for enum types is now mostly supported. This includes: + Good support for C-style enums. Both DWARF and `gdb` know how to handle them. + Proper description of tuple- and struct-style enum variants as unions of structs. + Proper handling of univariant enums without discriminator field. + Unfortunately `gdb` always prints all possible interpretations of a union, so debug output of enums is verbose and unintuitive. Neither `LLVM` nor `gdb` support DWARF's `DW_TAG_variant` which allows to properly describe tagged unions. Adding support for this to `LLVM` seems doable. `gdb` however is another story. In the future we might be able to use `gdb`'s Python scripting support to alleviate this problem. In agreement with @jdm this is not a high priority for now. + The debuginfo test suite has been extended with 14 test files including tests for packed structs (with Drop), boxed structs, boxed vecs, vec slices, c-style enums (standalone and embedded), empty enums, tuple- and struct-style enums, and various pointer types to the above. ~~What is not yet included is DI support for some enum edge-cases represented as described in `trans::adt::NullablePointer`.~~ Cheers, Michael PS: closes #7819, fixes #7712

emberian mentioned this pull request Jul 12, 2013

Add debug representation of enums #1339

Closed

jdm reviewed Jul 15, 2013
View reviewed changes

michaelwoerister added 20 commits July 19, 2013 07:55

debuginfo: Refactoring of composite type info generation done.

f424e93

There is no more StructContext now. Better support for boxed vectors in there too.

debuginfo: Replaced vec::mapi with iterator version.

6230ec1

debuginfo: Added test cases for packed structs (/w drop)

99ebb81

debuginfo: Added support for c-style enums.

739f3ee

debuginfo: Support for tuple-style enums (WIP)

f389bd8

debuginfo: Better support for univariant tuple-style enums.

7cf0aac

debuginfo: Added support for struct-style enums.

3b06df4

debuginfo: Fixes related to changed memory layout of unique allocations

77a00cc

Cleanup of ty::VariantInfo and related functions.

12d87d3

debuginfo: Major code cleanup in debuginfo.rs

a33d1b8

debuginfo: Extended test suite with various tests for enums.

70e5c08

debuginfo: DI generation for enums uses adt::represent_type() now.

e0108a4

debuginfo: Fixed unique pointers to data containing managed pointers.

7af2e6e

debuginfo: Added support for Option<T>-like enums.

eed2d0e

debuginfo: Cleaned up style issues for pull request.

b2aeb4b

debuginfo: Adapted DI generation to new memory layout of unique vecs.

e9baeab

debuginfo: Added some documenting comments to debuginfo.rs

a1c5c79

debuginfo: Implemented trait_method branch in create_function_metadat…

72cf2ee

…a().

debuginfo: Fixed issue 7712.

d8c27c3

debuginfo: Fixed some merge fallout.

6aa43c7

debuginfo: Fixed some merge fallout

b52eb4a

debuginfo: Removed some test relying on data structure sizes hard to …

a1303cc

…predict for all possible platforms and configurations.

bors closed this Jul 20, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

debuginfo: Composite type reform and enum support. #7710

debuginfo: Composite type reform and enum support. #7710

Uh oh!

michaelwoerister commented Jul 11, 2013

Uh oh!

jdm commented Jul 11, 2013

Uh oh!

jdm commented Jul 12, 2013

Uh oh!

jdm commented Jul 12, 2013

Uh oh!

michaelwoerister commented Jul 13, 2013

Uh oh!

michaelwoerister commented Jul 13, 2013

Uh oh!

jdm commented Jul 15, 2013

Uh oh!

jdm commented Jul 15, 2013

Uh oh!

jdm commented Jul 15, 2013

Uh oh!

jdm Jul 15, 2013

Uh oh!

michaelwoerister commented Jul 19, 2013

Uh oh!

michaelwoerister commented Jul 19, 2013

Uh oh!

michaelwoerister commented Jul 20, 2013

Uh oh!

jdm commented Jul 20, 2013

Uh oh!

Uh oh!

debuginfo: Composite type reform and enum support. #7710

debuginfo: Composite type reform and enum support. #7710

Uh oh!

Conversation

michaelwoerister commented Jul 11, 2013

Uh oh!

jdm commented Jul 11, 2013

Uh oh!

jdm commented Jul 12, 2013

Uh oh!

jdm commented Jul 12, 2013

Uh oh!

michaelwoerister commented Jul 13, 2013

Uh oh!

michaelwoerister commented Jul 13, 2013

Uh oh!

jdm commented Jul 15, 2013

Uh oh!

jdm commented Jul 15, 2013

Uh oh!

jdm commented Jul 15, 2013

Uh oh!

jdm Jul 15, 2013

Choose a reason for hiding this comment

Uh oh!

michaelwoerister commented Jul 19, 2013

Uh oh!

michaelwoerister commented Jul 19, 2013

Uh oh!

michaelwoerister commented Jul 20, 2013

Uh oh!

jdm commented Jul 20, 2013

Uh oh!

Uh oh!