Debug info: Fixup value out of range #930

kinke · 2015-05-10T10:01:25Z

Environment: Win64, VS 2015, LDC merge-2.067 (+ my VS15 patches + kinke/druntime@1add4f0), LLVM master (5de99601367aaf09ba1d17cd51478bde5e1b74fe).

All druntime debug unittests compile fine and all Phobos ones too, except for std.string and only when using the -g switch to generate debug infos. This snippet:

import std.conv, std.string, std.typetuple;

void invoke(alias dg)() { dg(); }

void main()
{
    invoke!({
    foreach (S; TypeTuple!( char[], const( char)[], immutable( char)[],
                           wchar[], const(wchar)[], immutable(wchar)[],
                           dchar[], const(dchar)[], immutable(dchar)[]))
    {
        foreach (T; TypeTuple!( char[], const( char)[], immutable( char)[],
                               wchar[], const(wchar)[], immutable(wchar)[],
                               dchar[], const(dchar)[], immutable(dchar)[]))
        (){
            foreach(R; TypeTuple!(dchar[dchar], const dchar[dchar],
                        immutable dchar[dchar]))
            {
                R tt = ['h' : 'q', 'l' : '5'];
                assert(translate(to!S("hello world"), tt, to!T("r"))
                    == to!S("qe55o wo5d"));
                assert(translate(to!S("hello world"), tt, to!T("helo"))
                    == to!S(" wrd"));
                assert(translate(to!S("hello world"), tt, to!T("q5"))
                    == to!S("qe55o wor5d"));
            }
        }();
    }
    });
}

yields the following when using the -g switch, and otherwise compiles fine:

Assertion failed: isIntN(Size * 8 + 1, Value) && "Value does not fit in the Fixup field", file C:\LDC\llvm\lib\Target\X86\MCTargetDesc\X86AsmBackend.cpp, line 120
    ldc2.exe!`anonymous namespace'::X86AsmBackend::applyFixup(const llvm::MCFixup & Fixup, char * Data, unsigned int DataSize, unsigned __int64 Value, bool IsPCRel) Line 119   C++
    ldc2.exe!llvm::MCAssembler::Finish() Line 962   C++
    ldc2.exe!llvm::MCObjectStreamer::FinishImpl() Line 433  C++
    ldc2.exe!llvm::MCWinCOFFStreamer::FinishImpl() Line 268 C++
    ldc2.exe!`anonymous namespace'::X86WinCOFFStreamer::FinishImpl() Line 48    C++
    ldc2.exe!llvm::MCStreamer::Finish() Line 583    C++
    ldc2.exe!llvm::AsmPrinter::doFinalization(llvm::Module & M) Line 1154   C++
    ldc2.exe!llvm::FPPassManager::doFinalization(llvm::Module & M) Line 1575    C++
    ldc2.exe!`anonymous namespace'::MPPassManager::runOnModule(llvm::Module & M) Line 1632  C++
    ldc2.exe!llvm::legacy::PassManagerImpl::run(llvm::Module & M) Line 1722 C++
    ldc2.exe!llvm::legacy::PassManager::run(llvm::Module & M) Line 1756 C++
    ldc2.exe!codegenModule(llvm::TargetMachine & Target, llvm::Module & m, llvm::raw_fd_ostream & out, llvm::TargetMachine::CodeGenFileType fileType) Line 126  C++
    ldc2.exe!writeModule(llvm::Module * m, std::basic_string<char,std::char_traits<char>,std::allocator<char> > filename) Line 527  C++
    ldc2.exe!main(int argc, char * * argv) Line 1361    C++

We have a llvm::MCFixup of kind FK_Data_2 (16 bits), but its value is bigger (78893 in my case).

The text was updated successfully, but these errors were encountered:

kinke · 2015-05-10T10:37:59Z

When excluding std.string from the Phobos test runner, 91% of the debug druntime+Phobos unittests pass, the following 26 fail:

std.array-debug
std.bigint-debug
std.csv-debug
std.datetime-debug
std.encoding-debug
std.math-debug
std.parallelism-debug
std.path-debug
std.process-debug
std.socket-debug
std.stream-debug
std.traits-debug
std.uni-debug
std.uri-debug
std.zip-debug
std.zlib-debug
std.algorithm.sorting-debug
std.digest.crc-debug
std.digest.md-debug
std.digest.ripemd-debug
std.digest.sha-debug
std.net.isemail-debug
std.regex.internal.parser-debug
std.regex.internal.tests-debug
std.regex-debug
std.internal.math.gammafunction-debug

redstar · 2015-05-11T05:21:04Z

This is an LVVM bug. You should create a .ll from the D code and run bugpoint on it. The reduced file can then be attached to an LLVM bug report.

kinke · 2015-05-12T20:34:08Z

Hmm, not sure at which point this happens. ldc2 -g -c crashes, but it's able to produce the corresponding .ll, .bc and .s files. bugpoint -run-llc tells me the .bc is fine, and opt is happily working too. bugpoint seems to be pretty limited without gcc...

dnadlinger · 2015-05-12T20:35:52Z

Are you running opt with the same set of passes? (~~Try -std-compile-opts for a start~~, and passing -debug-pass=Arguments to LDC might be useful. Edit: Hmm, I see, this occurs in debug mode).

You can actually use bugpoint to link an executable with whatever options you need, but it's been a while since I last did that. It won't be relevant here anyway.

kinke · 2015-05-12T20:45:36Z

I've used '-O3', now checking in combination with -std-link-opts (there's no -std-compile-opts) [edit: that works fine]. Thx for the quick response.

kinke · 2015-05-22T22:19:52Z

Could this be related to #442? The mentioned LLVM/clang bug seems to be..

redstar · 2015-05-24T10:02:09Z

Yes, seems to be the same bug.

kinke · 2016-03-04T18:04:15Z

IR in case someone like @majnemer would be interested: link

majnemer · 2016-03-04T23:16:18Z

@kinke File is busted for me, I get:
fixup.ll:1484:1: error: expected instruction opcode

kinke · 2016-03-04T23:52:28Z

Thanks for looking into this too! Line 1484 would still be part of the huge display name extending from the previous line all the way down to line 3963!
Not sure why the .ll file appears to be corrupt. I'll upload a zipped version incl. the bitcode file if that's of any use... here you go.

majnemer · 2016-03-05T00:44:05Z

@kinke Thanks!

kinke · 2016-03-05T11:26:47Z

@majnemer With an updated LDC and LLVM (git dbe4385), I now hit another assertion in another place:

Assertion failed: (isUIntN(8 * Size, Value) || isIntN(8 * Size, Value)) && "Invalid size", file C:\LDC\llvm\lib\MC\MCStreamer.cpp, line 82
#5 0x000007f89044116f _wassert (C:\Windows\SYSTEM32\ucrtbase.DLL+0x6116f)
#6 0x000007f7dd15b56b llvm::MCStreamer::EmitIntValue(unsigned __int64,unsigned int) c:\ldc\llvm\lib\mc\mcstreamer.cpp:84:0
#7 0x000007f7dd767c4c llvm::CodeViewDebug::emitTypeInformation(void) c:\ldc\llvm\lib\codegen\asmprinter\codeviewdebug.cpp:278:0
#8 0x000007f7dd768349 llvm::CodeViewDebug::endModule(void) c:\ldc\llvm\lib\codegen\asmprinter\codeviewdebug.cpp:206:0
#9 0x000007f7dd7360cd llvm::AsmPrinter::doFinalization(class llvm::Module &) c:\ldc\llvm\lib\codegen\asmprinter\asmprinter.cpp:1124:0
#10 0x000007f7dcea732a llvm::FPPassManager::doFinalization(class llvm::Module &) c:\ldc\llvm\lib\ir\legacypassmanager.cpp:1589:0
#11 0x000007f7dceaf919 llvm::FPPassManager::runOnModule(class llvm::Module &) c:\ldc\llvm\lib\ir\legacypassmanager.cpp:1645:0
#12 0x000007f7dceaebf6 llvm::legacy::PassManagerImpl::run(class llvm::Module &) c:\ldc\llvm\lib\ir\legacypassmanager.cpp:1731:0
#13 0x000007f7dc9f357f llvm::VectorType::classof(class llvm::Type const *) c:\ldc\ldc\driver\toobj.cpp:96:0
#14 0x000007f7dc9f7745 writeModule(class llvm::Module *,class std::basic_string<char,struct std::char_traits<char>,class std::allocator<char> >) c:\ldc\ldc\driver\toobj.cpp:443:0
#15 0x000007f7dc9e50a6 ldc::CodeGenerator::writeAndFreeLLModule(char const *) c:\ldc\ldc\driver\codegenerator.cpp:120:0
#16 0x000007f7dc9e4064 ldc::CodeGenerator::finishLLModule(class Module *) c:\ldc\ldc\driver\codegenerator.cpp:96:0
#17 0x000007f7dc9e3f04 ldc::CodeGenerator::emit(class Module *) c:\ldc\ldc\driver\codegenerator.cpp:158:0
#18 0x000007f7dca0ad0d main c:\ldc\ldc\driver\main.cpp:1284:0

The interesting location is CodeViewDebug.cpp:277, which limits the length of the display name to a little less than 64K. In our case, it's about 80K...

rainers · 2016-03-05T11:59:27Z

BTW: DMD (the reference D compiler) trims CodeView symbol names to 0xffd8, as the linker bails out with larger names anyway: https://github.com/D-Programming-Language/dmd/blob/master/src/backend/cv8.c#L45

majnemer · 2016-03-06T10:52:05Z

For the clang/c++ side of things, this is taken care of by an interesting trick: all symbols > 4k get their linkage name md5 hashed. The has is then substituted for the linkage name.

I do not like the DMD approach to this because it silently drops the end of the linkage name. Instead, I'd rather have us skip functions with names that are too large instead of pretending to support the behavior.

Is there a reason why using a scheme akin to the MSVC one would be undesirable for LDC?

rainers · 2016-03-06T13:04:24Z

For the clang/c++ side of things, this is taken care of by an interesting trick: all symbols > 4k get their linkage name md5 hashed. The has is then substituted for the linkage name.

DMD does similar for the ancient OMF object files: if the (compressed) symbol does not fit into 128(!) characters, it is cut off around 100 and the MD5 hash is appended. I guess we should do something similar for symbols in COFF files.

Is there a reason why using a scheme akin to the MSVC one would be undesirable for LDC?

You mean the compression of symbols with backward references? This should be possible, but the C++ version is rather limited IIRC. OMF symbol compression uses LZW or similar, but generates undecodable symbols in case the name contains non-ASCII characters (D allows any "universal alphas").

The biggest downside to changing the mangling is probably that it has to creep into a number of build tools supporting only the current one.

A limitation of the MS linker and enforced by LLVM. Fixes issue ldc-developers#930.

A limitation of the MS linker/COFF format and enforced by LLVM. Fixes issue ldc-developers#930.

kinke · 2016-03-12T22:46:02Z

I'm not closing this yet as I've hit this old assertion just now again with bleeding edge LLVM, for the std.algorithm.setops unittests with debug infos.

majnemer · 2016-03-12T23:25:28Z

@kinke Could you post the failing IR?

kinke · 2016-03-13T00:31:02Z

It's 110 MB last time I checked. ;)

majnemer · 2016-03-13T00:58:08Z

The size isn't a problem for me, I'd still be more than happy to take a look.

kinke · 2016-03-13T01:00:46Z

But compressed it's only 1.4 MB.
This compiles fine with git dbe4385 from about one week ago. Oh and I got this for today's git a5b740a.

majnemer · 2016-03-13T07:51:44Z

Figured it out.
Your IR has a function called _D3std9algorithm6setops342__T16cartesianProductTS3std5range57__T8SequenceVAyaa1_6eTS3std8typecons12__T5TupleTiZ5TupleZ8SequenceTS3std5range57__T8SequenceVAyaa1_6eTS3std8typecons12__T5TupleTiZ5TupleZ8S...
The size of the function's name is 139368 bytes and it appears not to have a DISubprogram associated with it.
Because it has no DISubprogram, we use the linkage name for the display name.
This makes the S_GPROC32_ID's record size greater than the maximum of 0xFFFF.

majnemer · 2016-03-13T10:59:37Z

I ended up fixing this LLVM r263378. We will perform the truncation regardless of where we got the name, you can probably remove the truncation logic on your side.

kinke · 2016-03-13T13:52:22Z

The size of the function's name is 139368 bytes

😜

it appears not to have a DISubprogram associated with it

Hmm, has a bad smell to it...

I ended up fixing this LLVM r263378. We will perform the truncation regardless of where we got the name

Makes sense imo. Thanks a lot!

kinke · 2016-03-13T16:22:23Z

@majnemer I've just used current LLVM head. Your fix works for the latest issue, but I still have to truncate the display name of DISubprograms (the original issue). Is that intended?

kinke · 2016-03-13T17:15:30Z

Well the LLVM truncation fix allows to compile the modules, but then the MS linker complains:

setops-unittest-debug.obj : fatal error LNK1103: debugging information corrupt; recompile module
Error: `"C:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\BIN\amd64\link.exe"` failed with status: 1103

rainers · 2016-03-13T17:30:36Z

@majnemer One of the truncations uses 0xfff6 as the upper limit (lib/CodeGen/AsmPrinter/CodeViewDebug.cpp, line 712). This seems too large. @kinke, you might want to try reducing it to 0xffd8, too.

kinke · 2016-03-13T17:52:25Z

Thx Rainer, my machine is already building...

kinke · 2016-03-13T18:22:59Z

Okay so using 0xffd8 in both locations gets rid of the linker complaints (0xffd9 at the other site was already too high). DISubprogram display names still need to be truncated by us as the original one is used here.

majnemer · 2016-03-13T22:25:55Z

You can't use 0xffd9 for the variable side of things? Also, I don't understand the bit about DISubprogram. Does it crash in LLVM, what is the failure moe?

kinke · 2016-03-13T23:45:36Z

You can't use 0xffd9 for the variable side of things?

Yep, otherwise the MS linker would complain for the 'setops' source (x64 at least, not sure about x86).

About DISubprogram: yes I hit this LLVM assertion again when not truncating it on our side.

majnemer · 2016-03-14T00:03:11Z

Can I get IR for the DISubprogram case?

kinke · 2016-03-14T00:07:36Z

It's the 'fixup' one earlier in this thread (#930 (comment)) - I don't think it has changed a lot. If need be I can recreate it after a recompile...

majnemer · 2016-03-14T05:20:56Z

Thanks, I switched to 0xffd8 and fixed the DISubprogram issue in r263408.

kinke · 2016-03-15T11:03:01Z

Thanks! I'll check it when I find some time.

kinke · 2016-03-16T09:50:40Z

I've tested it with yesterday's LLVM, works perfectly without any truncation from our side. Thx!

A limitation of the MS linker/COFF format and enforced by LLVM. Fixes issue ldc-developers#930.

kinke mentioned this issue Aug 20, 2015

OS X: Corrupted EH tables (_Unwind_RaiseException) due to system linker bug #1031

Closed

This was referenced Sep 20, 2015

Collection of remaining Win64 issues (1) #1103

Closed

[2.068] std.datetime unittest compile error #1127

Closed

kinke mentioned this issue Sep 30, 2015

Fix debug infos of nested variables #1134

Merged

rainers mentioned this issue Jan 24, 2016

[WIP] support building for x86 with MSVC runtime #1168

Closed

kinke mentioned this issue Feb 16, 2016

Unittest compilation of std.string crash (Windows): too deep recursion #1297

Closed

kinke added a commit to kinke/ldc that referenced this issue Mar 10, 2016

MSVC: Limit debug info symbol names to ~64k

a072b8a

A limitation of the MS linker and enforced by LLVM. Fixes issue ldc-developers#930.

kinke mentioned this issue Mar 10, 2016

MSVC: Limit debug info symbol names to ~64k #1348

Merged

kinke added a commit to kinke/ldc that referenced this issue Mar 12, 2016

MSVC: Limit debug info display names to ~64k

5ba7ce9

A limitation of the MS linker/COFF format and enforced by LLVM. Fixes issue ldc-developers#930.

kinke closed this as completed Mar 16, 2016

Syniurge pushed a commit to Syniurge/Calypso that referenced this issue Sep 20, 2016

MSVC: Limit debug info display names to ~64k

a04b997

A limitation of the MS linker/COFF format and enforced by LLVM. Fixes issue ldc-developers#930.

Debug info: Fixup value out of range #930

Debug info: Fixup value out of range #930

Comments

kinke commented May 10, 2015

kinke commented May 10, 2015

redstar commented May 11, 2015

kinke commented May 12, 2015

dnadlinger commented May 12, 2015

kinke commented May 12, 2015

kinke commented May 22, 2015

redstar commented May 24, 2015

kinke commented Mar 4, 2016

majnemer commented Mar 4, 2016

kinke commented Mar 4, 2016

majnemer commented Mar 5, 2016

kinke commented Mar 5, 2016

rainers commented Mar 5, 2016

majnemer commented Mar 6, 2016

rainers commented Mar 6, 2016

kinke commented Mar 12, 2016

majnemer commented Mar 12, 2016

kinke commented Mar 13, 2016

majnemer commented Mar 13, 2016

kinke commented Mar 13, 2016

majnemer commented Mar 13, 2016

majnemer commented Mar 13, 2016

kinke commented Mar 13, 2016

kinke commented Mar 13, 2016

kinke commented Mar 13, 2016

rainers commented Mar 13, 2016

kinke commented Mar 13, 2016

kinke commented Mar 13, 2016

majnemer commented Mar 13, 2016

kinke commented Mar 13, 2016

majnemer commented Mar 14, 2016

kinke commented Mar 14, 2016

majnemer commented Mar 14, 2016

kinke commented Mar 15, 2016

kinke commented Mar 16, 2016