Fix vararg ABI #14

kinke · 2014-10-24T16:52:33Z

Requires ldc-developers/ldc#768 and addresses ldc-developers/ldc#702 and ldc-developers/ldc#73 (for Win64 only!).

kinke · 2014-10-25T00:16:47Z

This reduces the failing tests to 41 on Win64:

core.thread (SEGFAULT)
std.algorithm (Failed)
std.complex (Failed)
std.conv (Failed)
std.csv (Failed)
std.math (Failed)
std.numeric (Failed)
std.parallelism (SEGFAULT)
std.path (SEGFAULT)
std.process (Failed)
std.regex (SEGFAULT)
std.stdio (Failed)
std.stream (SEGFAULT)
std.string (SEGFAULT)
std.uni (SEGFAULT)
std.uri (SEGFAULT)
std.zlib (SEGFAULT)
std.net.isemail (SEGFAULT)
std.internal.math.errorfunction (Failed)
std.internal.math.gammafunction (Failed)
std.algorithm-debug (Failed)
std.complex-debug (Failed)
std.conv-debug (Failed)
std.csv-debug (Failed)
std.datetime-debug (SEGFAULT)
std.json-debug (Failed)
std.math-debug (Failed)
std.numeric-debug (Failed)
std.parallelism-debug (SEGFAULT)
std.path-debug (SEGFAULT)
std.process-debug (Failed)
std.regex-debug (SEGFAULT)
std.socket-debug (Failed)
std.stdio-debug (Failed)
std.stream-debug (SEGFAULT)
std.uni-debug (SEGFAULT)
std.uri-debug (SEGFAULT)
std.zlib-debug (SEGFAULT)
std.net.isemail-debug (SEGFAULT)
std.internal.math.errorfunction-debug (Failed)
std.internal.math.gammafunction-debug (Failed)

kinke · 2014-10-25T14:08:55Z

extern(D) doesn't seem to be too way off. This:

import core.vararg;
import core.stdc.stdio;

extern(C) void logC(const(char)* str, ...)
{
    va_list args;
    va_start(args, str);
    vprintf(str, args);
    va_end(args);
}

void logD(const(char)* str, ...)
{
    va_list args;
    va_start(args, str);
    assert(args == _argptr);

    foreach (ti; _arguments)
        printf("  %s [size %d]\n", ti.toString().ptr, ti.tsize);
    vprintf(str, _argptr);
}

void main()
{
    immutable fl = 3.0f;
    logC  ("logC:   %d %f %f %lld %f %s\n", 1, 2.0, fl, 0x4_0000_0000L, fl + 2.0f, "lala".ptr);
    logD  ("logD:   %d %f %f %lld %f %s\n", 1, 2.0, fl, 0x4_0000_0000L, fl + 2.0f, "lala".ptr);
    printf("printf: %d %f %f %lld %f %s\n", 1, 2.0, fl, 0x4_0000_0000L, fl + 2.0f, "lala".ptr);
}

yields:

logC:   1 2.000000 3.000000 17179869184 5.000000 lala
  int [size 4]
  double [size 8]
  immutable(float) [size 4]
  long [size 8]
  float [size 4]
  immutable(char)* [size 8]
logD:   1 2.000000 0.000000 17179869184 0.000000 lala
printf: 1 2.000000 3.000000 17179869184 5.000000 lala

So there's a problem with floats for extern(D). It looks as if floats are never passed directly for varargs and converted to double instead. E.g., there's no printf type specifier distinguishing between float and double (only a l prefix for longdouble). This explains the produced .ll output:

define void @_D5hello4logDFPxaYv({ i64, %object.TypeInfo** } %._arguments, i8* noalias nocapture %._argptr, i8* %str_arg) #0 {
    ...
}

define i32 @_Dmain({ i64, { i64, i8* }* } %unnamed) #0 {
  %fl = alloca float, align 4
  %_argptr_storage = alloca { i64, double, i64, i64, i64, i8* }
  store float 3.000000e+00, float* %fl
  %1 = load float* %fl
  call void (i8*, ...)* @logC(i8* getelementptr inbounds ([29 x i8]* @.str2, i32 0, i32 0), i32 1, double 2.000000e+00, double 3.000000e+00, i64 17179869184, double 5.000000e+00, i8* getelementptr inbounds ([5 x i8]* @.str3, i32 0, i32 0))
  %2 = getelementptr { i64, double, i64, i64, i64, i8* }* %_argptr_storage, i32 0, i32 0
  %3 = bitcast i64* %2 to i32*
  store i32 1, i32* %3
  %4 = getelementptr { i64, double, i64, i64, i64, i8* }* %_argptr_storage, i32 0, i32 1
  store double 2.000000e+00, double* %4
  %5 = getelementptr { i64, double, i64, i64, i64, i8* }* %_argptr_storage, i32 0, i32 2
  %6 = bitcast i64* %5 to float*
  store float 3.000000e+00, float* %6
  %7 = getelementptr { i64, double, i64, i64, i64, i8* }* %_argptr_storage, i32 0, i32 3
  store i64 17179869184, i64* %7
  %8 = getelementptr { i64, double, i64, i64, i64, i8* }* %_argptr_storage, i32 0, i32 4
  %9 = bitcast i64* %8 to float*
  store float 5.000000e+00, float* %9
  %10 = getelementptr { i64, double, i64, i64, i64, i8* }* %_argptr_storage, i32 0, i32 5
  store i8* getelementptr inbounds ([5 x i8]* @.str4, i32 0, i32 0), i8** %10
  %11 = load { i64, %object.TypeInfo** }* @._arguments.array
  %12 = bitcast { i64, double, i64, i64, i64, i8* }* %_argptr_storage to i8*
  call void @_D5hello4logDFPxaYv({ i64, %object.TypeInfo** } %11, i8* noalias nocapture %12, i8* getelementptr inbounds ([28 x i8]* @.str5, i32 0, i32 0))
  %tmp = call i32 (i8*, ...)* @printf(i8* getelementptr inbounds ([29 x i8]* @.str6, i32 0, i32 0), i32 1, double 2.000000e+00, double 3.000000e+00, i64 17179869184, double 5.000000e+00, i8* getelementptr inbounds ([5 x i8]* @.str7, i32 0, i32 0))
  ret i32 0
}

i.e., the 2 floats are passed to logC() as double literals while they are stored in a 64-bit slot for the _argptr_storage struct without converting to double before. If we want C-ABI compliance in this case, we'd have to convert all floats to double, involving the TypeInfo[] _arguments parameter as well.

The next thing to be tested are structs/static arrays > 64 bits passed to a vararg extern(D) function. The core.stdc.stdarg.va_arg() functions assume the struct is passed byval (pointer to hidden copy) as that's what's done for the C ABI. I fear putting one into the _argptr_storage struct for extern(D) currently means that either the struct is allocated directly in there or that a pointer to it (the original struct, not a hidden copy => byref passing, ldc-developers/ldc#172) is stored.

kinke · 2014-10-25T15:02:07Z

structs are currently stored directly inside the _argptr_storage struct, so the va_arg() functions fail horribly for bigger ones. This can be tested by inspecting the .ll output of:

import core.vararg;
import core.stdc.stdio;

struct Small { long a; }
struct Big { long a, b; }

void variadic(T)(...)
{
    foreach (ti; _arguments)
        printf("  %s [size %d]\n", ti.toString().ptr, ti.tsize);
    static if (!is(T == struct))
    {
        auto format = "%s " ~ (__traits(isFloating, T) ? "%f" : "%d") ~ "\n";
        printf(format.ptr, typeid(T).toString().ptr, va_arg!T(_argptr));
    }
}

void main()
{    
    variadic!int(1);
    variadic!double(2.0);
    variadic!float(3.0f);

    Small small = { 4L };
    variadic!Small(small);

    Big big = { 5L, 6L };
    variadic!Big(big);
}

At least for x86_64, it seems it would be much easier and cleaner if we declared a variadic D function as a normal LLVM (C) variadic function, except for the injected TypeInfo[] parameter, and so got rid of the _argptr_storage struct. The _argptr pointer would be allocated and initialized to what the LLVM va_start intrinsic returns at the beginning of the variadic D function body. Argument transformations (byval struct passing, cast to int...) would need to be moved to the call sites (at least for the optional ones), fixing ldc-developers/ldc#172 at the same time.

I don't know how compatible other platforms would be to such an approach, but iirc, @redstar mentioned that the PowerPC vararg ABI is similar.

dnadlinger · 2014-10-25T15:56:01Z

At least for x86_64, it seems it would be much easier and cleaner if we declared a variadic D function as a normal LLVM (C) variadic function

Yes, that's been my plan/suggestion all along. Sorry if I was ambiguous.

kinke · 2014-10-25T16:34:40Z

For all platforms? Or are there any legacy considerations, e.g., for 32-bit x86?

kinke · 2014-10-26T20:18:56Z

Edited title and description to reflect the LDC changes in ldc-developers/ldc#768.

kinke · 2014-10-27T00:08:56Z

Added a generic experimental version of both files, currently lacking x86_64 support for non-Windows.

dnadlinger · 2014-11-16T19:33:45Z

src/core/stdc/stdarg.d

+    // struct passed by reference. We define va_list as a raw pointer
+    // (to the actual struct) for the byref semantics and allocate
+    // the struct in LDC's va_start and va_copy intrinsics.
+    alias char* va_list;


I'd rather actually make this __va_list* or something so that code that assumes x86-style variadics just doesn't compile instead of crashing horribly at runtime.

Hmm I see the point, but there's a problem with the hidden argptr. It is currently defined as Type::tvalist, i.e., char, see gen/target.cpp:81. If we change it to a hardcoded core.vararg.__va_list_ for System V, I guess the user must import core.vararg when implementing a variadic extern(D) function, even when not accessing _argptr at all.

Okay, I see the problem. We should probably fix this at some point by moving __va_list into object and properly initializing Type::tvalist, but that can wait for now.

System V AMD64 ABI still unsupported because I can't experiment with it.

LLVM's intrinsic apparently doesn't work for aggregate types.

Fix vararg ABI

kinke force-pushed the vararg branch from 7c70d65 to 472c982 Compare October 24, 2014 17:09

kinke mentioned this pull request Oct 25, 2014

Win64 failing unittests ldc-developers/ldc#758

Closed

kinke force-pushed the vararg branch from 472c982 to fe8e8a9 Compare October 26, 2014 15:08

kinke mentioned this pull request Oct 26, 2014

Varargs fix & ABI refactoring ldc-developers/ldc#768

Merged

kinke changed the title ~~Win64: fix vararg ABI for extern(C)~~ Win64: fix vararg ABI Oct 26, 2014

kinke force-pushed the vararg branch from 7bd091f to 97f0f95 Compare November 16, 2014 10:44

kinke changed the title ~~Win64: fix vararg ABI~~ Fix vararg ABI Nov 16, 2014

dnadlinger reviewed Nov 16, 2014
View reviewed changes

kinke force-pushed the vararg branch from a8d8d7e to db9322a Compare November 22, 2014 10:30

kinke force-pushed the vararg branch from db9322a to 385cbe4 Compare January 17, 2015 23:15

kinke mentioned this pull request Feb 16, 2015

Adapt to druntime changes concerning core.stdc.stdarg.va_list. ldc-developers/dmd-testsuite#3

Merged

kinke force-pushed the vararg branch from 385cbe4 to 6784cb5 Compare February 22, 2015 17:10

kinke added 5 commits February 23, 2015 21:45

Win64: fix vararg ABI for extern(C)

a0a4e3d

Experimental generic vararg ABI fix & clean-up.

a7cf630

System V AMD64 ABI still unsupported because I can't experiment with it.

core.stdc.stdarg: re-add support for System V AMD64 ABI.

6028120

System V AMD64 fix: LLVM's va_arg intrinsic isn't fully usable yet.

a8a7cd6

core.stdc.stdarg: fix va_arg() for x86 platform.

78b0712

LLVM's intrinsic apparently doesn't work for aggregate types.

kinke force-pushed the vararg branch from 6784cb5 to 78b0712 Compare February 23, 2015 20:46

redstar added a commit that referenced this pull request Feb 24, 2015

Merge pull request #14 from kinke/vararg

201726a

Fix vararg ABI

redstar merged commit 201726a into ldc-developers:ldc Feb 24, 2015

This was referenced Feb 25, 2015

core.stdc.stdarg doesn't work on 64 bits ldc-developers/ldc#73

Closed

Vararg functions does not compile on x86_64 ldc-developers/ldc#702

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix vararg ABI #14

Fix vararg ABI #14

Uh oh!

kinke commented Oct 24, 2014

Uh oh!

kinke commented Oct 25, 2014

Uh oh!

kinke commented Oct 25, 2014

Uh oh!

kinke commented Oct 25, 2014

Uh oh!

dnadlinger commented Oct 25, 2014

Uh oh!

kinke commented Oct 25, 2014

Uh oh!

kinke commented Oct 26, 2014

Uh oh!

kinke commented Oct 27, 2014

Uh oh!

dnadlinger Nov 16, 2014

Uh oh!

kinke Nov 16, 2014

Uh oh!

dnadlinger Nov 16, 2014

Uh oh!

Uh oh!

Fix vararg ABI #14

Fix vararg ABI #14

Uh oh!

Conversation

kinke commented Oct 24, 2014

Uh oh!

kinke commented Oct 25, 2014

Uh oh!

kinke commented Oct 25, 2014

Uh oh!

kinke commented Oct 25, 2014

Uh oh!

dnadlinger commented Oct 25, 2014

Uh oh!

kinke commented Oct 25, 2014

Uh oh!

kinke commented Oct 26, 2014

Uh oh!

kinke commented Oct 27, 2014

Uh oh!

dnadlinger Nov 16, 2014

Choose a reason for hiding this comment

Uh oh!

kinke Nov 16, 2014

Choose a reason for hiding this comment

Uh oh!

dnadlinger Nov 16, 2014

Choose a reason for hiding this comment

Uh oh!

Uh oh!