NVPTX support for new asm! #72439

westernmagic · 2020-05-21T21:56:47Z

This PR implements the new asm! syntax for the nvptx64-nvidia-cuda target.

rust-highfive · 2020-05-21T21:56:49Z

Thanks for the pull request, and welcome! The Rust team is excited to review your changes, and you should hear from @davidtwco (or someone else) soon.

If any changes to this PR are deemed necessary, please add them as extra commits. This ensures that the reviewer can see what has changed since they last reviewed the code. Due to the way GitHub handles out-of-date commits, this should also make it reasonably obvious what issues have or haven't been addressed. Large or tricky changes may require several passes of review and changes.

Please see the contribution instructions for more information.

src/test/assembly/asm/nvptx-types.rs

src/librustc_target/asm/nvptx.rs

src/test/assembly/asm/nvptx-types.rs

Amanieu · 2020-05-21T22:38:05Z

You are missing an nvptx-modifiers.rs test which tests that register names are rendered properly in the output asm.

westernmagic · 2020-05-22T12:57:01Z

You are missing an nvptx-modifiers.rs test which tests that register names are rendered properly in the output asm.

As discussed on zulip, NVPTX does not support any modifiers, therefore no tests are needed.

src/test/assembly/asm/nvptx-types.rs

src/librustc_target/asm/nvptx.rs

src/librustc_target/asm/mod.rs

Amanieu · 2020-05-22T18:17:26Z

The code looks good!

Can you update the specification in src/doc/unstable-book/src/library-features/asm.md to include the new register classes?

westernmagic · 2020-05-23T00:01:24Z

This is false, I checked on NVPTX and values are not zero-extended. The upper bits are undefined. https://rust.godbolt.org/z/9yU64A

They are, implicitly:

A destination register wider than the specified type may be used. The value loaded is sign-extended to the destination register width for signed integers, and is zero-extended to the destination register width for unsigned and bit-size types. See Table 25 for a description of these relaxed type-checking rules.

https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-ld
https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#operand-size-exceeding-instruction-type-size

Since clang gives the registers bit sizes, they are all zero extended in this implementation.

Here's a small clang experiment as proof:

#include <cstdint>
#include <cstdio>

__global__ void foo(uint32_t x, uint64_t * y) {
	asm volatile (
		"mov.u64 %0, %1;"
		: "=l"(*y) : "l"(x)
	);
}

__host__ int main() {
	uint64_t y = UINT64_MAX;
	uint64_t * yd;
	cudaMalloc(&yd, sizeof(uint64_t));
	cudaMemcpy(yd, &y, sizeof(uint64_t), cudaMemcpyHostToDevice);
	foo<<<1, 1>>>(0xBEEF, yd);
	cudaMemcpy(&y, yd, sizeof(uint64_t), cudaMemcpyDeviceToHost);
	cudaFree(yd);
	printf("%lx\n", y);
}

If you wish, I can add the links I referenced to the documentation

Amanieu · 2020-05-23T00:15:55Z

Again, look at the assembly output from godbolt: https://rust.godbolt.org/z/-8yrgo

.visible .func  (.param .b64 func_retval0) foo(
        .param .b64 foo_param_0
)
{

        ld.param.u64    %rd2, [foo_param_0];
        mov.u64 %rd1, %rd2
        st.param.b64    [func_retval0+0], %rd1;
        ret;

}

The input value is not zero-extended anywhere in this code.

westernmagic · 2020-05-23T01:24:25Z

Ah, I see now. Looking at this though, the problem seems to be more complicated though: https://rust.godbolt.org/z/ojcbJV

Amanieu · 2020-05-23T01:27:11Z

This is the same as every other architecture: when you put a value into a register that is smaller than the register size, the upper bits are UNDEFINED. Since they are undefined, their exact value depends on what the optimizer decides to do.

src/doc/unstable-book/src/library-features/asm.md

src/test/assembly/asm/nvptx-types.rs

Amanieu · 2020-05-23T11:54:04Z

@bors r+

bors · 2020-05-23T11:54:05Z

📌 Commit 8706f76020e84863e8afc4e25dc014decc0f5a2f has been approved by Amanieu

bors · 2020-05-24T04:34:33Z

☔ The latest upstream changes (presumably #72516) made this pull request unmergeable. Please resolve the merge conflicts.

Co-authored-by: Amanieu d'Antras <amanieu@gmail.com>

Amanieu · 2020-05-25T15:11:19Z

@bors r+

bors · 2020-05-25T15:11:20Z

📌 Commit e18054d has been approved by Amanieu

@Amanieu

NVPTX support for new asm! This PR implements the new `asm!` syntax for the `nvptx64-nvidia-cuda` target. r? @Amanieu

@ghost

Rollup of 9 pull requests Successful merges: - rust-lang#67460 (Tweak impl signature mismatch errors involving `RegionKind::ReVar` lifetimes) - rust-lang#71095 (impl From<[T; N]> for Box<[T]>) - rust-lang#71500 (Make pointer offset methods/intrinsics const) - rust-lang#71804 (linker: Support `-static-pie` and `-static -shared`) - rust-lang#71862 (Implement RFC 2585: unsafe blocks in unsafe fn) - rust-lang#72103 (borrowck `DefId` -> `LocalDefId`) - rust-lang#72407 (Various minor improvements to Ipv6Addr::Display) - rust-lang#72413 (impl Step for char (make Range*<char> iterable)) - rust-lang#72439 (NVPTX support for new asm!) Failed merges: r? @ghost

rust-highfive assigned davidtwco May 21, 2020

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label May 21, 2020

Mark-Simulacrum assigned Amanieu and unassigned davidtwco May 21, 2020

nikic reviewed May 21, 2020

View reviewed changes

src/test/assembly/asm/nvptx-types.rs Outdated Show resolved Hide resolved

Amanieu requested changes May 21, 2020

View reviewed changes

src/test/assembly/asm/nvptx-types.rs Outdated Show resolved Hide resolved

src/librustc_target/asm/nvptx.rs Outdated Show resolved Hide resolved

src/librustc_target/asm/nvptx.rs Outdated Show resolved Hide resolved

src/test/assembly/asm/nvptx-types.rs Outdated Show resolved Hide resolved

Amanieu reviewed May 22, 2020

View reviewed changes

src/test/assembly/asm/nvptx-types.rs Outdated Show resolved Hide resolved

src/librustc_target/asm/nvptx.rs Outdated Show resolved Hide resolved

src/librustc_target/asm/mod.rs Outdated Show resolved Hide resolved

Amanieu reviewed May 23, 2020

View reviewed changes

src/doc/unstable-book/src/library-features/asm.md Outdated Show resolved Hide resolved

src/test/assembly/asm/nvptx-types.rs Outdated Show resolved Hide resolved

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels May 23, 2020

bors added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. labels May 24, 2020

westernmagic added 7 commits May 24, 2020 08:20

NVPTX support for new asm!

58fdc43

Formatted correctly

d77f73e

Minor fixes, as requested in PR review

baa801a

Deduplicated macro code

1070f08

Added comment on there being no predefined registers

6d74e09

Fixed tests

5ec6b5e

Updated documentation

ed559b3

westernmagic and others added 3 commits May 24, 2020 08:20

Corrected statement about zero-extension in docs.

70cd375

Update src/doc/unstable-book/src/library-features/asm.md

83a5cdf

Co-authored-by: Amanieu d'Antras <amanieu@gmail.com>

Added comment about static variables

e18054d

westernmagic force-pushed the master branch from 8706f76 to e18054d Compare May 24, 2020 07:16

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels May 25, 2020

RalfJung added a commit to RalfJung/rust that referenced this pull request May 29, 2020

Rollup merge of rust-lang#72439 - westernmagic:master, r=Amanieu

e31227a

NVPTX support for new asm! This PR implements the new `asm!` syntax for the `nvptx64-nvidia-cuda` target. r? @Amanieu

RalfJung mentioned this pull request May 29, 2020

Rollup of 8 pull requests #72737

Closed

RalfJung mentioned this pull request May 29, 2020

Rollup of 9 pull requests #72756

Merged

bors merged commit 3789455 into rust-lang:master May 30, 2020

Amanieu mentioned this pull request Sep 21, 2020

inline-asm rust-lang/lang-team#20

Closed

jieyouxu mentioned this pull request Dec 18, 2024

tests/assembly/asm: Remove uses of rustc_attrs and lang_items features by using minicore #134436

Merged

NVPTX support for new asm! #72439

NVPTX support for new asm! #72439

Uh oh!

Conversation

westernmagic commented May 21, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rust-highfive commented May 21, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Amanieu commented May 21, 2020

Uh oh!

westernmagic commented May 22, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Amanieu commented May 22, 2020

Uh oh!

westernmagic commented May 23, 2020

Uh oh!

Amanieu commented May 23, 2020

Uh oh!

westernmagic commented May 23, 2020

Uh oh!

Amanieu commented May 23, 2020

Uh oh!

Uh oh!

Uh oh!

Amanieu commented May 23, 2020

Uh oh!

bors commented May 23, 2020

Uh oh!

bors commented May 24, 2020

Uh oh!

Amanieu commented May 25, 2020

Uh oh!

bors commented May 25, 2020

Uh oh!

Uh oh!

westernmagic commented May 21, 2020 •

edited

Loading