Delegate address space allocation to the VM #306

AndrewScheidecker · 2015-08-20T00:12:45Z

Forked from the discussion on dynamically linked data segments (#302):
I think what @jfbastien said is about as good as you can do within the current memory model, but to me it seems inevitable that wasm must abandon the idea that a module has absolute control over its address space. I think a model closer to OS processes can be efficiently polyfilled and will more easily accommodate dynamic linking.

Some things we can take from OSes:

Processes as isolated address spaces.
Modules don't get their own address space, or any isolation from other modules running in the same process, or from the OS.
The OS manages the address space of the process, and the module must go through it for page-level allocations.

Some things we shouldn't take from OSes:

The idea of modules being loaded at static addresses.
You only have control over the direct dependencies of your module, making it easy to get conflicts in indirect dependencies. I think that can be solved by a process-wide dictionary from semantic names (e.g. libc v34) to URIs, but I don't want to derail this thread with that half-baked idea.

If we applied these ideas to WebAssembly:

We can define a wasm process with an address space isolated from other processes. Make minimal guarantees about the address space: maybe just a minimum size and a protected first page.
Define a platform API for allocating virtual address space and committing physical pages to it. mmap(pointer?,I64) -> pointer and munmap(pointer,I64) -> bool are all that's really needed.
The platform loader needs to cooperate with the map/unmap API to reserve address space for module data segments.

This could all be implemented efficiently in the polyfill as a page allocator for the asm.js linear memory. The polyfill could also require the application to provide an initial size for linear memory, a hint that would no longer be necessary for the wasm module itself.

I expect native implementations would reserve a fixed range of addresses for a wasm process, and generate memory access code using an immutable base and bounds check. However you do it, supporting a true 64-bit address space would likely require a separate OS process for each wasm process, which has its own implications for things like APIs to talk to the browser.

Thoughts?

jfbastien · 2015-08-20T01:00:48Z

Thanks for writing this up. I think you're asking questions we've tried to address before, but haven't resolved yet. I'll ask some questions and point at previous discussions to make sure we're on the same page. I'm not shooing you away! Just trying to speak the same language :-)

The idea of modules being loaded at static addresses.

Could you clarify what you mean? Is that the OS process' virtual address space (shared with the DOM and JS engine), or is that the view the wasm module has of its own address space (effectively an extra layer of virtual addressing).

You only have control over the direct dependencies of your module, making it easy to get conflicts in indirect dependencies. I think that can be solved by a process-wide dictionary from semantic names (e.g. libc v34) to URIs, but I don't want to derail this thread with that half-baked idea.

#53 and linked issues may be relevant.

Define a platform API for allocating virtual address space and committing physical pages to it. mmap(pointer?,I64) -> pointer and munmap(pointer,I64) -> bool are all that's really needed.

#227 and linked issues may be relevant.

supporting a true 64-bit address space would likely require a separate OS process for each wasm process, which has its own implications for things like APIs to talk to the browser.

Amen.

AndrewScheidecker · 2015-08-20T13:39:32Z

The idea of modules being loaded at static addresses.

Could you clarify what you mean? Is that the OS process' virtual address space (shared with the DOM and JS engine), or is that the view the wasm module has of its own address space (effectively an extra layer of virtual addressing).

I was referring to how you can give an ELF or PE module a preferred base address. It's useful for native code to avoid relocations and maximize page sharing between processes, but only applies to WebAssembly data segments. IMO those should be loaded at random addresses anyway, so there's no escaping the relocations.

Define a platform API for allocating virtual address space and committing physical pages to it. mmap(pointer?,I64) -> pointer and munmap(pointer,I64) -> bool are all that's really needed.

#227 and linked issues may be relevant.

I think the mmap and munmap stuff already proposed is good, although given that they no longer correspond to the overloaded semantics of POSIX mmap, they should perhaps be called something more specific, like commit_pages or decommit_pages.

In comparison to your proposal in the data segment linking thread (#302 (comment)), my proposal really only differs as the title suggests: by giving the VM the exclusive privilege of address space allocation, rather that allowing a toolchain/module to provide their own page allocator. The rest of the differences follow from that:

The "linear memory" accessed by wasm modules becomes an initially uncommitted linear address space.
The module must call the platform's mmap to commit pages in its address space before using them.
The wasm platform doesn't have syscalls for resizing the address space. The address space is just taken to be 32-bits or 64-bits, though mmap calls may fail to allocate more than a fraction of that address space.
Since the platform controls the allocation of the wasm module's address space, it can allocate pages for dynamically loaded data segments without going through an allocator provided by the wasm module.

Amen.

NaCl certainly proves a separate OS process is useful for efficient memory isolation. It looks like NaCl talks to the browser through asynchronous messages. Is that acceptable for WebAssembly? It would be possible to provide synchronous control flow between WASM and JS even if the WASM was running in a separate process, but IMO it would be preferable to provide an optional library for emulating that on top of a low-level interface that's fundamentally async.

kg · 2015-08-20T17:27:39Z

NaCl certainly proves a separate OS process is useful for efficient memory isolation. It looks like NaCl talks to the browser through asynchronous messages. Is that acceptable for WebAssembly? It would be possible to provide synchronous control flow between WASM and JS even if the WASM was running in a separate process, but IMO it would be preferable to provide an optional library for emulating that on top of a low-level interface that's fundamentally async.

For browser scenarios, synchronous re-entrant calls are a requirement, i.e. JS -> wasm -> JS -> wasm. You could do this via cross-process remoting, but it's questionable whether it would be robust or performant.

It's possible that over time the importance of those sync calls will be diminished because wasm will be able to access key APIs directly. In the MVP, though, you'll be bouncing out to JS for basically everything. In-process is also a (near-)requirement for direct DOM interaction, and that seems like something that is wanted for various use cases.

jfbastien · 2015-08-20T18:04:40Z

I think we have a small disconnect: we're designing WebAssembly with the assumption that it doesn't control the entire process' address space. It shares it with other code, and only gets a small contiguous section of virtual memory. The wasm program doesn't know what that virtual address is, it sees its base as zero, and knows its extent, but cannot access other addresses in the same process because it's highly untrusted and therefore needs to be sandboxed.

On 32-bits we further want to avoid exhausting virtual address space in the process, or causing excessive fragmentation.

The "linear memory" accessed by wasm modules becomes an initially uncommitted linear address space.

I think we agree on this: we don't want to force physical reservation if we can avoid it, we only force virtual address space allocation. wasm doesn't necessarily mandate this, but doesn't prevent VMs from doing so. We do mandate that memory be zero initialized, but that can be done lazily on commit.

The module must call the platform's mmap to commit pages in its address space before using them.

I think we agree here too?

The wasm platform doesn't have syscalls for resizing the address space. The address space is just taken to be 32-bits or 64-bits, though mmap calls may fail to allocate more than a fraction of that address space.

That's where we disagree, but that's because we can't waste virtual address space in 32-bit processes.

Since the platform controls the allocation of the wasm module's address space, it can allocate pages for dynamically loaded data segments without going through an allocator provided by the wasm module.

In general we're trying to use ideas from the extensible web manifesto: provide the lowest-level capabilities, build tooling on top of it, but let developers to something else that we didn't expect. In this case it'll be nice for the tooling to do ASLR by default, but some developers may want something else, or we may simply get it wrong! Some applications will want to get clever with memory allocation locations (e.g. asan).

I'm hoping my explanations of what we assume make sense. Maybe we're assuming the wrong things :-)

lukewagner · 2015-08-20T18:23:27Z

@AndrewScheidecker Just to add on to what @jfbastien already said, once one agrees on the contiguity requirement, then I think what you're proposing is equivalent to:

A wasm module declares a small minimum memory size (perhaps only enough to hold the .data section)
The impl of the library's mmap bumps the memory size as necessary. Since OSes only commit physical pages to virtual address space lazily, only what is touched has cost (increases RSS).
When mprotect is added, the mmap impl could be more aggressive and PROT_NONE any memory that isn't supposed to be user-visible. We could even provide an overload of resize_memory that takes protection flags to change the default state of memory (saving a few syscalls as an optimization). However, none of this would change the actual cost model, it'd just catch programmer bugs. PROT_NONE is neither necessary nor sufficient to minimize RSS.

You do make the good point that I'm often using "module" in a fuzzy way that blurs the distinction between static code and a loaded instance. When I'm careful, I try to say "module instance" :) (FWIW, ES6 has Module Records and Module Environment Records.) Since we've had this confusion before, perhaps it's worth saying "process" instead of "module instance" and/or adding a clarifying section to Modules.md.

AndrewScheidecker · 2015-08-21T19:12:22Z

I agree with the principal that the design should allow an implementation to embed a WASM address space in the browser process's address space. That's obviously necessary for the polyfilled MVP at least. I do want to make sure that it's practical for a browser implementation to execute a WASM process in a separate OS process, but that's not dependent on this issue.

@jfbastien That's where we disagree, but that's because we can't waste virtual address space in 32-bit processes.

I will concede that the peak address space hint needs to be part of WASM itself, rather than being demoted to a polyfill parameter as I suggested. I was assuming most browsers would want to put WASM processes in a separate OS process, but that looks impractical with the synchronous JS interop.

@jfbastien In general we're trying to use ideas from the extensible web manifesto: provide the lowest-level capabilities, build tooling on top of it, but let developers to something else that we didn't expect.

I think this makes sense, but it's a question of what's practically extensible. If you require the loader to ask the WASM process where to put a data segment, then it's fine to let the process do ASLR or whatever it wants to do. Making the loader call into the process is a huge can of worms (see DllMain), so I think that's an argument against doing it that way.

@lukewagner
1.A wasm module declares a small minimum memory size (perhaps only enough to hold the .data section)

To be explicit about the terms I use below:

OS-reserved: virtual pages that have been reserved so subsequent reservations will not use the same pages, but access to these pages will result in a page fault. equivalent to mmap(PROT_NONE) or VirtualAlloc(MEM_RESERVE).
WASM-reserved: like an OS-reservation, but implemented by the WASM platform to allocate pages from the contiguous address space exposed to WASM code. Distinct from OS-reservations to allow implementations to make an OS-reservation for the peak address space needed by a WASM process while still requiring the WASM process to reserve pages within that address space. Any WASM-reserved page is also OS-reserved.
Committed: virtual pages that can be accessed, whether they are backed by physical pages, locations in the page file, or will be assigned a freshly zeroed physical page on first access. equivalent to mmap(PROT_READWRITE) or VirtualAlloc(MEM_COMMIT). Any committed page is also WASM-reserved.

I think it makes sense for the WASM process to declare how its peak address space up front for the benefit of implementations where address space is a precious resource. Such implementations would then be able to OS-reserve a contiguous block of pages for that peak address space to ensure the WASM address space base and size are immutable. For the polyfill, the OS-reservation corresponds to a statically sized ArrayBuffer, and the page fault semantics could be ignored.

The platform would also WASM-reserve+commit some pages for the data segments of modules loaded to start the process, but otherwise when control is transferred to the WebAssembly process it shouldn't assume any more committed pages than that. To start making dynamic allocations, it must call mmap to commit additional pages.

So the module doesn't need to declare a minimum memory size, that's implicit in the size of the .data and .bss sections.

2.The impl of the library's mmap bumps the memory size as necessary. Since OSes only commit physical pages to virtual address space lazily, only what is touched has cost (increases RSS).

By library, do you mean something compiled into the WASM process, or something defined by the WASM platform? By "delegate address space allocation to the VM" I mean that the set of WASM-reserved pages is managed by the WASM platform rather than the WASM process.

Here I believe you're using commit to mean "has a physical page associated", in contrast to how I've been using it. I've been using it to include virtual pages backed by not only physical pages but also pagefile or lazily initialized zeroes; anything that is committed as far as a user-space process is concerned. I'm proposing that the WASM platform be allowed (but not required for at least the polyfill) to start your process out with most of the address space uncommitted in the sense that accessing it will result in an unhandled page fault.

3.When mprotect is added, the mmap impl could be more aggressive and PROT_NONE any memory that isn't supposed to be user-visible. We could even provide an overload of resize_memory that takes protection flags to change the default state of memory (saving a few syscalls as an optimization). However, none of this would change the actual cost model, it'd just catch programmer bugs. PROT_NONE is neither necessary nor sufficient to minimize RSS.

I'm not worried about reducing the amount of physical memory used, but rather making sure this will all work with dynamic linking, or asan as @jfbastien mentioned. Making the address space PROT_NONE by default isn't important beyond that an implementation should be allowed to do it, and WASM processes should go through a platform-level mmap to commit pages instead of expecting to be able to read and write anywhere in its address space.

jfbastien · 2015-08-21T20:16:19Z

I'm not worried about reducing the amount of physical memory used.

We're pretty worried about this because mobile platforms don't have that much memory to spare. These platforms otherwise have to resort to an OOM killer if they don't expose an API where developers can relinquish memory.

lukewagner · 2015-08-22T01:09:18Z

By library, do you mean something compiled into the WASM process

Yes, as @jfbastien explained.

I think it makes sense for the WASM process to declare how its peak address space up front for the
benefit of implementations where address space is a precious resource.

That is already the case.

I'm proposing that the WASM platform be allowed (but not required for at least the polyfill) to start
your process out with most of the address space uncommitted in the sense that accessing it will
result in an unhandled page fault.

That is already the case, assuming the module declares a small initial heap size.

lukewagner · 2015-10-23T18:23:05Z

There is now a lot of consensus around the current model of a contiguous address space, grow_memory, with protection/mapping modified by memory functions in the future. I'd like to close this issue since it's rather general, but feel free to open new issues for specific concrete modifications to the current model in AstSemantics.md.

lukewagner mentioned this issue Aug 20, 2015

How do data segments interact with dynamic linking? #302

Closed

AndrewScheidecker mentioned this issue Aug 26, 2015

Idea for getting rid of extra indirection level for calls through virtual function table #312

Closed

dschuff added the tables / dynamic linking label Aug 26, 2015

jfbastien mentioned this issue Sep 8, 2015

Negotiated heap size and methods of resizing the heap. #331

Closed

AndrewScheidecker mentioned this issue Sep 9, 2015

Pass a start-of-usable-memory to the module. #334

Closed

lukewagner closed this as completed Oct 23, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Delegate address space allocation to the VM #306

Delegate address space allocation to the VM #306

AndrewScheidecker commented Aug 20, 2015

jfbastien commented Aug 20, 2015

AndrewScheidecker commented Aug 20, 2015

kg commented Aug 20, 2015

jfbastien commented Aug 20, 2015

lukewagner commented Aug 20, 2015

AndrewScheidecker commented Aug 21, 2015

jfbastien commented Aug 21, 2015

lukewagner commented Aug 22, 2015

lukewagner commented Oct 23, 2015

Delegate address space allocation to the VM #306

Delegate address space allocation to the VM #306

Comments

AndrewScheidecker commented Aug 20, 2015

jfbastien commented Aug 20, 2015

AndrewScheidecker commented Aug 20, 2015

kg commented Aug 20, 2015

jfbastien commented Aug 20, 2015

lukewagner commented Aug 20, 2015

AndrewScheidecker commented Aug 21, 2015

jfbastien commented Aug 21, 2015

lukewagner commented Aug 22, 2015

lukewagner commented Oct 23, 2015