Specify UTF-8 as the character set #17

@sunfishcode

Continuing the discussion from bytecodealliance/wasmtime#86:

WASI currently doesn't document the character sets used for filesystem paths, command-line arguments or environment variables.

Two high-level strategies have been proposed:

  • Just use UTF-8, and say that WASI can't directly interact with non-UTF-8-encodable strings from the outside world. Where needed, provide escape-hatch features in the API (e.g., you can't open a file with an unencodable name by name, but you can get to it by iterating through a directory).
  • Use uninterpreted byte sequences, and then additional functions for translating to and from UTF-8, as described here.
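
To make the difference between the two strategies concrete, here is a minimal sketch in Python. It is not WASI API: the helper `to_utf8_or_none` and the sample names are hypothetical, chosen only to show how a name that is not valid UTF-8 behaves under each approach.

```python
from typing import Optional


def to_utf8_or_none(raw: bytes) -> Optional[str]:
    """Hypothetical helper: strictly decode a host name as UTF-8.

    Invalid byte sequences are rejected rather than replaced, so a
    name either round-trips exactly or is reported as unencodable.
    """
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return None


# Hypothetical host-side file names. The second contains 0xFF,
# a byte that never appears in well-formed UTF-8.
names = [b"caf\xc3\xa9.txt", b"bad\xff.txt"]

# Strategy 1: UTF-8 only. The second name has no string form, so it
# cannot be opened by name and needs an escape hatch such as
# directory iteration.
utf8_view = [to_utf8_or_none(n) for n in names]

# Strategy 2: uninterpreted bytes. Every name passes through as-is,
# and translation to UTF-8 happens only when the caller asks for it.
byte_view = list(names)
translated = [to_utf8_or_none(n) for n in byte_view]
```

Under the first strategy the invalid name is simply absent from the string-level view; under the second it remains addressable as bytes, and only the explicit translation step can fail.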
