The future of TaintedString #24

dom96 · 2017-11-28T16:09:34Z

Currently TaintedString is implemented somewhat inconsistently throughout the stdlib. The question is what should we do about that?

There are a number of options:

Remove TaintedString and tainted mode from the language
Go through the stdlib and use TaintedString consistently anywhere
- This would be very disruptive and I personally don't think it's a good idea.
(Your other option here, please suggest it)

What does everyone think?

andreaferretti · 2017-11-28T16:41:01Z

Ideally 2, but if it is too much work I guess 1 is the only option.

Actually, I don't think this should be too much work. Pure string procs can stay as they are, and be borrowed when needed. The only thing to pay attention to is that the IO routines return TaintedString instead of string - there should not be many of those

Araq · 2017-11-28T17:01:30Z

I'm torn. It's a nice feature. It never found any real bugs.

ghost · 2017-11-28T19:44:19Z

Usage of ReadIoEffect is as inconsistent in the stdlib as TaintedString. This may be problematic as their use often overlaps. For example, memfiles uses TaintedString but there's no sign of any effects tracking there.

For consistency, all stdlib I/O operations should have effects tracking tagged on their procs anyway. Perhaps the goal of a taint mode or any sort of extra-nagging about dangerous I/O by the compiler is best expressed as opt-in of existing effects tracking rather than a distinct type.

Anyway you slice it, I/O needs to be consistently tagged in the stdlib irrespective.

Araq · 2017-11-28T20:04:50Z

For example, memfiles uses TaintedString but there's no sign of any effects tracking there.

The pointer can be used for reading or writing, what effect should it be? Pointers don't have effects, procs do.

andreaferretti · 2017-11-29T09:06:20Z

Well, then procs such as memfiles.open should have IO effects

Araq · 2017-11-29T09:50:06Z

What I said about TaintedString applies to every tags effect:

It's a nice feature. It never found any real bugs.

dom96 · 2017-11-29T10:38:50Z

It's a nice feature. It never found any real bugs.

I would argue it's because we haven't properly adopted these effects. I've always been waiting for this feature to be finished before starting to use it.

Araq · 2017-11-29T13:58:05Z

What does it mean to be finished?

saem · 2020-12-24T10:31:58Z

Commenting here because I saw a semi-recently updated PR about deprecating it. I really like the idea of TaintedString, it would be a shame to see them go (depends). With Nim I've not been writing web apps or lots of untrusted input handing, nor have I had to go through heavy technical security audit. I do know taint tracking would help in those cases and have used various tools to achieve it, but haven't directly experienced it with Nim.

If I get to suggest an alternative option, we can already signal hints, warnings, and errors at compile time. I think taint tracking could be thought of as a specific case of tagging a value's type and then optionally making compile time assertions about it -- don't mix with other strings or taint them as well. Then this analysis/mode can be turned on and off rather than simply taint mode.

Moreover, I don't think it's simply strings, being able to tag types (compile time state) indicating yes I have or haven't done something (tainted string input usually being validation/sanitization) is likely some of the completeness of the effects system people are after. Whatever my program is it usually has a major phase(s) where some invariant must hold, webapps examples:

escape input
http/url encode output, etc...
ensure tenant id comes from a trusted source

A bunch of this could be done by distinct types but the fact that you don't want to necessarily enforce this everywhere or wrap and unwrap when going in and out of 3rd party code is a selling point. Maybe this is all sufficiently facilitated through existing facilities, however.

Writing this post gave me the idea of seeing if I can start tracking effects of using nim/nimsuggest/nimble etc in the vscode extension and perhaps be able to answer the completeness question, because though it feels incomplete, I can't precisely say why either.

disruptek · 2020-12-27T02:07:57Z

TaintedString is good. Can we vote on keeping it and close this issue for good?

Araq · 2020-12-27T14:05:07Z

There is quite some overlap with dedicated types for SQL/HTML/etc generation which provides about the same security benefits and is much easier to use and doesn't introduce yet another compiler switch. So IMO we should accept this RFC and remove TaintedString.

mratsim · 2021-01-01T17:14:32Z

I like Tainted String but they shouldn't be a compiler switch.

At the very least we should have a tutorial on how to write a parser with security/IO effect in mind. Right now Tainted String feels forced and contrary to Haskell, there is no dozens of explainers on why IO monads are great.

A more advanced tutorial would be a networking app with different stages of validation modeled in the type system via distinct types. For example in Nimbus (https://github.com/status-im/nimbus-eth2/blob/644c17f/docs/block_validation_flow.md) we have the following security stages for blocks from the blockchain:

Untrusted: raw block from the network
SigVerified: the cryptographic signature is correct but we didn't check the block logic
TransitionVerified: the block logic is correct but we didn't check the cryptographic signature
Trusted: we checked both logic and cryptographic signature.

Araq · 2021-01-15T09:58:30Z

I'm using my BDFL powers this time, TaintedString should go. Every string is untrusted, use type LettersOnly = distinct string etc. for trusted strings. We did this for SQL, it worked well enough and is not yet another compiler switch that in practice cannot be used anyway as it defaults to off and so practically no library supports it.

timotheecour · 2021-01-16T04:21:10Z

closed via nim-lang/Nim#15423

nim-lang/RFCs#24

narimiran transferred this issue from nim-lang/Nim Jan 2, 2019

Sam647254 mentioned this issue Dec 26, 2019

streams cannot compile with taint mode enabled nim-lang/Nim#12968

Closed

metagn mentioned this issue Nov 21, 2020

Deprecate TaintedString #287

Closed

FedericoCeratto mentioned this issue Jan 1, 2021

Deprecate TaintedString nim-lang/Nim#15423

Merged

Araq added the Accepted RFC label Jan 15, 2021

timotheecour closed this as completed Jan 16, 2021

sid-code added a commit to sid-code/nmoo that referenced this issue Aug 9, 2023

TaintedString -> string

b082b98

nim-lang/RFCs#24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The future of TaintedString #24

The future of TaintedString #24

dom96 commented Nov 28, 2017

andreaferretti commented Nov 28, 2017

Araq commented Nov 28, 2017

ghost commented Nov 28, 2017

Araq commented Nov 28, 2017

andreaferretti commented Nov 29, 2017 •

edited

Loading

Araq commented Nov 29, 2017

dom96 commented Nov 29, 2017

Araq commented Nov 29, 2017

saem commented Dec 24, 2020

disruptek commented Dec 27, 2020

Araq commented Dec 27, 2020

mratsim commented Jan 1, 2021

Araq commented Jan 15, 2021

timotheecour commented Jan 16, 2021

The future of TaintedString #24

The future of TaintedString #24

Comments

dom96 commented Nov 28, 2017

andreaferretti commented Nov 28, 2017

Araq commented Nov 28, 2017

ghost commented Nov 28, 2017

Araq commented Nov 28, 2017

andreaferretti commented Nov 29, 2017 • edited Loading

Araq commented Nov 29, 2017

dom96 commented Nov 29, 2017

Araq commented Nov 29, 2017

saem commented Dec 24, 2020

disruptek commented Dec 27, 2020

Araq commented Dec 27, 2020

mratsim commented Jan 1, 2021

Araq commented Jan 15, 2021

timotheecour commented Jan 16, 2021

andreaferretti commented Nov 29, 2017 •

edited

Loading